When we’re speaking, we have an overwhelming compulsion to fill ‘dead air’ with words. So, we tend to inject lots of ‘ums’ and ‘uhs’.
I noticed that YouTube automatically generates a transcript from videos. So, I saved a copy of the transcript from the recent lunch-n-learn I did for Smart Data, to see how I did.
I threw together a quick parser:
#!/usr/bin/python3
def word_check(line_to_check, word_to_check):
= 0
current_count
if line_to_check.startswith(f"{word_to_check} "):
= current_count + 1
current_count if line_to_check.endswith(f" {word_to_check}"):
= current_count + 1
current_count = current_count + line_to_check.count(f" {word_to_check} ")
current_count
return current_count
if __name__ == "__main__":
= 0
um_count = 0
uh_count
with open("video_transcript.txt") as f:
= f.readlines()
contents
for line in contents:
= line.strip()
line
= um_count + word_check(line, "um")
um_count
= uh_count + word_check(line, "uh")
uh_count
print(f"You said 'um' {um_count} times")
print(f"You said 'uh' {uh_count} times")
Then, I ran it against the transcript of my 90 minute talk. Bear in mind, I was pretty confident that I do a decent job of minding my ‘uhs’, and ‘ums’.
My results?
You said 'um' 31 times You said 'uh' 165 times
Apparently, perception is not always reality. Who knew?