Humor in Multilingual Digital Assistants
By: Mary Tomasso
The sense of humor is a characteristically human trait, although apparently not an exclusively human one, and quite a complex one. As Paul McDonald states in The Philosophy of Humour: “The fact that even a simple joke uses simultaneously language skills, theory-of-mind, symbolism, abstract thinking, and social perception, makes humor arguably the most complex cognitive attribute humankind may have.”
It shouldn’t come as a surprise, then, that humans expect to encounter humor, and to have it understood, even when interacting with virtual beings.
“Virtual being” is a broad and rather new term that refers to a character that doesn’t exist in reality but can interact with humans through digital means. An example would be Mica, LEAP’s virtual being in augmented reality, which can communicate with a viewer through the Magic Leap, the company's augmented reality glasses.
This kind of interaction is not yet mainstream, because the technology is not as accessible as, for instance, an Amazon Echo device. In this article we will therefore discuss humor in digital assistants and conversational agents.
Examples of digital assistants, or intelligent personal assistants, are Siri, Cortana, Alexa, and Google Assistant.
A conversational agent is any dialogue system that uses NLU (natural language understanding) and NLP (natural language processing) to maintain human-like conversations.
In order to understand how humor works in these contexts, first we have to understand what humor is.
What is humor?
According to the dictionary, humor is the quality in something that makes it funny; the ability to laugh at things that are funny. Similarly, the sense of humor is the ability to see the funny side of life.
Does this explain what humor is? The truth is, humans have tried to explain and theorize humor since antiquity.
Some of the most common theories on humor are: superiority theory, relief theory, and incongruity theory.
The superiority theory dates back to Plato and Aristotle and postulates that people find humor in, and laugh at, earlier versions of themselves and the misfortunes of others, because this makes them feel superior. That would probably explain why we laugh at someone stumbling, or falling.
The relief theory, associated with Herbert Spencer and Sigmund Freud, posits that laughter is a homeostatic mechanism that allows people to relieve “nervous energy.” This explains why jokes on taboo topics can make us laugh: the energy invested in suppressing “inappropriate” emotions is released as laughter.
The incongruity theory, which arose in the 18th century, states that people laugh when they find an incongruity between expectations and reality, something that “violates our mental patterns and expectations.”
Humor can be conveyed by words, images or actions, but we will of course focus exclusively on verbally expressed humor.
Humor in translation
Translating verbally expressed humor is an extremely hard task. The main constraint is that the same concept has to be conveyed in different languages and for different cultures. Prof. Delia Chiaro has treated the subject extensively in the Primer of Humor Research.
Prof. Chiaro explains that when translating puns, which are notoriously untranslatable, the translator must reach some sort of compromise: “As long as the TT serves the same function as the ST, it is of little importance if the TT has to depart somewhat in formal terms from the original. Some feature of the ST is lost in exchange for a gain in the TL.” (Here ST and TT stand for source text and target text, and TL for target language.)
How much more complicated does it become when humor involves human-computer interaction?
Humor in digital assistants
Humor in human-computer interaction is studied by computational humor, a relatively new branch of computational linguistics.
There are two main reasons to use humor in virtual agents: to engage and entertain users, and to mitigate performance limitations. If the virtual agent doesn’t understand a request, a bit of humor might help the user feel less frustrated.
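The second reason, softening a failure to understand, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `NluResult`, `FALLBACK_LINES`, `PLAIN_FALLBACK`, `respond`, and the 0.5 confidence threshold are all hypothetical names and values, not part of any real assistant's API. Each locale has its own hand-written lines instead of translated ones, and the agent falls back to a plain apology when no humorous line exists for that locale.

```python
import random
from dataclasses import dataclass
from typing import Optional


# Hypothetical NLU output: an intent label plus a confidence score in [0, 1].
@dataclass
class NluResult:
    intent: str
    confidence: float


# Hypothetical, hand-written fallback lines, written separately per locale
# rather than translated from one another.
FALLBACK_LINES = {
    "en": [
        "I didn't catch that. My ears are only virtual, after all.",
        "That one went over my head, and I don't even have one.",
    ],
    "it": [
        "Non ho capito. Colpa mia: ho la testa tra le nuvole, anzi nel cloud.",
    ],
}

# Neutral message used when no humorous line exists for the locale.
PLAIN_FALLBACK = "Sorry, I didn't understand that."


def respond(result: NluResult, locale: str = "en", threshold: float = 0.5,
            rng: Optional[random.Random] = None) -> str:
    """Answer normally when confident, otherwise use a light-hearted fallback."""
    if result.confidence >= threshold:
        # Placeholder for the real intent handler.
        return f"Handling intent: {result.intent}"
    lines = FALLBACK_LINES.get(locale)
    if not lines:
        return PLAIN_FALLBACK
    chooser = rng if rng is not None else random
    return chooser.choice(lines)
```

Calling `respond(NluResult("unknown", 0.2), "it")` returns the Italian line, while an unsupported locale such as `"fr"` gets the plain apology. Whether such lines actually reduce frustration is something only user testing in each language can tell.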
Many experiments have been conducted on this topic. Brent Rose, writer and host of the successful WIRED video series Out of Office with Brent Rose, performed stand-up comedy using only jokes from Siri, Alexa, Cortana and Google Home. The results showed that Cortana and Siri were the funniest of the four assistants.
The video is definitely worth watching: I cannot imagine a better way to demonstrate how bad most of the jokes told by digital assistants are. I think it also showed that humor is not only a matter of written text. The jokes are written by humans, not by AI, so if the text alone were enough they would be funny, yet most of them aren’t.
Another study worth mentioning is the survey Humor in Human-Computer Interaction: A Short Survey. According to the survey “users rated significantly better the system that gave humorous comments in task-oriented interactions and overall an improved perception of systems qualities”.
Humor gives the digital assistant a human touch and creates a more likable experience for users overall. Digital assistants with a sense of humor stand a better chance of being liked, as they are perceived as more human.
The survey also found that social conversations increased up to 50% when a virtual agent used jokes in interaction with human users.
Humor is important in human-human interaction as well as in human-computer interaction. Studies of humor in human-computer interaction show that it has beneficial effects similar to those observed in human-human interaction.
Certainly one of the biggest problems is the unavailability of intelligent content frameworks in all languages and the lack of a multilingual dataset for humor. There are some interesting projects though, such as UR-FUNNY: A Multimodal Language Dataset for Understanding Humor, which aims at understanding humor in a multimodal manner, “through the usage of words (text), gestures (vision) and prosodic cues (acoustic).”
As far as voice is concerned, a big constraint is that automatic speech recognition (ASR) often struggles to recognize accents, dialects, slang, unclear speech, and so on. This makes it very hard to have a pleasant conversation with a digital assistant and often leads to frustration.
What are the tools of the trade? What we should be looking for are more open datasets for humor detection, available in more languages.
Two of the most promising projects I’ve found are:
- Leyzer: A Dataset for Multilingual Virtual Assistants, “designed to study multilingual and cross-lingual NLU models and localization strategies in VAs”. The paper was published in 2020, and although its results seem unsatisfactory as far as intent accuracy is concerned, it may set a baseline for further work. Hopefully the dataset will be extended to more languages in the future. The Leyzer dataset, the translation memories, and the detailed experiment results are available here.
- Large Dataset and Language Model Fun-Tuning for Humor Recognition, by Vladislav Blinov, Valeriia Bolotova-Baranova, and Pavel Braslavski. The authors have created a publicly available dataset for humor recognition in Russian that consists of more than 300,000 short texts in total (only half of them being funny). They found that most of the available humor-related datasets are in English only, are relatively small, and focus primarily on puns, neglecting other forms of humor.
They implemented a humor detection method based on universal language model fine-tuning (ULMFiT). This method is purely data-driven and has proven to generalize well, yielding overall positive results. The authors plan to reproduce the experiment with English data, and hopefully there will be more similar projects in the future. The dataset is available here, whereas the paper can be consulted here.
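To make the task concrete: humor recognition as described above is binary text classification, where each short text is labeled funny or not funny. The sketch below is not the authors' method and does not use language model fine-tuning. It swaps in a much simpler technique, a bag-of-words naive Bayes classifier with add-one smoothing, written with the Python standard library only. The class name `NaiveBayesHumorClassifier` and the four training sentences are invented for illustration, and a toy set this small says nothing about real-world accuracy.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


class NaiveBayesHumorClassifier:
    """Bag-of-words naive Bayes with add-one smoothing (1 = funny, 0 = not)."""

    def __init__(self):
        self.word_counts = {0: Counter(), 1: Counter()}
        self.doc_counts = Counter()
        self.vocab = set()

    def fit(self, texts, labels):
        for text, label in zip(texts, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.doc_counts[label] += 1
            self.vocab.update(tokens)
        return self

    def _log_score(self, tokens, label):
        # Log prior of the class plus smoothed log likelihood of each token.
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[label] / total_docs)
        total_words = sum(self.word_counts[label].values())
        for token in tokens:
            count = self.word_counts[label][token] + 1
            score += math.log(count / (total_words + len(self.vocab)))
        return score

    def predict(self, text):
        # Words never seen in training carry no evidence, so they are dropped.
        tokens = [t for t in tokenize(text) if t in self.vocab]
        return max((0, 1), key=lambda label: self._log_score(tokens, label))


# Invented toy data, for illustration only.
texts = [
    "Why did the chicken cross the road? To get to the other side.",
    "I told my computer a joke about UDP, but it did not get it.",
    "The meeting starts at nine in the conference room.",
    "The train to the airport leaves every thirty minutes.",
]
labels = [1, 1, 0, 0]

clf = NaiveBayesHumorClassifier().fit(texts, labels)
print(clf.predict("a joke about the chicken"))
print(clf.predict("the train leaves at nine"))
```

A model like this only counts surface words, which is exactly why large datasets and pretrained language models matter for humor: the same words can be funny or flat depending on context, and word counts in one language tell us nothing about another.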
We have explained what humor is in theory and shown some examples of applications in human-computer interaction, specifically in digital assistants.
However, humor is hardly something that can be explained in theory, which is why it is so difficult to reproduce. Two different people might tell the same joke, yet one might be funnier than the other.
In my experience, the funniest conversational agents, or digital beings, are those that have a strong personality. These characters don’t just tell ready-made jokes, but are scripted to respond with natural, human humor. One example is AnnA the bot. She’s bright and witty and will make you laugh.
While we completely rely on the availability of large multilingual datasets in order to create humor in multilingual digital assistants, we can encourage the creation of small, local, monolingual digital assistants with humor, which might in the future contribute to the creation of larger datasets and frameworks.