Gender-Ambiguous Voice Agents for Mitigating Gender Bias in Voice User Interfaces in the Context of Autonomous Driving
Resumo
Lately, Voice User Interfaces (VUIs) have been widely applied in smart devices. However, the current design of the voice agents used by these systems have resulted in harmful gender bias during this human-computer interaction. To reduce this bias, gender ambiguous voice agents have been considered as a potential solution. This exploratory study aims to investigate the effects of user gender and the application of gender ambiguous voice agents in VUIs. An online experiment (N=50) investigates the perception of four usability factors in the context of multiple automated driving scenarios. Users are presented with two types of gender ambiguous voice agents and two types of wording for each message to evaluate each scenario. Our findings reveal that using gender ambiguous voice agents in VUIs significantly closes the gap in scores between user genders and that the user’s gender has no significant effect on the perception of usability factors.
Palavras-chave
Texto completo:
PDFReferências
ANDREEVA, B.; DEMENKO, G.; WOLSKA, M.; MÖBIUS, B.; ZIMMERER, F.; JÜGLER, J.; et al. Comparison of pitch range and pitch variation in Slavic and Germanic languages. In: Proceedings to the 7th Speech Prosody Conference. Trinity College Dublin, Ireland. May 20-23, 2014. International Speech Communication Association, 2014. p. 776-780.
ARD, A. J. Amazon, can we have our name back? The Washington Post, 2021. Available in: https://www.washingtonpost.com/technology/interactive/2021/people-named-alexaname-change-amazon/. Accessed on 04 August 2024.
BALDWIN, C. L. Verbal collision avoidance messages during simulated driving: perceived urgency, alerting effectiveness and annoyance. Ergonomics, v. 54, n. 4, p. 328-337, 2011.
BAZILINSKYY, P.; DE WINTER, J. C. Analyzing crowdsourced ratings of speech-based take-over requests for automated driving. Applied ergonomics, v. 64, p. 56-64, 2017.
BEATTIE, D.; BAILLIE, L.; HALVEY, M. A comparison of artificial driving sounds for automated vehicles. In: Proceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing. 2015. p. 451-462.
BENT, T.; BRADLOW, A. R. The interlanguage speech intelligibility benefit. The Journal of the Acoustical Society of America, v. 114, n. 3, p. 1600-1610, 2003.
BENTLEY, F.; LUVOGT, C.; SILVERMAN, M.; WIRASINGHE, R.; WHITE, B.; LOTTRIDGE, D. Understanding the long-term use of smart speaker assistants. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, v. 2, n. 3, p. 1-24, 2018.
BURT, J. L.; BARTOLOME, D. S.; BURDETTE, D. W.; AND COMSTOCK JR, J. R. A psychophysiological evaluation of the perceived urgency of auditory warning signals. Ergonomics, v. 38, n. 11, p. 2327-2340, 1995.
CHEN, S.-Y.; FENG, Z.; YI, X; A general introduction to adjustment for multiple comparisons. Journal of thoracic disease, v. 9, n. 6, p. 1725, 2017.
COLLINS, B. How Google’s Waze has changed from its early days as car travel is remade. CNBC Disruptor 50, 2022. Available in: https://www.cnbc.com/2022/12/01/how-googles-waze-changed-from-its-early-days-as-car-travel-is-remade.html. Accessed on 02 January 2025.
COLLINS DICTIONARY. How do you use the declarative to make statements in English? Collins Dictionary, 2024a. Available in: https://grammar.collinsdictionary.com/easy-learning/how-do-you-use-the-declarative-to-make-statements-in-english. Accessed on 13 September 2024.
COLLINS DICTIONARY. What type of sentence are commands? Collins Dictionary, 2024b. Available in: https://grammar.collinsdictionary.com/easy-learning/what-type-of-sentence-are-commands. Accessed on 13 September 2024.
DU, N.; ZHOU, F.; PULVER, E. M.; TILBURY, D. M.; ROBERT, L. P.; PRADHAN, A. K.; YANG, X. J. Predicting driver takeover performance in conditionally automated driving. Accident Analysis & Prevention, v. 148, p. 105748, 2020.
EICHELBERGER, A. H.; MCCARTT, A. T. Volvo drivers’ experiences with advanced crash avoidance and related technologies. Traffic injury prevention, v. 15, n. 2, p. 187-195, 2014.
FIELD, A. Discovering statistics using IBM SPSS statistics. Sage publications limited, 2024.
FORSTER, Y.; NAUJOKS, F.; NEUKUM, A. Increasing anthropomorphism and trust in automated driving functions by adding speech output. In: 2017 IEEE intelligent vehicles symposium (IV). IEEE, 2017. p. 365-372, 2017.
GARCIA-CHICO, J. L.; CORKER, K. M. An analysis of operational errors and the interaction with TCAS RAs. In: Proceedings 7th USA/Europe Air Traffic Management R&D Seminar (ATM2007), Barcelona, Spain. 2007. p. 2-5, 2007.
GUILLAUME, A. Intelligent auditory alarms. The sonification handbook. Berlin: Logos Verlag, 2011.
GUSTAVSSON, E. Virtual servants: Stereotyping female front‐office employees on the internet. Gender, Work & Organization, v. 12, n. 5, p. 400-419, 2005.
HELLIER, E.; EDWORTHY, J.; WEEDON, B.; WALTERS, K.; ADAMS, A. The perceived urgency of speech warnings: Semantics versus acoustics. Human Factors, v. 44, n. 1, p. 1-17, 2002.
HOY, M. B. Alexa, Siri, Cortana, and More: An Introduction to Voice Assistants. Medical Reference Services Quarterly, 37(1), p. 81–88, 2018. https://doi.org/10.1080/02763869.2018.1404391
HUANG, Y.; HE, L.; WEI, W.; GALE, W.; LI, J.; GONG, Y. Using personalized speech synthesis and neural language generator for rapid speaker adaptation. In: ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2020. p. 7399-7403, 2020.
KISELLA, B. Over 60% of Car Buyers That Have Used Voice Assistants Factor Availability into Purchase Decision. Voicebot.ai, 2020. Available in: https://voicebot.ai/2020/02/25/over-60-of-car-buyers-that-have-used-voice-assistantsfactoravailability-into-purchase-decision. Accessed on 10 August 2024.
KISELLA, B. Voice Shopping Rises to 45 Million U.S. Adults in 2021. Voicebot.ai, 2021. Available in: https://voicebot.ai/2021/12/24/voice-shopping-rises-to-45-million-u-sadults-in-2021. Accessed on 10 August 2024.
KISELLA, B. Voice Assistant Use in Healthcare Nearly Tripled Over Two Years and Demand Still Outstrips Supply – New Report. Voicebot.ai, 2022. Available in: https://voicebot.ai/2022/01/07/voice-assistant-use-in-healthcare-nearly-tripled-overtwoyears-and-demand-still-outstrips-supply-new-report/ Accessed on 10 August 2024.
KUZMINYKH, A.; SUN, J.; GOVINDARAJU, N.; AVERY, J.; LANK, E. Genie in the bottle: Anthropomorphized perceptions of conversational agents. In: Proceedings of the 2020 CHI conference on human factors in computing systems. 2020. p. 1-13.
LATORRE, J.; LACHOWICZ, J.; LORENZO-TRUEBA, J.; MERRITT, T.; DRUGMAN, T.; RONANKI, S.; KLIMKOV, V. Effect of data reduction on sequence-to-sequence neural TTS. In: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2019. p. 7075-7079, 2019.
LOPATOVSKA, I.; WILLIAMS, H. Personification of the Amazon Alexa: BFF or a mindless companion. Proceedings of the 2018 Conference on Human Information Interaction and Retrieval, pp. 265–268, 2018.
MURRAR, S.; BRAUER, M. Mixed model analysis of variance. The SAGE encyclopedia of educational research, measurement, and evaluation, v. 1, p. 1075-1078, 2018.
NASS, C.; STEUER, J.; TAUBER, E. R. Computers are social actors. In: Proceedings of the SIGCHI conference on Human factors in computing systems. 1994. p. 72-78.
NAUJOKS, F.; FORSTER, Y.; WIEDEMANN, K.; NEUKUM, A. Speech improves human automation cooperation in automated driving. In: Mensch und Computer 2016– Workshopband, 2016.
NAUJOKS, F.; PURUCKER, C.; WIEDEMANN, K.; NEUKUM, A.; WOLTER, S.; STEIGER, R. Driving performance at lateral system limits during partially automated driving. Accident Analysis & Prevention, v. 108, p. 147-162, 2017.
NØRGAARD, N. How to create a genderless voice | Nis Nørgaard | TEDxUniversityofNicosia. Tedx Talks, 2019. Available in: https://www.youtube.com/watch?v=qH6KB7MrOPw. Accessed on 06 August 2024.
OORD, A.; LI, Y.; BABUSCHKIN, I.; SIMONYAN, K.; VINYALS, O.; KAVUKCUOGLU, K.; DRIESSCHE, G.; LOCKHART, E.; et al. Parallel wavenet: Fast high-fidelity speech synthesis. In: International conference on machine learning. PMLR, 2018. p. 3918-3926, 2018.
POLITIS, I.; BREWSTER, S.; POLLICK, F. Language-based multimodal displays for the handover of control in autonomous cars. In: Proceedings of the 7th international conference on automotive user interfaces and interactive vehicular applications. 2015. p. 3-10, 2015.
Q. Meet Q: The First Genderless Voice - FULL SPEECH. Meet Q - The First Genderless Voice, 2019. Available in: https://www.youtube.com/watch?v=lvv6zYOQqm0. Accessed on 15 August 2024.
SOCIETY OF AUTOMOTIVE ENGINEERS INTERNATIONAL. Taxonomy and Definitions for Terms Related to Driving Automation Systems for On-Road Motor Vehicles – Standard: J3016™. Society of Automotive Engineers International, Surfaced Vehicle Recommended Practice. SAE International, v. 4970, n. 724, p. 1-5, 2021.
SHAHAB, Q.; TERKEN, J.; EGGEN, B. Auditory messages for speed advice in advanced driver assistance systems. In: Proceedings of the 2nd International Conference on Automotive User Interfaces and Interactive Vehicular Applications. 2010. p. 50-56, 2010.
SUTTON, S. J. Gender ambiguous, not genderless: Designing gender in voice user interfaces (VUIs) with sensitivity. In: Proceedings of the 2nd conference on conversational user interfaces. 2020. p. 1-8, 2020.
TOLMEIJER, S.; ZIERAU, N.; JANSON, A.; WAHDATEHAGH, J.; LEIMEISTER, J. M.; BERNSTEIN, A. Female by default?–exploring the effect of voice assistant gender and pitch on trait and trust attribution. In: Extended abstracts of the 2021 CHI conference on human factors in computing systems. 2021. p. 1-7, 2021.
TYMBURIBA ELIAN, M.; R. SAKAY, O. E.; YAMANAKA, T. Effects of Gender and Semantics on Auditory Displays: An Exploratory Study on User Interface Design. In: Congress of the International Association of Societies of Design Research. Singapore: Springer Nature Singapore, 2021. p. 1465-1473, 2022a.
TYMBURIBA ELIAN, M.; MASUKO, S.; YAMANAKA, T. Who are you talking to? Considerations on designing gender ambiguous voice user interfaces. In: 9th International Conference on Kansei Engineering and Emotion Research. KEER2022. Proceedings. p. 711-720, 2022b.
TYMBURIBA ELIAN, M.; BAO, S.; MASUKO, S; YAMANAKA, T. Designing gender ambiguous voice agents-effects of gender ambiguous voice agents on usability of voice user interfaces. International Journal of Affective Engineering, v. 22, n. 1, p. 53-62, 2023.
UNESCO. I'd blush if I could: closing gender divides in digital skills through education. UNESCO, 2019.
WOGALTER, M. S.; CONZOLA, V. C.; SMITH-JACKSON, T. L. Research-based guidelines for warning design and evaluation. Applied ergonomics, v. 33, n. 3, p. 219-230, 2002.
WOGALTER, M. S.; SILVER, N. C. Warning signal words: Connoted strength and understandability by children, elders, and non-native English speakers. Ergonomics, v. 38, n. 11, p. 2188-2206, 1995.
Wu, Z.; Swietojanski, P.; Veaux, C.; Renals, S.; King, S. A study of speaker adaptation for DNN-based speech synthesis. In: Interspeech 2015. International Speech Communication Association, 2015.
ZEEB, K.; BUCHNER, A.; SCHRAUF, M. What determines the take-over time? An integrated model approach of driver take-over after automated driving. Accident analysis & prevention, v. 78, p. 212-221, 2015.
ZEEB, K.; BUCHNER, A.; SCHRAUF, M. Is take-over time all that matters? The impact of visual-cognitive load on driver take-over quality after conditionally automated driving. Accident analysis & prevention, v. 92, p. 230-239, 2016.
DOI: https://doi.org/10.35522/eed.v33i1.2096
Apontamentos
- Não há apontamentos.
Direitos autorais 2025 Matheus Tymburibá Elian, Luiz Felipe Vecchieti, Toshimasa Yamanaka

Esta obra está licenciada sob uma licença Creative Commons Atribuição - NãoComercial 4.0 Internacional.
Revista Estudos em Design, Rio de Janeiro, RJ, Brasil, ISSN Impresso: 0104-4249, ISSN Eletrônico: 1983-196X

Esta obra está licenciada sob uma licença Creative Commons Atribuição-NãoComercial 4.0 Internacional.