Akshay Deshmukh | Human-Centered AI

About

Hi — I'm Akshay, a Human-Centered AI researcher working at the intersection of multimodal interaction, intelligent interfaces, and usability engineering. My core question is deceptively simple: when people speak to, gesture at, or glance toward an AI system, does it actually understand them? And if not — why not, and how do we measure the gap?

My research builds and evaluates AI systems that integrate speech, gesture, and visual input as first-class modalities — not as afterthoughts bolted onto existing interfaces. I develop psychometric scales, run empirical studies in both controlled and real-world settings, and model the cognitive costs that accumulate when people must switch between interaction modes. This work has produced two validated evaluation instruments: a usability scale purpose-built for voice user interfaces, and a context-aware scale for multimodal automotive interfaces.

Over the past decade I have moved fluidly between academia and industry — stress-testing large language models and foundation models at Fraunhofer IAIS, building a standards-aligned voice evaluation lab at Deutsche Telekom, and conducting early HCI research at Fraunhofer IGD. That back-and-forth between rigorous evaluation and real deployment is the lens through which I approach every research question.

"In the world of voice interfaces, the best designs are those that don't just respond, but truly listen to the human heart."

— Akshay Deshmukh, Ph.D. Dissertation (2025)

A bit more about me

I was born and raised in the city of Bengaluru, India — the Garden City, where the air smells of jasmine and filter coffee, and where I spent my early years equally fascinated by how things worked and why people behaved the way they did around them. That curiosity never left.

Since then I have been, professionally speaking, something of a nomad — moving from India to Germany and staying put just long enough in each city to develop opinions about its public transport. Fun fact: if you plot the coordinates of every city I have lived in for an extended period on a globe, they trace a surprisingly coherent arc from the Indian subcontinent to central Europe.

          Bengaluru
          →
          Darmstadt
          →
          Sankt Augustin
          →
          Freiberg
        

Outside of research, I trek whenever mountains are within reasonable striking distance, grow cherry tomatoes on my terrace in Darmstadt (mangoes back home in Bengaluru, naturally), and have been known to represent my home state in table tennis — though I now primarily use that skill to win arguments about reaction time in HCI experiments.

I also practice yoga and meditation, which I find essential for anyone who spends their days thinking about cognitive load.

fun fact 01

My career trajectory follows a Fitts' Law curve — the further the target (tenure), the longer the movement time, but the larger the target gets as you approach it.

fun fact 02

I have collected more usability questionnaire responses than I have eaten dosas. Both numbers are in the thousands.

Currently

Research Associate / Assistant Professor at TU Bergakademie Freiberg, Germany (2024 – Present). I teach Human-Centered AI, Intelligent User Interfaces, and Multimodal Systems across B.Sc. and M.Sc. programmes, supervise theses on topics ranging from driver emotion detection to Human-in-the-Loop AI design, and pursue active research in ubiquitous computing and XR-based interaction.

Previously

As Senior HMI-UX Researcher at Fraunhofer IAIS (2022–2024), I led the human-centered evaluation of OpenGPT-X and the Foundation Model Playground — defining what "usable" means for large-scale AI systems when the users aren't engineers. Before that, as UX Research Engineer at Deutsche Telekom (2019–2022), I established a Voice Lab from the ground up, aligning evaluation protocols with ETSI and Alexa standards and applying mixed-method assessment to voice assistants deployed at scale.

I hold a Ph.D. in Human-Computer Interaction (2020–2025) from the Universitat Jaume I / Fraunhofer IAIS, and an M.Sc. in Distributed Software Systems from TU Darmstadt.

Research Areas

Multimodal AI Systems

Building and evaluating AI systems that treat speech, gesture, and visual input as equal, interchangeable modalities. A key focus is the Modality-Switch Cognitive Cost — quantifying what users actually lose when they are forced to change interaction channels mid-task.

Usability & UX Evaluation

Designing evaluation instruments that go beyond generic questionnaires. This includes the VUIQ (voice usability) and CAUS (context-aware automotive usability) — purpose-built, empirically validated scales for AI-driven interfaces where standard tools fall short.

Voice User Interfaces

Investigating how conversational AI performs across real users, real accents, and real conditions — not just clean speech in ideal environments. Research spans multilingual VUIs, speaker personalization, and deployment robustness in healthcare and smart-home contexts.

Human-AI Interaction

Designing AI systems that respond not just to what users say, but to how they feel. This includes affect-adaptive dialogue agents, Human-in-the-Loop (HITL) learning pipelines, and XR interfaces that adapt based on detected cognitive load and emotional state.

→ Looking to pursue a Master's thesis on multimodal interaction, VUI evaluation, or Human-AI collaboration? I supervise motivated students working on empirically grounded, publishable research. Reach out with a brief statement of interest.

Selected Projects

CAUS — Context-Aware Usability Scale

Usability in a moving vehicle isn't the same as usability on a desktop. CAUS is a 20-item psychometric scale purpose-built to evaluate multimodal interaction — touch, voice, and gesture — under real driving conditions. Validated through a controlled on-road study (N=36, 216 trials), CAUS achieved high internal consistency (α=0.91) and showed voice interaction significantly outperforms touch in high-complexity urban traffic.

Automotive HCIUsability ScaleMultimodalAutoUI 2025

EIVA — Emotionally Intelligent Voice Assistant

What matters more — detecting your emotion correctly, or responding to it well? EIVA is a modular research platform built on a BiLSTM + attention speech emotion model (64.1% real-world accuracy) with expressive LLM-generated responses. A controlled study (N=30, 1,620 utterances) found expressive strategies matter far more than detection accuracy — and 90% of users want repair when the system gets it wrong.

Affective ComputingVoice AIEmotion RecognitionCUI 2026

MSCC — Modality-Switch Cognitive Cost

Every time a user switches from voice to touch — or touch to gesture — there is a measurable cognitive cost. MSCC is a novel event-level usability metric that quantifies exactly that cost, giving multimodal designers a first-class instrument to measure the hidden usability tax of switching. Rather than a design choice, modality switching is now a measurable, optimisable variable.

Multimodal HCICognitive LoadUsability MetricMUM 2025

Automated User Evaluation Analysis Tool

Traditional usability evaluations are expensive, time-consuming, and mostly one-shot affairs. This M.Sc. thesis project (Fraunhofer IGD, TU Darmstadt) built a web-based platform that automates the full evaluation lifecycle — from questionnaire distribution to result analysis — making continuous, repeatable UX measurement possible within an agile development cycle. The tool integrates SUS, AttrakDiff, and INTUI scales and was validated on the Smarter Privacy project.

Usability AutomationUX ToolsHCIM.Sc. Thesis · 2018

SPEAKER — German Voice Assistant Platform

A large-scale German-made voice assistant platform developed by Fraunhofer IIS and Fraunhofer IAIS, funded by the German Federal Ministry for Economic Affairs. SPEAKER addresses data sovereignty, GDPR compliance, and enterprise-grade NLU for B2B applications — including deployments for Deutsche Bahn, DATEV, and the police. As part of the HMI-UX research team at Fraunhofer IAIS, the work involved human-centered evaluation and usability assessment of the platform, developing evaluation frameworks covering usability, coherence, and interaction quality.

Voice AINLUFraunhofer IAISB2B PlatformGDPR

OpenGPT-X — European Large Language Model

OpenGPT-X is a flagship German-European initiative to develop sovereign, GDPR-compliant large language models as a trustworthy alternative to US-based systems. Led by Fraunhofer IAIS in collaboration with major industrial and research partners, the project builds multilingual LLMs tailored for European languages and enterprise use cases. As part of the HMI-UX research team at Fraunhofer IAIS, the work involved human-centered evaluation of the models — developing frameworks to assess usability, interaction coherence, and real-world effectiveness beyond standard benchmark metrics.

Large Language ModelsGenerative AIFraunhofer IAISEuropean AIHCI Evaluation

NeuroCoPilot — Assistive Sign Language System

⚙ IN PROGRESS

An assistive fingerspelling system that bridges the gap between digital commands and physical sign-language expression. NeuroCoPilot translates text or voice commands into sign-language letters and delivers them simultaneously through two channels — visually on a screen display and physically via Electrical Muscle Stimulation (EMS) that actuates real finger movements on the user's hand. Designed to support both ASL (American Sign Language) and DGS (Deutsche Gebärdensprache), the system targets deaf communication, sign-language education, and assistive rehabilitation.

Assistive HCISign LanguageEMSASL · DGSAccessibility

SynaestheticSkin — Physiological Presence Wearable

⚙ IN PROGRESS

When people are physically apart, digital tools transmit information about a person — not the felt experience of a person. SynaestheticSkin is a wearable that continuously reads Person A's physiological state and delivers it to Person B through four simultaneous sensory channels: thermal (skin temperature via Peltier module), EMS (heartbeat rhythm pulsed onto the forearm), vibrotactile (breathing rate mirrored on the chest), and olfactory (a shared bond scent modulated by emotional state). The system transmits ambient, involuntary physiological signals — no intentional action required — to approximate the sensory texture of physical co-presence over distance.

Wearable HCIHapticsEMSAffective ComputingRemote Presence

News

2025Paper Towards Context-Aware Usability Assessments in Vehicles: CAUS Scale accepted at AutoUI 2025 — DOI
2025Dissertation published: Measuring User Experience in Voice and Multimodal User Interfaces — Universitat Jaume I
2025Paper Mind the Switch: Measuring Modality-Switch Cognitive Costs (MSCC) published in MUM '25 — DOI
2025Paper AI-Based Framework for Usability in Digital Health accepted in Information Research
2024Joined TU Bergakademie Freiberg as Research Associate / Assistant Professor
2024Paper User Experience and Usability of Voice User Interfaces: A Systematic Literature Review in Information — DOI
2024Paper Validation of System Usability Scale as a Usability Metric to Evaluate Voice User Interfaces in PeerJ Computer Science
2022–24Led user-centered design & AI evaluation for OpenGPT-X at Fraunhofer IAIS
2019–22Established Voice Lab for AI system testing at Deutsche Telekom, aligned with ETSI and Alexa standards

Publications

2025

Mind the Switch: Measuring Modality-Switch Cognitive Costs (MSCC) as a Metric for Multimodal Interaction

Akshay Madhav Deshmukh, Bastian Pfleging

MUM '25 – 24th Intl. Conference on Mobile and Ubiquitous Multimedia→ DOI

Towards Context-Aware Usability Assessments in Vehicles: Developing the CAUS Scale for Multimodal Interfaces

Akshay Madhav Deshmukh

AutoUI 2025 – Adjunct Proceedings→ DOI

Measuring User Experience in Voice and Multimodal User Interfaces (Ph.D. Dissertation)

Akshay Madhav Deshmukh

Universitat Jaume I→ Repository

AI-Based Framework for Usability in Digital Health

Deshmukh, A. M.

Information Research Accepted

2024

User Experience and Usability of Voice User Interfaces: A Systematic Literature Review

Deshmukh, A. M., & Chalmeta, R.

Information, 15(9), 579→ DOI

Validation of System Usability Scale as a Usability Metric to Evaluate Voice User Interfaces

Deshmukh, A. M., & Chalmeta, R.

PeerJ Computer Science, 10, e1918

Under Review / Forthcoming

CAUS: A Context-Aware Usability Scale for Evaluating Multimodal Interaction in Driving

AutoUI 2026 Under Review

Quantifying Modality-Switch Cognitive Cost: An Event-Level Usability Metric for Multimodal Interaction

ICMI 2026 Under Review

From Detection to Alignment: Evaluating Affect-Adaptive Response Strategies in Voice-Only Conversational Agents

CUI 2026, Bremen Under Review

Development and Validation of a User Experience Evaluation Questionnaire for Voice User Interfaces

Under Review

Comparative Usability Evaluation of Voice and App-Based Interaction Modalities in Smart Home AI Assistants

Under Review

Experience

Research Associate / Assistant Professor

2024 – Present

TU Bergakademie Freiberg, Germany

Teaching Human-Centered AI, Intelligent User Interfaces, and Multimodal Systems to the next generation of computing researchers. Supervising M.Sc. theses on topics I genuinely find fascinating — from real-time driver emotion detection to Human-in-the-Loop AI design systems. Alongside teaching, I pursue active research in ubiquitous computing, XR interaction, and emergency response interfaces (Feuerwehr-App, Saxony).

Senior HMI-UX Researcher

Nov 2022 – Oct 2024

Fraunhofer IAIS, Sankt Augustin, Germany

Led the human-centered evaluation programme for OpenGPT-X, the Foundation Model Playground, and SPEAKER — three of Germany's flagship AI research initiatives. The central challenge: defining what "good" looks like for AI systems when the end users aren't AI researchers. Built evaluation frameworks measuring usability, interaction coherence, cognitive load, and real-world robustness across multimodal deployment contexts.

UX Research Engineer

Feb 2019 – Oct 2022

Deutsche Telekom (via axxessio GmbH)

Built Deutsche Telekom's Voice Lab from the ground up — establishing evaluation standards aligned with ETSI specifications and Alexa certification requirements. Ran systematic usability studies for the Magenta Smart Speaker and TelekomBot, designed end-to-end conversational AI assessment protocols covering ASR accuracy, NLU understanding, and holistic interaction quality, and applied mixed-method evaluation to improve robustness in healthcare and smart-home deployments.

Research Assistant / Directed Researcher

Dec 2015 – Feb 2018

Fraunhofer IGD, Darmstadt, Germany

Conducted HCI research across the full spectrum of usability evaluation methods — expert review, participatory study, quantitative survey — and synthesised findings into an automated evaluation analysis tool that tracked usability progress across the software development lifecycle. Applied across four projects: Trend Analysis, FUPOL, Process Adaptation, and Smarter Privacy.

Visiting Researcher

Jun 2011 – Jul 2014

Indian Institute of Science (IISc), Bangalore, India

Early-career research on inclusive interface design for India's extraordinarily diverse population — users spanning multiple languages, literacy levels, and degrees of digital experience. This work shaped a lasting commitment to designing technology that works for people beyond the "average user" assumption.

Teaching & Supervision

I teach research-driven, project-oriented courses at TU Bergakademie Freiberg, with a focus on getting students building and evaluating real systems — not just reading about them.

Interactive Ubiquitous Systems & Intelligent User Interfaces (full course — B.Sc./M.Sc.)
User-Centered Design (full course — B.Sc./M.Sc.)
Data Structures in C (lectures, exercises & practicals)

Thesis Topics Supervised

Evaluating UX in Multimodal Voice Interfaces — studying how combining voice with touch reshapes the interaction experience (M.Sc.)
Personalization in Voice User Interfaces — measuring the real impact of tailored responses on user satisfaction and engagement (B.Sc.)
Designing VUIs for Non-Native German Speakers — addressing usability and comprehension gaps often overlooked in mainstream VUI design (M.Sc.)
Driver Emotion Detection via In-Cabin Sensors — building an AI system that adapts the vehicle interface in real time based on detected stress or fatigue (M.Sc., Ongoing)
Human-in-the-Loop Learning for AI-Powered Design Systems — exploring how structured human feedback improves generative design quality and usability (M.Sc., Ongoing)

Grants & Collaborations

DAAD-DST-PPP — International research collaboration between BITS Pilani (India) and TUBAF (Germany) on emotionally intelligent voice assistants