Publications.

Selected peer-reviewed papers, preprints, and conference presentations from the AIMHealth group, listed newest first.

Year All 2025 2024 2023

2025

14 publications

Integrating Expert Knowledge into Large Language Models Improves Performance for Psychiatric Reasoning and Diagnosis

Sarma, K. V., Hanss, K. E., Halls, A. J. M., Krystal, A., Becker, D. F., Glowinski, A. L., Butte, A. J. Psychiatry Research · 2025

PDF

journal

Competence or confidence? Assessing the accuracy, reliability, and confidence of large language models in psychiatry

Hanss, K. E., Sarma, K. V., Glowinski, A. L., Krystal, A., Saunders, R., Halls, A. J. M., Gorrell, S., Reilly, E. JMIR · 2025

PDF PubMed

journal

Quantifying device type and handedness biases in a remote Parkinson's disease AI-powered assessment

Tumpa, Z. N., Zawad, M. R. S., Sollis, L., Parab, S., Chen, I. Y., Washington, P. NPJ Digital Medicine · 2025

PDF

journal

"You're Not Crazy": A Case of New Onset AI-Associated Psychosis

Pierre, J. M., Gaeta, B., Raghavan, G., Sarma, K. V. Innovations in Clinical Neuroscience · 2025

PDF

journal

Simulated Reasoning and Self-Verification in Generalist Large Language Models for Psychiatric Diagnostic Performance: Cross-Sectional Study

Sarma, K. V., Hanss, K. E., Halls, A., Becker, D., Glowinski, A., Krystal, A. medRxiv [Preprint] · 2025

PDF

preprint

Characterizing Dementia Phenotypes from Unstructured EHR Notes with Generative AI and Interpretable Machine Learning

Tang, A. S., Zeng, B. Z. D., Rankin, K. P., Miller, B., Gorno-Tempini, M. L., Seeley, W. W., Rosen, H. J., Rabinovici, G. D., Oskotsky, T. T., Sirota, M., Pinheiro-Chagas, P. medRxiv · 2025

PDF

preprint

Agentic Generative Artificial Intelligence System for Classification of Pathology-Confirmed Primary Progressive Aphasia Variants

Gallingani, C., Miller, Z. A., Mandelli, M. L., Rosen, H. J., Ezzes, Z., Lin, M., Rodriguez, D., Seeley, W. W., Miller, B., Gorno-Tempini, M. L., Pinheiro-Chagas, P. medRxiv · 2025

PDF

preprint

When Testing AI Tests Us: Safeguarding Mental Health on the Digital Frontlines

Pendse, S. R., Gergle, D., Kornfield, R., Meyerhoff, J., Mohr, D., Suh, J., Wescott, A., Williams, C., Schleider, J. ACM Conference on Fairness, Accountability, and Transparency (FAccT) · 2025

PDF

conference

The Typing Cure: Experiences with Large Language Model Chatbots for Mental Health Support

Song, I., Pendse, S. R., Kumar, N., De Choudhury, M. Proceedings of the ACM on Human-Computer Interaction (PACM HCI), CSCW · 2025

PDF

conference

The Role of Partisan Culture in Mental Health Language Online

Pendse, S. R., Rochford, B., Kumar, N., De Choudhury, M. Proceedings of the ACM on Human-Computer Interaction (PACM HCI), CSCW · 2025

PDF

conference

Implicit Gender, Racial, and Ethnic Biases in Large Language Models: An Audit Study of Automated Psychiatric Diagnoses

Pendse, S. R., Jain, M., Kumar, N., De Choudhury, M. Preprint · 2025

PDF

preprint

Leveraging Large Language Models to Code Content Fidelity in Virtual School-Based Behavioral Parent Training

Langfus J, Hanss K, Chung S, Nili A, Haack L, Pfiffner L. International Society for Research in Child and Adolescent Psychopathology Biennial Meeting · 2025

conference

Dr. AI Will See You Now: The Opportunities, Challenges, and Risks of ChatGPT, Gemini, and Other Large Language Models in Psychiatry

Sarma, K. V., Hanss, K. E., Galatzer-Levy, I. R., Tolou-Shams, M. APA Annual Meeting · 2025

conference

The Robo-Doctor is Always In: Assessing and Comparing the Psychiatric Diagnostic Capabilities of ChatGPT and other Large Language Models

Sarma, K. V., Hanss, K. E., Glowinski, A. L., Krystal, A., Halls, A. J. M., Butte, A. J. APA Annual Meeting · 2025

conference

2024

17 publications

Can Artificial Intelligence Make the Diagnosis? Evaluating the Accuracy of Large Language Models in Diagnosing Child and Adolescent Psychiatry Clinical Cases

Hanss, K., Sarma, K. V., Halls, A., Gorrell, S., Reilly, E. Journal of the American Academy of Child & Adolescent Psychiatry · 2024

PDF

journal

Rethinking technology innovation for mental health: framework for multi-sectoral collaboration

Suh, J., Pendse, S. R., Lewis, R., Howe, E., Saha, K., Okoli, E., Amores, J., Ramos, G., Shen, J., Borghouts, J., Sharma, A., Pedrelli, P., Friedman, L., Jackman, C., Benhalim, Y., Ong, D. C., Segal, S., Althoff, T., Czerwinski, M. Nature Mental Health · 2024

PDF

journal

Missed Opportunities for Human-Centered AI Research: Understanding Stakeholder Collaboration in Mental Health AI Research

Yoo, D. W., Woo, H., Pendse, S. R., Lu, N. Y., Birnbaum, M. L., Abowd, G. D., De Choudhury, M. Proceedings of the ACM on Human-Computer Interaction · 2024

PDF

conference

Advancing a consent-forward paradigm for digital mental health data

Pendse, S. R., Stapleton, L., Kumar, N., De Choudhury, M., Chancellor, S. Nature Mental Health · 2024

journal

Quantifying the Pollan Effect: Investigating the Impact of Emerging Psychiatric Interventions on Online Mental Health Discourse

Pendse, S. R., Kumar, N., De Choudhury, M. Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems · 2024

conference

Towards Inclusive Futures for Worker Wellbeing

Pendse, S. R., Massachi, T., Mahdavimoghaddam, J., Butler, J., Suh, J., Czerwinski, M. Proceedings of the ACM on Human-Computer Interaction (PACM HCI), CSCW · 2024

conference

Challenges in the Differential Classification of Individual Diagnoses from Co-Occurring Autism and ADHD Using Survey Data

Jaiswal, A., Wall, D. P., Washington, P. IEEE EMBS International Conference on Biomedical Health Informatics · 2024

PDF

conference

Ethics of the Use of Social Media as Training Data for AI Models Used for Digital Phenotyping

Jaiswal, A., Shah, A., Harjadi, C., Windgassen, E., Washington, P. JMIR Formative Research · 2024

PDF

journal

Using #ActuallyAutistic on Twitter for Precision Diagnosis of Autism Spectrum Disorder: Machine Learning Study

Jaiswal, A., Washington, P. JMIR Formative Research · 2024

PDF

journal

Digitally Diagnosing Multiple Developmental Delays Using Crowdsourcing Fused With Machine Learning

Washington, P. JMIR Research Protocols · 2024

PDF

journal

A Comparison of Personalized and Generalized Approaches to Emotion Recognition Using Consumer Wearable Devices: Machine Learning Study

Li, J., Washington, P. JMIR AI · 2024

PDF

journal

Can Large Language Models Reason about Behavioral Health? Evaluating the Psychiatric Knowledge Base and Reasoning Capabilities of GPT-4

Sarma, K. V.*, Hanss, K. E.*, Glowinski, A. L., Butte, A. J., Halls, A. J. M. UCSF Health Services Research Symposium · 2024

conference

The Ethics of Artificial Intelligence in Psychiatry: A Beginner's Exploration.

He, C. X, Sarma, K. V., Hu, R. APA Mental Health Services Conference · 2024

conference

Can Large Language Model-based AI Reason about Behavioral Health? Preliminary Evaluation of a Decision Tree-Based LLM Algorithm for Psychiatric Case Diagnosis

Sarma, K. V., Hanss, K. E., Glowinski, A. L., Krystal, A., Halls, A. J. M., Butte, A. J. ACNP 63rd Annual Meeting · 2024

PDF PubMed

conference

Improving the Performance of LLM-Based Semi-Automated Psychiatric Case Diagnosis using Decision Tree-Based Prompting

Sarma, K. V., Hanss, K. E., Glowinski, A. L., Butte, A. J., Halls, A. J. M. AMIA Annual Meeting · 2024

conference

AI vs. DSM -- Can It Make the Diagnosis? Measuring GPT-4's Psychiatric Knowledge and Reasoning

Sarma, K. V., Hanss, K. E., Elkin, D., Halls, A. UCSF School of Medicine Leadership Retreat · 2024

conference

Grading the Machine: Assessing ChatGPT's Psychiatric Knowledge through Boards-Style Assessment

Hanss, K. E.*, Sarma, K. V.*, Saunders, R., Elkin, D. APA Annual Meeting · 2024

conference

2023

1 publication

A Review of and Roadmap for Data Science and Machine Learning for the Neuropsychiatric Phenotype of Autism

Washington, P., Wall, D. P. Annual Review of Biomedical Data Science · 2023

PDF

journal