Exploring the Limits of Conformer CTC-Encoder for Speech Emotion Recognition using Large Language ModelsEdmilson Da Silva MoraisHagai Aronowitzet al.2025INTERSPEECH 2025
Learnable Channel Converter for Multi-Spectral Image to RGB Visualization using a Vision-Text ModelHaoxiang QiuTomoya Sakaiet al.2025IGARSS 2025
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational AgentsIvoline NgongSwanand Ravindra Kadheet al.2025ACL 2025
BioVERSE: A Modular Framework for Integrating Biomedical Modalities with Language Models in Precision MedicineChing-Huei TsouMichal Ozery-Flatoet al.2025ISMB 2025
Granite Vision: A Demo for Efficient Visual Document UnderstandingPengyuan LiGranite Vision Team2025CVPR 2025
CodeGenWrangler: Data Wrangling task automation using Code-Generating ModelsAkella AshleshaAbhijit Manatkaret al.2025NAACL 2025
Enterprise Benchmarks for Large Language Model EvaluationBing ZhangMikio Takeuchiet al.2025NAACL 2025
Creating Conversational Datasets for Retrieval-Augmented Generation Applications is Hard: Challenges & Research OpportunitiesMaeda HanafiKshitij Fadniset al.2025CHI 2025
LLM based Text Generation for Improved Low-resource Speech Recognition ModelsTohru NaganoGakuto Kurataet al.2025ICASSP 2025