Publications

834 results for Trustworthy AI

Highlight All the Phrases: Enhancing LLM Transparency through Visual Factuality Indicators
- - Hyo Jin Do
  - Rachel Ostrand
  - et al.
- 2025
- AIES 2025
Localizing Persona Representations in LLMs
- - Celia Cintas
  - Miriam Rateike
  - et al.
- 2025
- AIES 2025
Learn more about our Trustworthy AI work
When in Doubt, Cascade: Towards Building Efficient and Capable Guardrails
- - Manish Nagireddy
  - Inkit Padhi
  - et al.
- 2025
- AIES 2025
Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble
- - Zhiqi Wang
  - Chengyu Zhang
  - et al.
- 2025
- CCS 2025
StructText: A Synthetic Table-to-Text Approach for Benchmark Generation with Multi-Dimensional Evaluation
- - Satyananda Kashyap
  - Sola Shirai
  - et al.
- 2025
- VLDB 2025
The WHY in Business Processes: Unification of Causal Process Models
- - Yuval David
  - Fabiana Fournier
  - et al.
- 2025
- BPM 2025
How to generalize machine learning models to both canonical and non-canonical peptides
- - Raúl Fernández Díaz
  - Rodrigo Ochoa
  - et al.
- 2025
- ACS Fall 2025
The Impact of Domain Adaptation on the Activation Space of LLMs
- - Assala Benmalek
  - Celia Cintas
  - et al.
- 2025
- DLI 2025
3rd TrustAI Workshop: Building Public Awareness and Engagement
- - Miriam Rateike
  - Brian Mboya
  - et al.
- 2025
- DLI 2025
DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis
- - Prashanth Vijayaraghavan
  - Soroush Vosoughi
  - et al.
- 2025
- IJCAI 2025