NESTFUL: A Benchmark for Evaluating LLMs on Nested Sequences of API Calls. Kinjal Basu, Ibrahim Abdelaziz, et al. 2025. EMNLP 2025.
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks. Ibrahim Abdelaziz, Kinjal Basu, et al. 2024. EMNLP 2024.
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs. Kinjal Basu, Ibrahim Abdelaziz, et al. 2024. ACL 2024.
Pointwise Mutual Information Based Metric and Decoding Strategy for Faithful Generation in Document Grounded Dialogs. Yatin Nandwani, Vineet Kumar, et al. 2023. EMNLP 2023.
DG2: Data Augmentation Through Document Grounded Dialogue Generation. Qingyang Wu, Song Feng, et al. 2022. SIGDIAL 2022.
Does Structure Matter? Encoding Documents for Machine Reading Comprehension. Hui Wan, Song Feng, et al. 2021. NAACL-HLT 2021.