ICML 2024 Past Large language modelsAgentsSafety & alignment

Trustworthy Multi-modal Foundation Models and AI Agents (TiFA)

ICML 2024 TiFA Workshop

Submission deadline
May 31, 2024, 12:00 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (19)

Fetched from OpenReview (v2) on 2026-06-10.

  1. Bias Begets Bias: the Impact of Biased Embeddings on Diffusion Models

    Sahil Kuchlous, Marvin Li, Jeffrey George Wang · PDF
  2. Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity

    zhuo zhi, Ziquan Liu, Moe Elbadawi, Adam Daneshmend, Mine Orlu, Abdul W Basit, Andreas Demosthenous, Miguel R. D. Rodrigues · PDF
  3. Can Editing LLMs Inject Harm?

    Canyu Chen, Baixiang Huang, Zekun Li, Zhaorun Chen, Shiyang Lai, Xiongxiao Xu, Jia-Chen Gu, Jindong Gu, Huaxiu Yao, Chaowei Xiao, Xifeng Yan, William Yang Wang, Philip Torr, Dawn Song, Kai Shu · PDF
  4. Chained Tuning Leads to Biased Forgetting

    Megan Ung, Alicia Yi Sun, Samuel Bell, Levent Sagun, Adina Williams · PDF
  5. Decomposed evaluations of geographic disparities in text-to-image models

    Abhishek Sureddy, Dishant Padalia, Nandhinee Periyakaruppan, Oindrila Saha, Adina Williams, Adriana Romero-Soriano, Megan Richards, Polina Kirichenko, Melissa Hall · PDF
  6. Games for AI-Control: Models of Safety Evaluations of AI Deployment Protocols

    Charlie Griffin, Buck Shlegeris, Alessandro Abate · PDF
  7. MaPPing Your Model: Assessing the Impact of Adversarial Attacks on LLM-based Programming Assistants

    John Heibel, Daniel Lowd · PDF
  8. Models That Prove Their Own Correctness

    Noga Amit, Shafi Goldwasser, Orr Paradise, Guy N. Rothblum · PDF
  9. On the Difficulty of Faithful Chain-of-Thought Reasoning in Large Language Models

    Sree Harsha Tanneru, Dan Ley, Chirag Agarwal, Himabindu Lakkaraju · PDF
  10. On the Multi-modal Vulnerability of Diffusion Models

    Dingcheng Yang, Yang Bai, Xiaojun Jia, Yang Liu, Xiaochun Cao, Wenjian Yu · PDF
  11. Physical Backdoor Attack can Jeopardize Driving with Vision-Large-Language Models

    Zhenyang Ni, Rui Ye, Yuxi Wei, Zhen Xiang, Yanfeng Wang, Siheng Chen · PDF
  12. Towards Adversarially Robust Vision-Language Models: Insights from Design Choices and Prompt Formatting Techniques

    Rishika Bhagwatkar, Shravan Nayak, Reza Bayat, Alexis Roger, Daniel Z Kaplan, Pouya Bashivan, Irina Rish · PDF
  13. TrustAgent: Towards Safe and Trustworthy LLM-based Agents through Agent Constitution

    Wenyue Hua, Xianjun Yang, Mingyu Jin, Zelong Li, Wei Cheng, Ruixiang Tang, Yongfeng Zhang · PDF
  14. Unfamiliar Finetuning Examples Control How Language Models Hallucinate

    Katie Kang, Eric Wallace, Claire Tomlin, Aviral Kumar, Sergey Levine · PDF
  15. VACoDe: Visual Augmented Contrastive Decoding

    Sihyeon Kim, Boryeong Cho, Sangmin Bae, Sumyeong Ahn, Se-Young Yun · PDF
  16. Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs

    Jinmin Li, Kuofeng Gao, Yang Bai, Jingyun Zhang, Shu-Tao Xia · PDF
  17. Wasserstein Modality Alignment Makes Your Multimodal Transformer More Robust

    zhuo zhi, Ziquan Liu, Qiangqiang Wu, Miguel R. D. Rodrigues · PDF
  18. WebCanvas: Benchmarking Web Agents in Online Environments

    Yichen Pan, Dehan Kong, Sida Zhou, Cheng Cui, Yifei Leng, Bing Jiang, Hangyu Liu, Yanyi Shang, Shuyan Zhou, Tongshuang Wu, Zhengyang Wu · PDF
  19. Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

    Rylan Schaeffer, Hailey Schoelkopf, Brando Miranda, Gabriel Mukobi, Varun Madan, Adam Ibrahim, Herbie Bradley, Stella Biderman, Sanmi Koyejo · PDF