news | Weiting Tan

Sep 25, 2025	My internship work at Bytedance Seed, Process-Supervised Reinforcement Learning for Interactive Multimodal Tool-Use Agents and FlowPortrait: Reinforcement Learning for Audio-Driven Portrait Video Generation, are now released on arXiv.
Aug 20, 2025	Seeing is Believing: Emotion-Aware Audio-Visual Language Modeling for Expressive Speech Generation accepted to EMNLP 2025!
May 20, 2025	STAR and SSR accepted to IWSLT 2025! Please come to our presentation, see you in Vienna!
May 01, 2025	Starting a new internship at Bytedance Seed! Will be working on iterative (multi-turn, multi-modal) tool-use agents.
Sep 25, 2024	DiffNorm accepted to NeurIPS 2024! Please come to our poster, see you in Vancouver!
Aug 02, 2024	I will work on Multi-modal LLMs as a part-time student researcher at Meta in Fall 2024 & Spring 2025.
Feb 20, 2024	I will be interning at Meta AI (FAIR) in summer 2024, working on Speech Large Language Model. Looking forward to the new project!
Apr 07, 2023	I will be staying at Johns Hopkins University for my PhD, working with Prof. Philipp Koehn!