Yuntian Deng

About

I am an assistant professor at the University of Waterloo. My research interests are Natural Language Processing and Machine Learning. I received my PhD from Harvard University, where I was advised by Prof. Alexander Rush and Prof. Stuart Shieber. I did a postdoc under the supervision of Prof. Yejin Choi.

Work

University of Waterloo

Assistant Professor

Canada

University of Waterloo

Assistant Professor

Canada

Aug 2024

→

Present

Allen Institute for Artificial Intelligence

Postdoc

Jul 2023

→

Jul 2024

Nvidia (United States)

Intern

May 2022

→

Dec 2022

Facebook (United States)

Intern

May 2019

→

Dec 2019

Bloomberg (United States)

Intern

Jan 2017

→

Aug 2017

Education

Carnegie Mellon University

United States of America

Jan 2017

→

Aug 2017

Master of Science in Language Technologies

Tsinghua University

China

Jan 2017

→

Aug 2017

Bachelor of Engineering

Harvard University

United States of America

Jan 2017

→

Aug 2017

PhD in Computer Science

Publications

Discrete Transformer

Summary

journal-article

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Jan 2024

Published by

arXiv preprint arXiv:2406.08464

Summary

journal-article

MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures

Jan 2024

Published by

arXiv preprint arXiv:2406.06565

Summary

journal-article

WILDBENCH: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Jan 2024

Published by

arXiv preprint arXiv:2406.04770

Summary

journal-article

WildChat: 1M ChatGPT Interaction Logs in the Wild

Jan 2024

Published by

ICLR

Summary

conference-paper

WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries

Jan 2024

Published by

arXiv preprint arXiv:2407.17468

Summary

journal-article

WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild

Jan 2024

Published by

arXiv preprint arXiv:2409.03753

Summary

journal-article

From explicit cot to implicit cot: Learning to internalize cot step by step

Jan 2024

Published by

arXiv preprint arXiv:2405.14838

Summary

journal-article

GenSLMs: Genome-scale language models reveal SARS-CoV-2 evolutionary dynamics

Jan 2023

Published by

The International Journal of High Performance Computing Applications

Summary

journal-article

Implicit chain of thought reasoning via knowledge distillation

Jan 2023

Published by

arXiv preprint arXiv:2311.01460

Summary

journal-article

Instruction in the wild: A user-based instruction dataset

Jan 2023

Published by

GitHub repository

Summary

journal-article

Structure Modeling for Language Models

Jan 2023

Summary

dissertation-thesis

Tree Prompting: Efficient Task Adaptation without Fine-Tuning

Jan 2023

Published by

EMNLP

Summary

conference-paper

DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery through Sophisticated AI System Technologies

Jan 2023

Published by

arXiv preprint arXiv:2310.04610

Summary

journal-article

Model Criticism for Long-Form Text Generation

Jan 2022

Published by

EMNLP

Summary

conference-paper

Semi-Parametric Inducing Point Networks and Neural Processes

Jan 2022

Published by

ICLR

Summary

conference-paper

Markup-to-Image Diffusion Models with Scheduled Sampling

Jan 2022

Published by

ICLR

Summary

conference-paper

Rationales for Sequential Predictions

Jan 2021

Published by

EMNLP

Summary

conference-paper

Sequence-to-Lattice Models for Fast Translation

Jan 2021

Published by

EMNLP Findings

Summary

conference-paper

Low-Rank Constraints for Fast Inference in Structured Models

Jan 2021

Published by

NeurIPS

Summary

conference-paper

Residual Energy-Based Models for Text Generation

Jan 2020

Published by

ICLR

Summary

conference-paper

Cascaded Text Generation with Markov Transformers

Jan 2020

Published by

NeurIPS

Summary

conference-paper

Challenges in end-to-end neural scientific table recognition

Jan 2019

Published by

ICDAR

Summary

conference-paper

Neural Linguistic Steganography

Jan 2019

Published by

EMNLP

Summary

conference-paper

Real or Fake? Learning to Discriminate Machine from Human Generated Text

Jan 2019

Published by

arXiv preprint arXiv:1906.03351

Summary

journal-article

AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference

Jan 2019

Published by

DAC

Summary

conference-paper

Latent Alignment and Variational Attention

Jan 2018

Published by

NeurIPS

Summary

conference-paper

Visual Attention Model for Cross-sectional Stock Return Prediction and End-to-End Multimodal Market Representation Learning

Jan 2018

Published by

FLAIRS

Summary

conference-paper

Bottom-Up Abstractive Summarization

Jan 2018

Published by

EMNLP

Summary

conference-paper

Learning Latent Space Models with Angular Constraints

Jan 2017

Published by

ICML

Summary

conference-paper

OpenNMT: Open-Source Toolkit for Neural Machine Translation

Jan 2017

Published by

ACL Demo

Summary

conference-paper

Image-to-Markup Generation with Coarse-to-Fine Attention

Jan 2017

Published by

ICML

Summary

conference-paper

Neural Machine Translation with Recurrent Attention Modeling

Jan 2016

Published by

EACL

Summary

conference-paper

Learning Concept Taxonomies from Multi-modal Data

Jan 2016

Published by

ACL

Summary

conference-paper

Dropout with Expectation-linear Regularization

Jan 2016

Published by

ICLR

Summary

conference-paper

On the generalization error bounds of neural networks under diversity-inducing mutual angular regularization

Jan 2015

Published by

arXiv preprint arXiv:1511.07110

Summary

journal-article

Entity Hierarchy Embedding

Jan 2015

Published by

ACL

Summary

conference-paper

Diversifying Restricted Boltzmann Machine for Document Modeling

Jan 2015

Published by

KDD

Summary

conference-paper

Latent variable modeling with diversity-inducing mutual angular regularization

Jan 2015

Published by

arXiv preprint arXiv:1512.07336

Summary

journal-article