Shannon Zejiang Shen

2025

LaText: Interleave Latent and Text Chain-of-Thought for efficient reasoning

2025 November • Talk at the Scale ML Seminar Series @ MIT

I gave a talk on our recent work on LaText, a novel approach to interleave latent and text chain-of-thought for efficient reasoning.

Rethinking the Design and Evaluation of Human and LLM Collaboration

2025 May • Talk at Stanford HCI Group Lunch Seminar

I shared an initial version of our collaborative effort scaling paper, and discussed the HCI aspects of our previous work on Symbolic Generation.

2024

Co-LLM: Training LLMs to Decode Collaboratively

2024 August • Talk at University of Washington

This talk is hosted by Luke Zettlemoyer’s group. We go through the details of our ACL paper Co-LLM. You can find the slides here.

Developing User-Friendly Language Language Model Systems

2024 May • Talk at Google Research

This talk is hosted by Chiyuan Zhang and Yangsibo Hunag. We focused on the Co-LLM project and had a deep dive in the methodology and experiments. Slides available upon request.

LayoutParser and Historical Document Image Processing

2024 May • RSAP panel at the American Literature Association conference

We reviewed the LayoutParser design and functionality, as well as approaches to tackle historical image processing and extraction in 2024. Slides available upon request.

Developing User-Friendly Language Language Model Systems

2024 March • Talk at Ranjay Krishna’s Group @ UW

We start with the analogy between web interface development and llm development: LLM can produces raw text (as if htmls for the web pages) – what is the CSS and javascript in the context of LLMs? We then talk about two recent projects, Co-LLM and SymGen, drawing connections between our methods and web technologies like CSS, API calls, etc. Slides available upon request.

Towards Verifiable Text Generation for Developing Trustworthy LLMs

2024 March • Talk at MIT Sloan AI/ML Conference

In this short talk, we cover our latest research on SymGen, a novel approach to generating verifiable text for developing trustworthy LLMs. Slides available upon request.

LayoutParser and Historical Document Image Processing

2024 March • Discussion on Image Extraction, hosted by Thomas Smits at University of Amsterdam

We reviewed the LayoutParser design and functionality, as well as approaches to tackle historical image processing and extraction in 2024. Slides available upon request.

Visual Design in Scholarly Communication

2024 Jan • Instructor for an MIT IAP Class

A series of lectures over the MIT IAP period, co-taught with Lucas Torroba Hennigen, focused on visual design in scholarly communication. Visual design is a crucial element in various forms of scientific communication, ranging from papers, slides, to even videos. While there is an increasing need for researchers to produce high-quality visuals, it remains to be a time-consuming and sometimes very challenging task. Despite the significant role they play, there is a noticeable lack of formal education dedicated to this aspect. This subject aims to cover several key topics about visual designs in scholarly communication.

2023

Redesigning Clinical Documentation

2023 April • Talk at Nigam Shah’s Group Meeting @ Stanford

We took the inspiration from our position paper on AI supported expository writing and discuss how to apply such ideas in clinical documentation. This is a joint presentation with Monica Agrawal and Hunter Lang.

2022

Multi-LexSum: Real-world Summaries of Civil Rights Lawsuits at Multiple Granularities

2022 Dec • Talk at Natural Legal Language Processing workshop @ EMNLP 2022

A presentation of our work on the Multi-LexSum dataset, containing real-world summaries of civil rights lawsuits at multiple granularities.

Visual Content Extraction for Scientific Documents

2022 Nov • Guest Lecture in CSE 599D @ UW, hosted by Prof. Jeff Heer

We reviewed the general problem of visual content extraction in scientific documents, as well as the current state-of-the-art methods and challenges. Slides available upon request.