Posts by Collection

We analyze how LLMs learn new knowledge through the lens of knowledge circuit evolution, identifying computational subgraphs that facilitate knowledge storage and processing.

Download [pdf].

Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers

Brian K Chen, Tianyang Hu, Hui Jin, Hwee Kuan Lee, Kenji Kawaguchi, 2025.

Published in The Forty-First International Conference on Machine Learning (ICML 2024).

We show that in-context prompts can be converted exactly into model weights in linearized-attention transformers by introducing an extra bias term into the attention module.
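The bias-term idea can be illustrated in a simplified setting. The sketch below is not the paper's algorithm. It assumes a single layer of unnormalized causal linear attention with an identity feature map, and every name in it (`linear_attention`, `context_bias`, the toy vectors) is hypothetical. Under those assumptions, the prompt tokens contribute a fixed sum of key-value outer products, so prepending the prompt gives the same outputs as starting from a bias matrix equal to that sum.

```python
# Sketch only: single-layer, unnormalized causal linear attention with an
# identity feature map. All names are hypothetical, not taken from the paper.

def outer(u, v):
    """Outer product of two vectors as a nested list."""
    return [[ui * vj for vj in v] for ui in u]

def mat_add(a, b):
    """Element-wise sum of two equally sized matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def vec_mat(q, m):
    """Row vector q (length d_k) times matrix m (d_k x d_v)."""
    return [sum(q[i] * m[i][j] for i in range(len(q))) for j in range(len(m[0]))]

def zeros(rows, cols):
    return [[0.0] * cols for _ in range(rows)]

def linear_attention(queries, keys, values, bias=None):
    """out_i = q_i^T (bias + sum_{j <= i} k_j v_j^T)."""
    d_k, d_v = len(keys[0]), len(values[0])
    state = [row[:] for row in bias] if bias is not None else zeros(d_k, d_v)
    outputs = []
    for q, k, v in zip(queries, keys, values):
        state = mat_add(state, outer(k, v))  # accumulate k_j v_j^T
        outputs.append(vec_mat(q, state))
    return outputs

def context_bias(ctx_keys, ctx_values):
    """Sum of k_j v_j^T over the prompt tokens; plays the role of the bias."""
    bias = zeros(len(ctx_keys[0]), len(ctx_values[0]))
    for k, v in zip(ctx_keys, ctx_values):
        bias = mat_add(bias, outer(k, v))
    return bias

# Toy prompt (context) tokens and input tokens, given directly as vectors.
ctx_k = [[1.0, 0.0], [0.5, 2.0]]
ctx_v = [[2.0, 1.0], [0.0, 3.0]]
ctx_q = [[0.0, 0.0], [0.0, 0.0]]  # prompt queries do not affect input outputs
inp_q = [[1.0, 1.0], [0.0, 2.0]]
inp_k = [[1.0, 1.0], [2.0, 0.0]]
inp_v = [[1.0, 0.0], [0.0, 1.0]]

# Outputs at the input positions when the prompt is prepended ...
with_prompt = linear_attention(ctx_q + inp_q, ctx_k + inp_k, ctx_v + inp_v)[len(ctx_k):]
# ... versus outputs with no prompt but with the bias folded into the state.
with_bias = linear_attention(inp_q, inp_k, inp_v, bias=context_bias(ctx_k, ctx_v))

print(with_prompt == with_bias)
```

In this toy setting the two computations agree exactly because the prompt enters only through the accumulated key-value state. Multi-layer models, normalization, and nonlinear feature maps need more care than this sketch covers.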

Download [pdf].

Towards understanding how transformer perform multi-step reasoning with matching operation

Zhiwei Wang, Yunji Wang, Zhongwang Zhang, Zhangchen Zhou, Hui Jin, Tianyang Hu, Jiacheng Sun, Zhenguo Li, Yaoyu Zhang, Zhi-Qin John Xu, 2025.

Submitted to The Forty-second International Conference on Machine Learning (ICML 2025).

We propose a buffer mechanism and find evidence that language models employ such a mechanism during the reasoning process. We also propose a method to enhance the model's reasoning capability, significantly improving data utilization efficiency on logical reasoning datasets.

Download [pdf].

teaching

2020 Fall: PIC 10B Intermediate Programming

Undergraduate course, UCLA, 2020

Abstract data types and their implementation using the C++ class mechanism; dynamic data structures, including linked lists, stacks, queues, trees, and hash tables; applications; object-oriented programming and software reuse; recursion; algorithms for sorting and searching.

2021 Fall: PIC 16A Python with Applications I

Undergraduate course, UCLA, 2021

Core Python language constructs, applications, text processing, data visualization, interaction with spreadsheets and machine learning.