2024
The following papers are accepted to NeurIPS 2024:
- How Do Large Language Models Acquire Factual Knowledge During Pretraining?
- Aligning to Thousands of Preferences via System Message Generalization
The following papers are accepted to EMNLP 2024:
- Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
- Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization
- Exploring the Practicality of Generative Retrieval on Dynamic Corpora
- On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning
- Rethinking the Role of Proxy Rewards in Language Model Alignment
- Instruction Matters: A Simple yet Effective Task Selection Approach in Instruction Tuning for Specific Tasks
The following paper is accepted to EMNLP 2024 Findings:
- Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards
The following papers are accepted to ACL 2024:
- Semiparametric Token-Sequence Co-Supervision
- LangBridge: Multilingual Reasoning Without Multilingual Supervision
- Aligning Large Language Models by On-Policy Self-Judgment
- Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?
- ListT5: Listwise Reranking with Fusion-in-Decoder Improves Zero-shot Retrieval
The following paper is accepted to ACL 2024 Findings:
- Prometheus-Vision: Vision-Language Model as a Judge for Fine-Grained Evaluation
The following papers are accepted to NAACL 2024:
- REPLUG: Retrieval-Augmented Black-Box Language Models
- Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision
- KTRL+F: Knowledge-Augmented In-Document Search
- How Well Do Large Language Models Truly Ground?
- Carpe diem: On the Evaluation of World Knowledge in Lifelong Language Models
We are welcoming Jinho Park (MS), Juyoung Suk (MS), Hyeonbin Hwang (MS), and Seongyun Lee (MS). We are also welcoming the MS → PhD conversion of Hoyeon Chang.
Hanseok Oh (MS) has graduated.
Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis by Sohee Yang et al. is accepted to TACL 2024. [code]
The following papers are accepted to ICLR 2024:
- (Spotlight) FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets
- Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
- SuRe: Improving Open-domain Question Answering of LLMs via Summarized Retrieval
2023
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following by Seonghyeon Ye et al. is accepted to AAAI 2024. [code]
Glad to share that Seonghyeon Ye has received the Qualcomm Innovation Fellowship Korea 2023 Award! [link]
The following papers are accepted to EMNLP 2023:
- The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
- Aligning Large Language Models through Synthetic Feedback
- Tree of Clarifications: Answering Ambiguous Questions with Retrieval-Augmented Large Language Models
- Cream: Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models
The following paper is accepted to EMNLP 2023 Findings:
- Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt
The following paper is accepted to the EMNLP 2023 Industry Track:
- An Integrated Search System for Korea Weather Data
A Bayesian Perspective On Training Data Attribution by Elisa Nguyen et al. is accepted to NeurIPS 2023. [code]
We are welcoming Geewook Kim (PhD), Dongkeun Yoon (MS+PhD), and Suehyun Park (MS).
Joel Jang (MS), Soyoung Yoon (MS), Yongrae Jo (MS), and Eunbi Choi (MS) have graduated.
The following papers are accepted to ACL 2023:
- Knowledge Unlearning for Mitigating Privacy Risks in Language Models
- Towards Standardizing Korean Grammatical Error Correction: Datasets and Annotation
- Gradient Ascent Post-training Enhances Language Model Generalization
The following papers are accepted to ACL 2023 Findings:
- Nonparametric Decoding for Generative Retrieval
- Fixed Input Parameterization for Efficient Prompting
- Two Examples are Better than One: Context Regularization for Gradient-based Prompt Tuning
- Comparing and Contrasting Claims on Contentious Issues
Exploring the Benefits of Training Expert Language Models over Instruction Tuning by Joel Jang et al. is accepted to ICML 2023. [code]
We are welcoming Doyoung Kim (MS), Seungone Kim (MS), and Jiyeon Kim (MS). We are also welcoming the MS → MS+PhD conversion of Seonghyeon Ye and the MS → PhD conversion of Hyunji Lee.
Sohee Yang is joining UCL as a PhD student and DeepMind as a Research Scientist Intern.
Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners by Seonghyeon Ye et al. is accepted to ICLR 2023. [code] [demo]
2022
Glad to share that Joel Jang has received the Qualcomm Innovation Fellowship Korea 2022 Award! [link]
Minjoon will give a talk at Samsung AI Forum 2022 on the topic of Generative Retrieval. [news]
The following paper is accepted to the NeurIPS 2022 Workshop on Transfer Learning for NLP:
- Can Large Language Models Truly Understand Prompts? A Case Study with Negated Prompts
The following papers are accepted to EMNLP 2022:
- TemporalWiki: A Lifelong Benchmark for Training and Evaluating Ever-Evolving Language Models
- Generative Multi-hop Retrieval
- Saving Dense Retriever from Shortcut Dependency in Conversational Search
- Generating Information-Seeking Conversations from Unlabeled Documents
The following paper is accepted to EMNLP 2022 Findings:
- Keep Me Updated! Memory Management in Long-term Conversations
The following papers are accepted to the NeurIPS 2022 Datasets and Benchmarks Track:
- A Multi-Task Benchmark for Korean Legal Language Understanding and Judgement Prediction
- EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records
Please see here for instructions. The deadline is 2022.09.12.
We are teaching a new AI+X course, “AI for Law”, in the Fall 2022 semester. It is featured in KAIST News. [link]
We are welcoming Hoyeon Chang (MS+PhD), Sungdong Kim (MS+PhD), and Hyowon Cho (MS).
We are welcoming Haebin Shin (MS, Samsung Research) and Seonghyeon Ye (MS). We are also welcoming the MS → MS+PhD conversion of Joel Jang.
Towards Continual Knowledge Learning of Language Models by Joel Jang et al. is accepted to ICLR 2022.
2021
KAIST LK Lab (Hanseok Oh and Minjoon Seo) and Twelve Labs (Aiden Lee) have won the VALUE Challenge Retrieval Track at ICCV 2021. The results are published at the ICCV 2021 CLVL Workshop: ViSeRet: A simple yet effective approach to moment retrieval via fine-grained video segmentation.
We are welcoming Hanseok Oh (MS) and Yongrae Jo (MS)!
Label Embedding for Chinese Grapheme-to-Phoneme Conversion by Eunbi Choi et al. is accepted to Interspeech 2021.
The following papers are accepted to ACL 2021 Findings:
- Spatial Dependency Parsing for Semi-Structured Document Information Extraction by Hwang et al. (including Sohee Yang and Minjoon Seo)
- SSMix: Saliency-based Span Mixup for Text Classification by Soyoung Yoon et al.
Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering by Sohee Yang and Minjoon Seo is accepted to NAACL 2021 as a short paper.
We are welcoming seven starting members of the lab: Sohee Yang (MS+PhD), Miyoung Ko (PhD), Soyoung Yoon (MS), Joel Jang (MS), Jinkyung Jo (PhD), Eunbi Choi (MS), and Hyunji Lee (MS).