Out-of-Distribution Generalization via Composition: A Lens Through Induction Heads in Transformers

Study on Out-of-Distribution Generalization and Composition Mechanisms in Large Language Models. Paper Background: In recent years, large language models (LLMs) such as GPT-4 have demonstrated remarkable creativity in handling novel tasks, often solving problems with just a few examples. These tasks require models to generalize on distributions diffe...

Efficient Scaling of Large Language Models with Mixture of Experts and 3D Analog In-Memory Computing

Academic Background: In recent years, large language models (LLMs) have demonstrated remarkable capabilities in natural language processing, text generation, and other fields. However, as the scale of these models continues to grow, the costs of trai...

Silver Lining in the Fake News Cloud: Can Large Language Models Help Detect Misinformation?

Can Large Language Models Tackle Misinformation? In-Depth Research on LLMs. In today’s digital era of rapid information dissemination, the spread of misinformation and fake news has become a significant societal challenge. The widespread use of the internet and social media has dramatically lowered the barriers to information sharing, enabling any...

Empowering PET Imaging Reporting with Retrieval-Augmented Language Models and Reading Reports Database: A Pilot Study

The Application of Large Language Models in PET Imaging Reports: A Single-Center Pilot Study Combining Retrieval-Augmented Generation. With the rapid development of artificial intelligence, large language models (LLMs) have gained widespread attention for their zero-shot learning and natural language processing capabilities in the medical domain. Al...

Unifying Large Language Models and Knowledge Graphs: A Roadmap

Unified Large Language Models and Knowledge Graphs. Background: In recent years, numerous research achievements have emerged in the fields of natural language processing and artificial intelligence. Notably, large language models (LLMs) such as ChatGPT and GPT-4 have demonstrated remarkable performance. However, despite their excellent generalization...

Large Language Models to Identify Social Determinants of Health in Electronic Health Records

Using Large Language Models to Identify Social Determinants of Health from Electronic Health Records. Background and Research Motivation: Social Determinants of Health (SDOH) have a significant impact on patient health outcomes. However, these factors are often incompletely recorded or missing in the structured data of Electronic Health Records (EHR)...

Exploring the Psychology of LLMs' Moral and Legal Reasoning

Current Situation: Nowadays, large language models (LLMs) have demonstrated expert-level performance in multiple fields, which has sparked great interest in understanding their internal reasoning processes. Comprehending how LLMs generate these remarkable results is crucial for the future development of artificial intelligence agents and ensuring th...