Top Research Papers on LLMs
Delve into the most influential research papers on LLMs and uncover key insights into language learning models. Our handpicked selection provides a comprehensive overview, making it easy for researchers and enthusiasts to stay updated with the latest advancements in the field. Whether you're a beginner or an expert, these papers will provide valuable knowledge and inspire new ideas.
Looking for research-backed answers?Try AI Search
Large Language ModelsIn the latest edition of Stats, STAT!, Fralick and colleagues explain the statistics behind large language models - used in chat bots like ChatGPT and Bard. While these new tools may seem remarkably intelligent, at their core they just assemble sentences based on statistics from large amounts of text.
Author Stephan Raaijmakers provides a comprehensive introduction to Large Language Models, describing what exactly they are capable of from a technical and creative standpoint and offering a more grounded approach to how this groundbreaking—and increasingly ubiquitous—form of artificial intelligence will shape the authors' society for years to come.
This essay argues that such failures—so-called hallucinations—are not accidental glitches but are instead a by-product of the design of the transformer architecture on which large language models are built, given its foundation on the distributional hypothesis, a nonreferential approach to meaning.
Large Language Model (LLM) as a System of Multiple Expert Agents: An Approach to solve the Abstraction and Reasoning Corpus (ARC) Challenge
13 Citations 2023J. Tan, M. Motani
ArXiv
This work firstly converts the input image into multiple suitable text-based abstraction spaces, then utilises the associative power of LLMs to derive the input-output relationship and map this to actions in the form of a working program, similar to Voyager / Ghost in the MineCraft.
Large language models (LLMs) for inferring genomic characteristics and facilitating genomic literacy in prostate cancer (PCa) patients.
No citations 2025S. Naqvi, Umair Ayub, Muhammad Ali Khan + 9 more
Journal of Clinical Oncology
It is suggested that large language models can effectively extract genomic characteristics from unstructured reports and can potentially improve genomic literacy among prostate cancer patients by providing easy-to-interpret variant summaries, however, targeted prompting may be required to increase lexical diversity for improved engagement.
Large Language Models (LLMs): Representation Matters, Low-Resource Languages and Multi-Modal Architecture
11 Citations 2023G. Mani, Galane Basha Namomsa
2023 IEEE AFRICON
It is argued that the importance of representation as well as multi-modality are likely key to making the new generation of systems more powerful, usable, accessible and utile for all.
Large Language Model Programs
16 Citations 2023Imanol Schlag, Sainbayar Sukhbaatar, Asli Celikyilmaz + 4 more
ArXiv
This work presents a method which further expands the capabilities of an LLM by embedding it within an algorithm or program, and presents an illustrative example of evidence-supported question-answering.
It is implausible to think that LLMs possess communicative desires and perform speech acts, yet it is implausible not to think that LLMs possess communicative desires and perform speech acts.
Interacting with a contemporary LLM-based conversational agent can create an illusion of being in the presence of a thinking creature, yet such systems are fundamentally not like us.
Large Language Model (LLM) for Telecommunications: A Comprehensive Survey on Principles, Key Techniques, and Opportunities
158 Citations 2024Hao Zhou, Chengming Hu, Ye Yuan + 11 more
IEEE Communications Surveys & Tutorials
This work presents LLM fundamentals, including model architecture, pre-training, fine-tuning, inference and utilization, model evaluation, and telecom deployment, and introduces LLM-enabled key techniques and telecom applications in terms of generation, classification, optimization, and prediction problems.
Large language models (LLMs) as jurors: Assessing the potential of LLMs in legal contexts.
No citations 2025Yongjie Sun, Angelo Zappalà, Eleonora Di Maso + 3 more
Law and human behavior
OBJECTIVE We explored the potential of large language models (LLMs) in legal decision making by replicating Fraser et al. (2023) mock jury experiment using LLMs (GPT-4o, Claude 3.5 Sonnet, and GPT-o1) as decision makers. We investigated LLMs' reactions to factors that influenced human jurors, including defendant race, social status, number of allegations, and reporting delay in sexual assault cases. HYPOTHESES We hypothesized that LLMs would show higher consistency than humans, with no explicit but potential implicit biases. We also examined potential mediating factors (race-crime congruence...
LLaSM: Large Language and Speech Model
59 Citations 2023Yu Shu, Siwei Dong, Guangyao Chen + 5 more
ArXiv
LLaSM is an end-to-end trained large multi-modal speech-language model with cross- modal conversational abilities, capable of following speech-and-language instructions.
Large language models (LLMs) in radiology exams for medical students: Performance and consequences
4 Citations 2024Jennifer Gotta, Quang Anh Le Hong, V. Koch + 16 more
RöFo - Fortschritte auf dem Gebiet der Röntgenstrahlen und der bildgebenden Verfahren
This study highlights the potential of LLMs as accessible knowledge resources for medical students, with GPT-4 exhibits significantly improved performance compared to its predecessors GPT-3.5 and Perplexity AI, with GPT-4 particularly excelling in higher-order questions.
Large Language Models for Telecom
4 Citations 2023M. Debbah
2023 Eighth International Conference on Fog and Mobile Edge Computing (FMEC)
This talk will discuss the recent progress on LLM features and the potential of LLM in enabling intelligent wireless communication systems.
Large Language Models (LLMs) in Engineering Education: A Systematic Review and Suggestions for Practical Adoption
29 Citations 2024S. Filippi, Barbara Motyl
Inf.
The use of large language models (LLMs) is now spreading in several areas of research and development. This work is concerned with systematically reviewing LLMs’ involvement in engineering education. Starting from a general research question, two queries were used to select 370 papers from the literature. Filtering them through several inclusion/exclusion criteria led to the selection of 20 papers. These were investigated based on eight dimensions to identify areas of engineering disciplines that involve LLMs, where they are most present, how this involvement takes place, and which LLM-based t...
ChatGPT and large language models (LLMs) awareness and use. A prospective cross-sectional survey of U.S. medical students
22 Citations 2024Conner Ganjavi, M. Eppler, Devon O'Brien + 11 more
PLOS Digital Health
An electronic survey for students across North American medical colleges to gauge their views on and current use of ChatGPT and similar technologies found that 96% of respondents had heard of ChatGPT and 52% had used it for medical school coursework.
Large Language Models (LLMs) and Generative AI in Cybersecurity and Privacy: A Survey of Dual-Use Risks, AI-Generated Malware, Explainability, and Defensive Strategies
2 Citations 2025Kiarash Ahi, Saeed Valizadeh
2025 Silicon Valley Cybersecurity Conference (SVCC)
This paper presents a comprehensive survey of the beneficial and malicious applications of LLMs in cybersecurity, including zero-day detection, DevSecOps, federated learning, synthetic content analysis, and explainable AI (XAI), and practical recommendations for responsible and transparent LLM deployment.
Transformative potential of Large Language Models (LLMs) in data mining on Electronic Health Records.
No citations 2024Amadeo Wals Zurita, Héctor Miras del Rio, Nerea Ugarte Ruiz de Aguirre + 4 more
journal unavailable
The LLMs studied show competence comparable to that of medical specialists in the interpretation of clinical reports, even in complex and confusingly worded texts, and represent a preferred option over human analysis for data mining and structuring information in extensive sets of clinical reports.
Large Language Models (LLMs) Inference Offloading and Resource Allocation in Cloud-Edge Computing: An Active Inference Approach
52 Citations 2024Ying He, Jingcheng Fang, F. Yu + 1 more
IEEE Transactions on Mobile Computing
This paper proposes a novel approach based on active inference for LLMs inference task offloading and resource allocation in cloud-edge computing that has superior performance over mainstream DRLs, improves in data utilization efficiency, and is more adaptable to changing task load scenarios.
Reprogramming Foundational Large Language Models(LLMs) for Enterprise Adoption for Spatio-Temporal Forecasting Applications: Unveiling a New Era in Copilot-Guided Cross-Modal Time Series Representation Learning
1 Citations 2024Sakhinana Sagar Srinivas, Chidaksh Ravuru, Geethan Sannidhi + 1 more
ArXiv
A hybrid approach that combines the strengths of open-source large and small-scale language models (LLMs and LMs) with traditional forecasting methods, outperforming existing methods by significant margins in terms of forecast accuracy.