mayank1729 's Collections Papers
updated
How Do Large Language Models Acquire Factual Knowledge During
Pretraining?
Paper
• 2406.11813
• Published
• 31
From RAGs to rich parameters: Probing how language models utilize
external knowledge over parametric information for factual queries
Paper
• 2406.12824
• Published
• 21
Tokenization Falling Short: The Curse of Tokenization
Paper
• 2406.11687
• Published
• 16
Iterative Length-Regularized Direct Preference Optimization: A Case
Study on Improving 7B Language Models to GPT-4 Level
Paper
• 2406.11817
• Published
• 13
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens
Grounding
Paper
• 2406.19263
• Published
• 10
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented
Generation
Paper
• 2406.19215
• Published
• 32
LiteSearch: Efficacious Tree Search for LLM
Paper
• 2407.00320
• Published
• 40
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
• 2406.20094
• Published
• 104
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via
Dynamic Sparse Attention
Paper
• 2407.02490
• Published
• 26
Is It Really Long Context if All You Need Is Retrieval? Towards
Genuinely Difficult Long Context NLP
Paper
• 2407.00402
• Published
• 22
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language
Models by Learning from Knowledge Graphs
Paper
• 2407.00653
• Published
• 13
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on
Edge
Paper
• 2407.00088
• Published
• 12
Show Less, Instruct More: Enriching Prompts with Definitions and
Guidelines for Zero-Shot NER
Paper
• 2407.01272
• Published
• 8
To Forget or Not? Towards Practical Knowledge Unlearning for Large
Language Models
Paper
• 2407.01920
• Published
• 17
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
• 2407.01489
• Published
• 65
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Paper
• 2407.01370
• Published
• 89
How do you know that? Teaching Generative Language Models to Reference
Answers to Biomedical Questions
Paper
• 2407.05015
• Published
• 4
SEED-Story: Multimodal Long Story Generation with Large Language Model
Paper
• 2407.08683
• Published
• 24
Inference Performance Optimization for Large Language Models on CPUs
Paper
• 2407.07304
• Published
• 53
Case2Code: Learning Inductive Reasoning with Synthetic Data
Paper
• 2407.12504
• Published
• 8
Gemma 2: Improving Open Language Models at a Practical Size
Paper
• 2408.00118
• Published
• 78
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Paper
• 2408.15545
• Published
• 38