Papers - a mayank1729 Collection

mayank1729 's Collections

Papers

updated Sep 4, 2024

How Do Large Language Models Acquire Factual Knowledge During Pretraining?

Paper • 2406.11813 • Published Jun 17, 2024 • 31
From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

Paper • 2406.12824 • Published Jun 18, 2024 • 21
Tokenization Falling Short: The Curse of Tokenization

Paper • 2406.11687 • Published Jun 17, 2024 • 16
Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level

Paper • 2406.11817 • Published Jun 17, 2024 • 13
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding

Paper • 2406.19263 • Published Jun 27, 2024 • 10
SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented Generation

Paper • 2406.19215 • Published Jun 27, 2024 • 32
LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29, 2024 • 40
Scaling Synthetic Data Creation with 1,000,000,000 Personas

Paper • 2406.20094 • Published Jun 28, 2024 • 104
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

Paper • 2407.02490 • Published Jul 2, 2024 • 26
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Paper • 2407.00402 • Published Jun 29, 2024 • 22
Chain-of-Knowledge: Integrating Knowledge Reasoning into Large Language Models by Learning from Knowledge Graphs

Paper • 2407.00653 • Published Jun 30, 2024 • 13
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge

Paper • 2407.00088 • Published Jun 25, 2024 • 12
Show Less, Instruct More: Enriching Prompts with Definitions and Guidelines for Zero-Shot NER

Paper • 2407.01272 • Published Jul 1, 2024 • 8
To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models

Paper • 2407.01920 • Published Jul 2, 2024 • 17
Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 65
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Paper • 2407.01370 • Published Jul 1, 2024 • 89
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Paper • 2407.05015 • Published Jul 6, 2024 • 4
SEED-Story: Multimodal Long Story Generation with Large Language Model

Paper • 2407.08683 • Published Jul 11, 2024 • 24
Inference Performance Optimization for Large Language Models on CPUs

Paper • 2407.07304 • Published Jul 10, 2024 • 53
Case2Code: Learning Inductive Reasoning with Synthetic Data

Paper • 2407.12504 • Published Jul 17, 2024 • 8
Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31, 2024 • 78
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding

Paper • 2408.15545 • Published Aug 28, 2024 • 38