AI Papers
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper
• 2310.09263
• Published
• 40
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper
• 2310.08740
• Published
• 15
The Consensus Game: Language Model Generation via Equilibrium Search
Paper
• 2310.09139
• Published
• 14
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper
• 2310.09199
• Published
• 28
CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules
Paper
• 2310.08992
• Published
• 12
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation
Paper
• 2312.14187
• Published
• 49
Reasons to Reject? Aligning Language Models with Judgments
Paper
• 2312.14591
• Published
• 18
Exploiting Novel GPT-4 APIs
Paper
• 2312.14302
• Published
• 14
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Paper
• 2312.14233
• Published
• 16
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation
Paper
• 2312.14385
• Published
• 7
Shai: A large language model for asset management
Paper
• 2312.14203
• Published
• 6
Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Paper
• 2312.14878
• Published
• 15
DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
Paper
• 2312.13578
• Published
• 29
Generative Multimodal Models are In-Context Learners
Paper
• 2312.13286
• Published
• 36
Mini-GPTs: Efficient Large Language Models through Contextual Pruning
Paper
• 2312.12682
• Published
• 9
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper
• 2312.11514
• Published
• 260
Topic-VQ-VAE: Leveraging Latent Codebooks for Flexible Topic-Guided Document Generation
Paper
• 2312.11532
• Published
• 6
ProTIP: Progressive Tool Retrieval Improves Planning
Paper
• 2312.10332
• Published
• 8
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
Paper
• 2312.10003
• Published
• 44
Self-Evaluation Improves Selective Generation in Large Language Models
Paper
• 2312.09300
• Published
• 16
Extending Context Window of Large Language Models via Semantic Compression
Paper
• 2312.09571
• Published
• 16
Challenges with unsupervised LLM knowledge discovery
Paper
• 2312.10029
• Published
• 10
Faithful Persona-based Conversational Dataset Generation with Large Language Models
Paper
• 2312.10007
• Published
• 11
Perspectives on the State and Future of Deep Learning - 2023
Paper
• 2312.09323
• Published
• 8
Zebra: Extending Context Window with Layerwise Grouped Local-Global Attention
Paper
• 2312.08618
• Published
• 13
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
Paper
• 2306.08568
• Published
• 33
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Paper
• 2308.09583
• Published
• 7
Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM
Paper
• 2401.02994
• Published
• 52
MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts
Paper
• 2401.04081
• Published
• 74
CogAgent: A Visual Language Model for GUI Agents
Paper
• 2312.08914
• Published
• 31
ORPO: Monolithic Preference Optimization without Reference Model
Paper
• 2403.07691
• Published
• 72
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Paper
• 2404.07972
• Published
• 51
JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Paper
• 2404.07413
• Published
• 38
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training
Paper
• 2405.06932
• Published
• 20