Collections including paper arxiv:1706.03762

- Attention Is All You Need
  Paper • 1706.03762 • Published • 77
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 19
- RoBERTa: A Robustly Optimized BERT Pretraining Approach
  Paper • 1907.11692 • Published • 9
- DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
  Paper • 1910.01108 • Published • 17

- ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
  Paper • 2312.10003 • Published • 44
- Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
  Paper • 2005.11401 • Published • 11
- Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
  Paper • 2201.11903 • Published • 14
- Attention Is All You Need
  Paper • 1706.03762 • Published • 77

- Agents: An Open-source Framework for Autonomous Language Agents
  Paper • 2309.07870 • Published • 42
- Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
  Paper • 2310.18940 • Published
- Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems
  Paper • 2504.01990 • Published • 300
- AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent
  Paper • 2404.03648 • Published • 29

- Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning
  Paper • 2211.04325 • Published
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
  Paper • 1810.04805 • Published • 19
- On the Opportunities and Risks of Foundation Models
  Paper • 2108.07258 • Published • 1
- Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
  Paper • 2204.07705 • Published • 2