Spaces:

Magpie-Align
/

README

Running

App Files Files Community

README / README.md

Zhangchen Xu

Update README.md

2da6fa8 verified 10 months ago

preview code

raw

history blame

7.69 kB

	---
	title: README
	emoji: 🐦
	colorFrom: pink
	colorTo: indigo
	sdk: static
	pinned: false
	---

	Hi, I am a magpie 🐦!

	🕸️ Project Website: [https://magpie-align.github.io/](https://magpie-align.github.io/)

	📄 Technical Report: [https://arxiv.org/abs/2406.08464](https://arxiv.org/abs/2406.08464)

	🤗 HF Paper Page: [https://huggingface.co/papers/2406.08464](https://huggingface.co/papers/2406.08464)

	😬 Codes: [https://github.com/magpie-align/magpie](https://github.com/magpie-align/magpie)

	🤗 Magpie Demo: [https://huggingface.co/spaces/davanstrien/magpie](https://huggingface.co/spaces/davanstrien/magpie) (Thanks a lot for the implementation from @davanstrien!)

	🐦 Chat with Magpie: [https://huggingface.co/spaces/flydust/Chat-with-Magpie](https://huggingface.co/spaces/flydust/Chat-with-Magpie)

	Questions? Please contact [Zhangchen](mailto:[email protected]) by email or raise an issue in [Github](https://github.com/magpie-align/magpie/issues/new/choose).

	## Dataset Navigation 🧭
	### [Meta Llama 3](https://huggingface.co/collections/meta-llama/meta-llama-3-66214712577ca38149ebb2b6)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Llama 3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) \| [Magpie-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Llama-3-Magpie-Pro-1M-v0.1) \| SFT \| 1M Raw conversations built with Meta Llama 3 70B.
	\| [Llama 3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) \| [Magpie-Pro-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Pro-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.
	\| [Llama 3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) \| [Magpie-Pro-MT-300K](https://huggingface.co/datasets/Magpie-Align/Magpie-Pro-MT-300K-v0.1) \| SFT \| Select 300K difficult questions and extend to multi-turn conversations.
	\| [Llama 3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) \| [Magpie-Pro-DPO-100K](https://huggingface.co/datasets/Magpie-Align/Magpie-Pro-DPO-100K-v0.1) \| DPO \| DPO dataset via Best-of-N sampling and rewards.
	\| [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) \| [Magpie-Air-3M](https://huggingface.co/datasets/Magpie-Align/Llama-3-Magpie-Air-3M-v0.1) \| SFT \| 3M Raw conversations built with Meta Llama 3 8B.
	\| [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) \| [Magpie-Air-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Air-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality data.
	\| [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) \| [Magpie-Air-MT-300K](https://huggingface.co/datasets/Magpie-Align/Magpie-Air-MT-300K-v0.1) \| SFT \| Select 300K difficult questions and extend to multi-turn conversations.
	\| [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) \| [Magpie-Air-DPO-100K](https://huggingface.co/datasets/Magpie-Align/Magpie-Air-DPO-100K-v0.1) \| DPO \| DPO dataset via Best-of-N sampling and rewards.

	### [Meta Llama 3.1](https://huggingface.co/collections/meta-llama/llama-31-669fc079a0c406a149a5738f)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-1M-v0.1) \| SFT \| 1M Raw conversations built with Meta Llama 3.1 70B.
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-500K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-500K-Filtered) \| SFT \| Apply a filter and select 500K high quality conversations.
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-MT-500K](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-MT-500K-Filtered) \| SFT \| Select 500K difficult questions and extend to multi-turn conversations.
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-DPO-100K](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-DPO-100K-v0.1) \| SFT \| DPO dataset via Best-of-N sampling and rewards.

	### [Qwen2](https://huggingface.co/collections/Qwen/qwen2-6659360b33528ced941e557f)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Qwen2 72B Instruct](https://huggingface.co/Qwen/Qwen2-72B-Instruct) \| [Magpie-Qwen2-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Pro-1M-v0.1) \| SFT \| 1M Raw conversations built with Qwen2 72B Instruct.
	\| [Qwen2 72B Instruct](https://huggingface.co/Qwen/Qwen2-72B-Instruct) \| [Magpie-Qwen2-Pro-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Pro-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.
	\| [Qwen2 72B Instruct](https://huggingface.co/Qwen/Qwen2-72B-Instruct) \| [Magpie-Qwen2-Pro-200K-Chinese](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Pro-200K-Chinese) \| SFT \| Apply a filter and select 200K high quality Chinese conversations.
	\| [Qwen2 72B Instruct](https://huggingface.co/Qwen/Qwen2-72B-Instruct) \| [Magpie-Qwen2-Pro-200K-English](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Pro-200K-English) \| SFT \| Apply a filter and select 200K high quality English conversations.
	\| [Qwen2 7B Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct) \| [Magpie-Qwen2-Air-3M](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Air-3M-v0.1) \| SFT \| 3M Raw conversations built with Qwen2 7B Instruct.
	\| [Qwen2 7B Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct) \| [Magpie-Qwen2-Air-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen-Air-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.

	### [Phi-3](https://huggingface.co/collections/microsoft/phi-3-6626e15e9585a200d2d761e3)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Phi-3 Medium Instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) \| [Magpie-Phi3-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Phi3-Pro-1M-v0.1) \| SFT \| 1M Raw conversations built with Phi-3 Medium Instruct.
	\| [Phi-3 Medium Instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) \| [Magpie-Phi3-Pro-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Phi3-Pro-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.

	### [Gemma-2](https://huggingface.co/collections/google/gemma-2-release-667d6600fd5220e7b967f315)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Gemma-2-27b-it](https://huggingface.co/google/gemma-2-27b-it) \| [Magpie-Gemma2-Pro-534K](https://huggingface.co/datasets/Magpie-Align/Magpie-Gemma2-Pro-534K-v0.1) \| SFT \| 534K conversations built with Gemma-2-27b-it.
	\| [Gemma-2-27b-it](https://huggingface.co/google/gemma-2-27b-it) \| [Magpie-Gemma2-Pro-200K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Gemma2-Pro-200K-Filtered) \| SFT \| Apply a filter and select 200K conversations.

	---
	title: README
	emoji: 🐦
	colorFrom: pink
	colorTo: indigo
	sdk: static
	pinned: false
	---

	Hi, I am a magpie 🐦!

	🕸️ Project Website: [https://magpie-align.github.io/](https://magpie-align.github.io/)

	📄 Technical Report: [https://arxiv.org/abs/2406.08464](https://arxiv.org/abs/2406.08464)

	🤗 HF Paper Page: [https://huggingface.co/papers/2406.08464](https://huggingface.co/papers/2406.08464)

	😬 Codes: [https://github.com/magpie-align/magpie](https://github.com/magpie-align/magpie)

	🤗 Magpie Demo: [https://huggingface.co/spaces/davanstrien/magpie](https://huggingface.co/spaces/davanstrien/magpie) (Thanks a lot for the implementation from @davanstrien!)

	🐦 Chat with Magpie: [https://huggingface.co/spaces/flydust/Chat-with-Magpie](https://huggingface.co/spaces/flydust/Chat-with-Magpie)

	Questions? Please contact [Zhangchen](mailto:[email protected]) by email or raise an issue in [Github](https://github.com/magpie-align/magpie/issues/new/choose).

	## Dataset Navigation 🧭
	### [Meta Llama 3](https://huggingface.co/collections/meta-llama/meta-llama-3-66214712577ca38149ebb2b6)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Llama 3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) \| [Magpie-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Llama-3-Magpie-Pro-1M-v0.1) \| SFT \| 1M Raw conversations built with Meta Llama 3 70B.
	\| [Llama 3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) \| [Magpie-Pro-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Pro-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.
	\| [Llama 3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) \| [Magpie-Pro-MT-300K](https://huggingface.co/datasets/Magpie-Align/Magpie-Pro-MT-300K-v0.1) \| SFT \| Select 300K difficult questions and extend to multi-turn conversations.
	\| [Llama 3 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct) \| [Magpie-Pro-DPO-100K](https://huggingface.co/datasets/Magpie-Align/Magpie-Pro-DPO-100K-v0.1) \| DPO \| DPO dataset via Best-of-N sampling and rewards.
	\| [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) \| [Magpie-Air-3M](https://huggingface.co/datasets/Magpie-Align/Llama-3-Magpie-Air-3M-v0.1) \| SFT \| 3M Raw conversations built with Meta Llama 3 8B.
	\| [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) \| [Magpie-Air-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Air-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality data.
	\| [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) \| [Magpie-Air-MT-300K](https://huggingface.co/datasets/Magpie-Align/Magpie-Air-MT-300K-v0.1) \| SFT \| Select 300K difficult questions and extend to multi-turn conversations.
	\| [Llama 3 8B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) \| [Magpie-Air-DPO-100K](https://huggingface.co/datasets/Magpie-Align/Magpie-Air-DPO-100K-v0.1) \| DPO \| DPO dataset via Best-of-N sampling and rewards.

	### [Meta Llama 3.1](https://huggingface.co/collections/meta-llama/llama-31-669fc079a0c406a149a5738f)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-1M-v0.1) \| SFT \| 1M Raw conversations built with Meta Llama 3.1 70B.
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-500K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-500K-Filtered) \| SFT \| Apply a filter and select 500K high quality conversations.
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-MT-500K](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-MT-500K-Filtered) \| SFT \| Select 500K difficult questions and extend to multi-turn conversations.
	\| [Llama 3.1 70B Instruct](https://huggingface.co/meta-llama/Meta-Llama-3.1-70B-Instruct) \| [Magpie-Llama-3.1-Pro-DPO-100K](https://huggingface.co/datasets/Magpie-Align/Magpie-Llama-3.1-Pro-DPO-100K-v0.1) \| SFT \| DPO dataset via Best-of-N sampling and rewards.

	### [Qwen2](https://huggingface.co/collections/Qwen/qwen2-6659360b33528ced941e557f)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Qwen2 72B Instruct](https://huggingface.co/Qwen/Qwen2-72B-Instruct) \| [Magpie-Qwen2-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Pro-1M-v0.1) \| SFT \| 1M Raw conversations built with Qwen2 72B Instruct.
	\| [Qwen2 72B Instruct](https://huggingface.co/Qwen/Qwen2-72B-Instruct) \| [Magpie-Qwen2-Pro-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Pro-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.
	\| [Qwen2 72B Instruct](https://huggingface.co/Qwen/Qwen2-72B-Instruct) \| [Magpie-Qwen2-Pro-200K-Chinese](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Pro-200K-Chinese) \| SFT \| Apply a filter and select 200K high quality Chinese conversations.
	\| [Qwen2 72B Instruct](https://huggingface.co/Qwen/Qwen2-72B-Instruct) \| [Magpie-Qwen2-Pro-200K-English](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Pro-200K-English) \| SFT \| Apply a filter and select 200K high quality English conversations.
	\| [Qwen2 7B Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct) \| [Magpie-Qwen2-Air-3M](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen2-Air-3M-v0.1) \| SFT \| 3M Raw conversations built with Qwen2 7B Instruct.
	\| [Qwen2 7B Instruct](https://huggingface.co/Qwen/Qwen2-7B-Instruct) \| [Magpie-Qwen2-Air-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Qwen-Air-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.

	### [Phi-3](https://huggingface.co/collections/microsoft/phi-3-6626e15e9585a200d2d761e3)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Phi-3 Medium Instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) \| [Magpie-Phi3-Pro-1M](https://huggingface.co/datasets/Magpie-Align/Magpie-Phi3-Pro-1M-v0.1) \| SFT \| 1M Raw conversations built with Phi-3 Medium Instruct.
	\| [Phi-3 Medium Instruct](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) \| [Magpie-Phi3-Pro-300K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Phi3-Pro-300K-Filtered) \| SFT \| Apply a filter and select 300K high quality conversations.

	### [Gemma-2](https://huggingface.co/collections/google/gemma-2-release-667d6600fd5220e7b967f315)
	\|Model Name \| Dataset \| Type \| Description \|
	\|-------------\|:-------\|:-------\|:-------\|
	\| [Gemma-2-27b-it](https://huggingface.co/google/gemma-2-27b-it) \| [Magpie-Gemma2-Pro-534K](https://huggingface.co/datasets/Magpie-Align/Magpie-Gemma2-Pro-534K-v0.1) \| SFT \| 534K conversations built with Gemma-2-27b-it.
	\| [Gemma-2-27b-it](https://huggingface.co/google/gemma-2-27b-it) \| [Magpie-Gemma2-Pro-200K-Filtered](https://huggingface.co/datasets/Magpie-Align/Magpie-Gemma2-Pro-200K-Filtered) \| SFT \| Apply a filter and select 200K conversations.