FlashInfer: Efficient and Customizable Attention Engine for LLM Inference Serving • Paper 2501.01005 • Published Jan 2, 2025
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving • Paper 2310.19102 • Published Oct 29, 2023
FLIP: A Provable Defense Framework for Backdoor Mitigation in Federated Learning • Paper 2210.12873 • Published Oct 23, 2022