feat(http): move from openai only to http frontend package 5460614 Morgan Funtowicz commited on about 1 month ago
misc(embeddings): use attention_mask to effectively compute the number of used tokens 276cf66 Morgan Funtowicz commited on May 5
misc(config): add proper way to detect if cpu may support bfloat16 a8540ed Morgan Funtowicz commited on May 5
feat(embeddings): fix invalid nested array when allocating the embedding responses cb9be17 Morgan Funtowicz commited on May 4
feat(embeddings): expose some more to Python and return corresponding embedding (with copy for now) 38fa9fc Morgan Funtowicz commited on May 3
feat(embeddings): allow the copy of vector elements (optimize later) 69894ec Morgan Funtowicz commited on May 3