Smoliakov PRO

Yehor

https://t.me/doing_something

AI & ML interests

Speech-to-Text, Text-to-Speech, Voice over Internet Protocol

Recent Activity

updated a dataset 2 days ago

Yehor/reddit_clean-uk

published a dataset 2 days ago

Yehor/reddit_clean-uk

liked a dataset 2 days ago

SophieTr/reddit_clean

Organizations

reacted to MohamedRashad's post with 👍 7 days ago

Post

1161

If anyone is interested in trying the new rednote-hilab/dots.ocr model, I made this space for you:

MohamedRashad/Dots-OCR

replied to their post 25 days ago

Added a model for the reverse direction, from Ukrainian to English: https://huggingface.co/spaces/Yehor/uk-en-translator

replied to their post 26 days ago

It now also translates images: https://huggingface.co/spaces/Yehor/vision-en-uk-translator

replied to their post 26 days ago

Now you can translate your audio as well: https://huggingface.co/spaces/Yehor/audio-en-uk-translator

posted an update 27 days ago

Post

639

A new lightweight model for machine translation from English to Ukrainian, built on the recently published LFM2 model. Use the Yehor/en-uk-translator demo to test it.

Facts:
- Fine-tuned with 40M samples (filtered by quality metric) from ~53.5M for 1.4 epochs
- 354M params
- Requires 1 GB of RAM to run with bf16
- BLEU on FLORES-200: 27.24
- Tokens per second: 229.93 (bs=1), 1664.40 (bs=10), 8392.48 (bs=64)
- License: lfm1.0

Model page: Yehor/kulyk-en-uk
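
The memory and throughput figures listed above can be sanity-checked with a little arithmetic. The sketch below is only a back-of-the-envelope check, not a measurement: it assumes 2 bytes per bf16 parameter and counts weights only (activations and runtime overhead are not included), and the function names are made up for illustration.

```python
# Back-of-the-envelope checks for the figures listed in the post.
# Assumption: bf16 stores each parameter in 2 bytes; only weights are counted.

PARAMS = 354_000_000   # parameter count from the post
BYTES_PER_BF16 = 2     # bfloat16 is a 16-bit format

# Throughput from the post: batch size -> total tokens per second.
THROUGHPUT = {1: 229.93, 10: 1664.40, 64: 8392.48}


def weight_memory_gb(params: int, bytes_per_param: int) -> float:
    """Memory for the raw weights only, in decimal gigabytes."""
    return params * bytes_per_param / 1e9


def per_sequence_rate(throughput: dict) -> dict:
    """Tokens per second that each sequence in the batch receives."""
    return {bs: tps / bs for bs, tps in throughput.items()}


def speedup_over_single(throughput: dict) -> dict:
    """Total throughput relative to batch size 1."""
    base = throughput[1]
    return {bs: tps / base for bs, tps in throughput.items()}


if __name__ == "__main__":
    print(f"weights: {weight_memory_gb(PARAMS, BYTES_PER_BF16):.3f} GB")
    for bs, rate in per_sequence_rate(THROUGHPUT).items():
        print(f"bs={bs}: {rate:.2f} tokens/s per sequence")
    for bs, factor in speedup_over_single(THROUGHPUT).items():
        print(f"bs={bs}: {factor:.2f}x total throughput vs bs=1")
```

The weights alone come to roughly 0.7 GB, which is consistent with the 1 GB figure once some headroom is allowed. Batching raises total throughput while each individual sequence gets fewer tokens per second.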

4 replies

posted an update 4 months ago

Post

852

Esoteric practices: running model inference in PHP!

Repository: https://github.com/egorsmkv/speech-to-text-using-php

posted an update 4 months ago

Post

2490

Made a working Rust program that uses the IREE runtime to run inference with the wav2vec2-bert model for Automatic Speech Recognition.

1 reply

reacted to leonardlin's post with 👍 4 months ago

Post

2678

Happy to announce the release of Shisa V2, the latest generation of our bilingual Japanese-English language models. After hundreds of ablations and months of work, we're releasing some of the strongest open Japanese models at 7B, 8B, 12B, 14B, 32B and 70B! Full announcement here https://shisa.ai/posts/shisa-v2/ or visit the Shisa V2 HF collection: shisa-ai/shisa-v2-67fc98ecaf940ad6c49f5689

replied to their post 4 months ago

Also, tested it on A100 with TensorRT:

https://colab.research.google.com/drive/1-agoo5ll-hWEecWQAtO1FM39sqavJxph?usp=sharing

The results are not clear-cut, but it works with the base_rfdetr_fp16.onnx model and gives ~10 ms/img

posted an update 4 months ago

Post

2696

I have made a Rust project that integrates the latest state-of-the-art model for object detection. It outperforms YOLO!

Check it out: https://github.com/egorsmkv/rf-detr-usls

2 replies

replied to their post 4 months ago

This program does what datasets does. When you push a dataset created by the audiofolder script, it creates Parquet data and shards it internally.

So, you can use audios-to-dataset instead if you need more speed than datasets provides.
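
To make the sharding idea concrete, here is a minimal sketch of how a list of audio files could be grouped into size-limited shards before each group is written to its own Parquet file. This is only an illustration under my own assumptions: it is not the code of datasets or audios-to-dataset, the `AudioFile`, `plan_shards` and `shard_name` names are hypothetical, and the file-name pattern just mimics the style commonly seen in Hub dataset repositories.

```python
from dataclasses import dataclass


@dataclass
class AudioFile:
    path: str
    size_bytes: int


def plan_shards(files, max_shard_bytes):
    """Greedily pack files, in order, into shards of at most max_shard_bytes.

    A file larger than the limit ends up in a shard of its own.
    """
    shards = []
    current = []
    current_bytes = 0
    for f in files:
        # Close the current shard if adding this file would exceed the limit.
        if current and current_bytes + f.size_bytes > max_shard_bytes:
            shards.append(current)
            current, current_bytes = [], 0
        current.append(f)
        current_bytes += f.size_bytes
    if current:
        shards.append(current)
    return shards


def shard_name(index, total, split="train"):
    """Hypothetical shard file name, e.g. train-00000-of-00004.parquet."""
    return f"{split}-{index:05d}-of-{total:05d}.parquet"


if __name__ == "__main__":
    sizes = [400, 400, 400, 1200, 100]
    files = [AudioFile(f"clip_{i}.wav", s) for i, s in enumerate(sizes)]
    shards = plan_shards(files, max_shard_bytes=1000)
    for i, shard in enumerate(shards):
        print(shard_name(i, len(shards)), [f.path for f in shard])
```

Writing each planned shard to disk is left out on purpose, since that part depends on the Parquet library in use.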

posted an update 4 months ago

Post

2115

Convert your audio data to Parquet/DuckDB files at blazingly fast speeds!

Repository with pre-built binaries: https://github.com/crs-org/audios-to-dataset

2 replies

replied to their post 4 months ago

My channel in Telegram: https://t.me/doing_something

posted an update 4 months ago

Post

2259

Create spectrograms using Rust!

I slightly improved a nice project that creates spectrograms, and built binaries for different platforms using cross-rs, which I mentioned earlier in my channel.

Repo: https://github.com/crs-org/sonogram

1 reply

replied to their post 4 months ago

Tested the tool on Windows, and it works as expected. It's now easier to work with audio data than before!

posted an update 4 months ago

Post

670

Added more pre-built executables to extract-audio, which I released recently.

See my previous post - https://huggingface.co/posts/Yehor/654118712490771

Repository: https://github.com/crs-org/extract-audio

1 reply

posted an update 4 months ago

Post

1948

Made a simple Python script to generate an Argilla project for audio annotation from a dataset:

https://github.com/egorsmkv/argilla-audio-annotation

1 reply

posted an update 4 months ago

Post

2054

Are you interested in different runtimes for AI models?

Check out IREE (iree.dev): it converts models to MLIR and then executes them on different platforms.

I have tested it in Rust on CPU and CUDA: https://github.com/egorsmkv/eerie-yolo11

replied to their post 5 months ago

See a demo: https://colab.research.google.com/drive/1prztEZIf8nNFUSaptY8Jv16VO8Crjnzb?usp=sharing

posted an update 5 months ago

Post

2238

Extract audio datasets with Rust at blazingly fast speeds!

With this tool you can extract audio files from a Parquet or Arrow file generated by the Hugging Face datasets library.

Repository: https://github.com/egorsmkv/extract-audio

1 reply
