AI & ML interests

We empower medical institutions with cutting-edge speech solutions, streamlining workflows and enhancing patient care.

Yehor
posted an update 29 days ago
A new lightweight model for machine translation from English to Ukrainian, built on the recently published LFM2 model. Use the demo Yehor/en-uk-translator to test it.

Facts:
- Fine-tuned for 1.4 epochs on 40M samples, selected by a quality metric from ~53.5M
- 354M params
- Requires 1 GB of RAM to run with bf16
- BLEU on FLORES-200: 27.24
- Tokens per second: 229.93 (bs=1), 1664.40 (bs=10), 8392.48 (bs=64)
- License: lfm1.0
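
As a rough sanity check on the figures above, the sketch below works out the size of the bf16 weights and how throughput scales with batch size. It assumes that bf16 uses 2 bytes per parameter, that the 1 GB figure covers the weights plus runtime overhead, and that the reported tokens per second are aggregated across the whole batch; the post does not state the last two points. The function names are hypothetical and chosen only for illustration.

```python
# Back-of-the-envelope arithmetic on the numbers listed in the post.
PARAMS = 354_000_000      # parameter count from the post
BYTES_PER_BF16 = 2        # bfloat16 stores each value in 16 bits

# Size of the weights alone, in decimal gigabytes (activations not included).
weights_gb = PARAMS * BYTES_PER_BF16 / 1e9

# Tokens per second reported in the post, keyed by batch size.
# Assumption: each figure is the total across all sequences in the batch.
throughput = {1: 229.93, 10: 1664.40, 64: 8392.48}


def speedup_over_single(tps_by_batch):
    """Throughput at each batch size relative to batch size 1."""
    base = tps_by_batch[1]
    return {bs: tps / base for bs, tps in tps_by_batch.items()}


def per_sequence_rate(tps_by_batch):
    """Tokens per second seen by one sequence (under the assumption above)."""
    return {bs: tps / bs for bs, tps in tps_by_batch.items()}


speedup = speedup_over_single(throughput)
per_seq = per_sequence_rate(throughput)

if __name__ == "__main__":
    print(f"bf16 weights: {weights_gb:.3f} GB")
    for bs in throughput:
        print(f"bs={bs}: {speedup[bs]:.2f}x total, {per_seq[bs]:.2f} tok/s per sequence")
```

Under these assumptions the weights take about 0.7 GB, which is consistent with the 1 GB RAM figure, and batching raises total throughput while each individual sequence is generated more slowly.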

Model page: Yehor/kulyk-en-uk
Yehor
posted an update 4 months ago
Made a working Rust program that uses the IREE runtime to run inference with the wav2vec2-bert model for Automatic Speech Recognition.
Yehor
posted an update 4 months ago
I have made a Rust project that integrates the latest state-of-the-art object detection model. It outperforms YOLO!

Check it out: https://github.com/egorsmkv/rf-detr-usls
Yehor
posted an update 4 months ago
Create spectrograms using Rust!

Slightly improved a nice project that creates spectrograms and built binaries for different platforms using cross-rs, which I've mentioned earlier in my channel.

Repo: https://github.com/crs-org/sonogram
Yehor
posted an update 5 months ago
Are you interested in different runtimes for AI models?

Check out IREE (iree.dev): it converts models to MLIR and then executes them on different platforms.

I have tested it in Rust on CPU and CUDA: https://github.com/egorsmkv/eerie-yolo11
Yehor
posted an update 5 months ago
Extract audio datasets with Rust at blazingly fast speeds!

With this tool you can extract audio files from a parquet or arrow file generated by the Hugging Face datasets library.

Repository: https://github.com/egorsmkv/extract-audio
Yehor
posted an update 5 months ago
Published a stable version of the Ukrainian Text-to-Speech library on GitHub and PyPI.

Features:

- Multi-speaker model: 2 female (Tetiana, Lada) + 1 male (Mykyta) voices;
- Fine-grained control over speech parameters, including duration, fundamental frequency (F0), and energy;
- High-fidelity speech generation using the RAD-TTS++ acoustic model;
- Fast vocoding using Vocos;
- Synthesizes long sentences effectively;
- Supports a sampling rate of 44.1 kHz;
- Tested on Linux environments and Windows/WSL;
- Python API (requires Python 3.9 or later);
- CUDA-enabled for GPU acceleration.

Repository: https://github.com/egorsmkv/tts_uk
Yehor
posted an update 6 months ago