nomadicsynth's picture

nomadicsynth PRO

nomadicsynth

AI & ML interests

Yes.

Recent Activity

reacted to openfree's post with 👀 2 days ago
🎙️ Voice Clone AI Podcast Generator: Create Emotionally Rich Podcasts with Your Own Voice! 🚀 Project Introduction Hello! Today we're excited to introduce an AI-powered solo podcast generator that creates high-quality voice cloning with authentic emotional expression. Transform any PDF document, web URL, or keyword into a professional podcast with just a few clicks! 📚➡️🎧 https://huggingface.co/spaces/VIDraft/Voice-Clone-Podcast ✨ Key Features 1. 🎯 Multiple Input Methods URL: Simply paste any blog or article link PDF: Upload research papers or documents directly Keyword: Enter a topic and AI searches for the latest information to create content 2. 🎭 Emotionally Expressive Voice Cloning Powered by Chatterbox TTS: 🎤 Voice Cloning: Learn and replicate your unique voice perfectly 📢 Natural intonation and emotional expression 🌊 Customizable emotion intensity with Exaggeration control ⚡ Seamless handling of long texts with automatic chunking 3. 🤖 State-of-the-Art LLM Script Generation Professional-grade English dialogue using Private-BitSix-Mistral 12 natural conversational exchanges Real-time web search integration for up-to-date information Fully editable generated scripts! ✏️ 💡 Use Cases 📖 Educational Content Transform complex research papers into easy-to-understand podcasts Create English learning materials in your own voice 📰 News & Information Convert international articles into engaging audio content Produce global trend analysis podcasts 🎨 Creative Content Tell stories in English with your own voice Build your global personal brand with custom audio content 🛠️ Tech Stack 🧠 LLM: Llama CPP + Private-BitSix-Mistral 🗣️ TTS: Chatterbox (Voice Cloning & Emotional Expression) 🔍 Search: Brave Search API 📄 Document Processing: LangChain + PyPDF 🖥️ Interface: Gradio 🎉 What Makes Us Special 🎤 Voice Cloning: Perfect voice replication from just a short audio sample 😊 Emotion Contro 📏 Unlimited Length 🔄 Real-time Updates
reacted to clefourrier's post with 👀 9 days ago
Saying Claude 4 is "the best coding model in the world" from the SWEBench scores is super misleading, and here is why: If you look at the announcement table, their model has the best scores, but... if you look at the very bottom, in font 4, you'll see that the metric they report is actually not the same metric as the one used for the other models! Comparing "pass@1 averaged 10 times" to "normal pass@1" is like grading one student by allowing them to take the test 10 times and averaging question scores, when the other students only get one chance at grading. The first way to grade (avg@10) is actually quite good statistically, much better than what model creators usually report, because models tend to be quite inconsistent - sometimes good, sometimes bad... But! You want to do it for all models then, and report with error bars. The issue is that, if you do... well, it's going to be harder to say your model is the best, because the error bars will overlap between models, by a lot. Also, you'll see that 2 numbers are reported: the first one is using avg@10 (what I explained above), and the second, highest one is using this plus many other tricks: - test time compute (so having the model generate a tree of answers and selecting the best as you go, more or less) - removing the times when the model breaks the tests - and using another model to select the most promising solution! You can't really say it's better than the rest, mostly because it's **way less efficient** to achieve a similar result. It's honestly a bit sad because from user reports, the model sounds good - however, this announcement is overblown numbers wise, and I'm quite sure it's more a problem of "too much marketing" than of "bad science" Another thing which makes the comparison invalid is the complete absence of open source from the report - don't think they are aware of DeepSeek/ Qwen/The new mistral for code/and all the cool specialised models found on the hub?
View all activity

Organizations

Neon Cortex's profile picture

nomadicsynth's activity

reacted to openfree's post with 👀 2 days ago
view post
Post
2540
🎙️ Voice Clone AI Podcast Generator: Create Emotionally Rich Podcasts with Your Own Voice!

🚀 Project Introduction
Hello! Today we're excited to introduce an AI-powered solo podcast generator that creates high-quality voice cloning with authentic emotional expression.
Transform any PDF document, web URL, or keyword into a professional podcast with just a few clicks! 📚➡️🎧

VIDraft/Voice-Clone-Podcast

✨ Key Features
1. 🎯 Multiple Input Methods

URL: Simply paste any blog or article link
PDF: Upload research papers or documents directly
Keyword: Enter a topic and AI searches for the latest information to create content

2. 🎭 Emotionally Expressive Voice Cloning
Powered by Chatterbox TTS:

🎤 Voice Cloning: Learn and replicate your unique voice perfectly
📢 Natural intonation and emotional expression
🌊 Customizable emotion intensity with Exaggeration control
⚡ Seamless handling of long texts with automatic chunking

3. 🤖 State-of-the-Art LLM Script Generation

Professional-grade English dialogue using Private-BitSix-Mistral
12 natural conversational exchanges
Real-time web search integration for up-to-date information
Fully editable generated scripts! ✏️

💡 Use Cases
📖 Educational Content

Transform complex research papers into easy-to-understand podcasts
Create English learning materials in your own voice

📰 News & Information

Convert international articles into engaging audio content
Produce global trend analysis podcasts

🎨 Creative Content

Tell stories in English with your own voice
Build your global personal brand with custom audio content

🛠️ Tech Stack
🧠 LLM: Llama CPP + Private-BitSix-Mistral
🗣️ TTS: Chatterbox (Voice Cloning & Emotional Expression)
🔍 Search: Brave Search API
📄 Document Processing: LangChain + PyPDF
🖥️ Interface: Gradio
🎉 What Makes Us Special

🎤 Voice Cloning: Perfect voice replication from just a short audio sample
😊 Emotion Contro 📏 Unlimited Length 🔄 Real-time Updates
  • 1 reply
·
reacted to clefourrier's post with 👀 9 days ago
view post
Post
1751
Saying Claude 4 is "the best coding model in the world" from the SWEBench scores is super misleading, and here is why:

If you look at the announcement table, their model has the best scores, but... if you look at the very bottom, in font 4, you'll see that the metric they report is actually not the same metric as the one used for the other models!


Comparing "pass@1 averaged 10 times" to "normal pass@1" is like grading one student by allowing them to take the test 10 times and averaging question scores, when the other students only get one chance at grading.

The first way to grade (avg@10) is actually quite good statistically, much better than what model creators usually report, because models tend to be quite inconsistent - sometimes good, sometimes bad...
But! You want to do it for all models then, and report with error bars.
The issue is that, if you do... well, it's going to be harder to say your model is the best, because the error bars will overlap between models, by a lot.

Also, you'll see that 2 numbers are reported: the first one is using avg@10 (what I explained above), and the second, highest one is using this plus many other tricks:
- test time compute (so having the model generate a tree of answers and selecting the best as you go, more or less)
- removing the times when the model breaks the tests
- and using another model to select the most promising solution!
You can't really say it's better than the rest, mostly because it's **way less efficient** to achieve a similar result.

It's honestly a bit sad because from user reports, the model sounds good - however, this announcement is overblown numbers wise, and I'm quite sure it's more a problem of "too much marketing" than of "bad science"

Another thing which makes the comparison invalid is the complete absence of open source from the report - don't think they are aware of DeepSeek/ Qwen/The new mistral for code/and all the cool specialised models found on the hub?
  • 1 reply
·
posted an update 9 days ago
view post
Post
2652
Anyone using AI and ML to help neurodivergent people? I'd love to hear what you're doing.
  • 1 reply
·
replied to ProCreations's post 11 days ago
view reply

Every time you use a HF space you randomly start dancing for 5 minutes

This one fr i'm dancing all day anyway idk how people survive in cubicles

reacted to codelion's post with 🚀 12 days ago
view post
Post
2806
🧬 Hey everyone! Just released **OpenEvolve** - an open-source implementation of Google DeepMind's AlphaEvolve system.

It's an evolutionary coding agent that uses LLMs to discover and optimize algorithms. I successfully replicated DeepMind's results on circle packing (99.97% match!) and evolved a random search into a simulated annealing algorithm.

✨ Key features:
- Evolves entire codebases (not just single functions)
- Works with any OpenAI-compatible API
- LLM ensemble approach for better results
- Multi-objective optimization

👉 Check it out:
GitHub: https://github.com/codelion/openevolve
Blog post: https://huggingface.co/blog/codelion/openevolve

Would love to hear your thoughts or answer any questions about it!
reacted to ProCreations's post with 👀 15 days ago
reacted to seawolf2357's post with 👀 16 days ago
view post
Post
6004
Samsung Hacking Incident: Samsung Electronics' Official Hugging Face Account Compromised
Samsung Electronics' official Hugging Face account has been hacked. Approximately 17 hours ago, two new language models (LLMs) were registered under Samsung Electronics' official Hugging Face account. These models are:

https://huggingface.co/Samsung/MuTokenZero2-32B
https://huggingface.co/Samsung/MythoMax-L2-13B

The model descriptions contain absurd and false claims, such as being trained on "1 million W200 GPUs," hardware that doesn't even exist.
Moreover, community participants on Hugging Face who have noticed this issue are continuously posting that Samsung Electronics' account has been compromised.
There is concern about potential secondary and tertiary damage if users download these LLMs released under the Samsung Electronics account, trusting Samsung's reputation without knowing about the hack.
Samsung Electronics appears to be unaware of this situation, as they have not taken any visible measures yet, such as changing the account password.
Source: https://discord.gg/openfreeai
  • 2 replies
·
posted an update 18 days ago
view post
Post
360
How do you talk about AI’s promise without sounding like you’re selling out to the next tech gold rush?
  • 1 reply
·
reacted to AdinaY's post with 👀 20 days ago
view post
Post
2511
Matrix Game 🎮 an interactive foundation model for controllable game world generation, released by Skywork AI.

Skywork/Matrix-Game

✨ 17B with MIT licensed
✨ Diffusion-based image-to-world video generation via keyboard & mouse input
✨ GameWorld Score benchmark for Minecraft world models
✨ Massive Matrix Game Dataset with fine-grained action labels
reacted to ArturoNereu's post with 🔥 20 days ago
view post
Post
4247
I’ve been learning AI for several years (coming from the games industry), and along the way, I curated a list of the tools, courses, books, papers, and models that actually helped me understand things.

I turned this into a GitHub repo:
https://github.com/ArturoNereu/AI-Study-Group

If you’re just getting started, I recommend:

📘 Deep Learning – A Visual Approach: https://www.glassner.com/portfolio/deep-learning-a-visual-approach
🎥 Dive into LLMs with Andrej Karpathy: https://youtu.be/7xTGNNLPyMI?si=aUTq_qUzyUx36BsT
🧠 The 🤗 Agents course](https://huggingface.co/learn/agents-course/

The repo has grown with help from the community (Reddit, Discord, etc.) and I’ll keep updating it.

If you have any favorite resources, I’d love to include them.
replied to ProCreations's post 20 days ago
view reply

Is the GPU offer still available? 😂

replied to ProCreations's post 20 days ago
reacted to blaise-tk's post with 🔥 20 days ago
view post
Post
4133
Today we launch Dione.

A few months ago it was just a wild idea I shared with @bygimenez , now it's real.

Dione (Beta) is here, the easiest way to discover and install open-source apps, especially AI ones.

Think of it as the Steam of open source. Installing open-source tools is often a mess. Dione fixes that.

Beautiful UI and workflow. Soon multi-platform, multilingual & fully open-source.
Users can even write and share their own installation scripts. This is just the beginning.

🚀 Join our exclusive Beta
https://getdione.app/beta/join
  • 2 replies
·
reacted to smirki's post with 🚀 22 days ago
view post
Post
2725
✨ We’re live! Introducing TFrameX, the agentic framework for AI builders.

After nights of development, we’re finally open-sourcing TFrameX, a powerful AI agent communication and coordination library.
TFrameX lets you:

🤖 Run agents in dynamic flows
🔁 Compose reusable patterns like Sequential, Parallel, Router, and more
🧠 Enable agent-to-agent collaboration and delegation
⚡ Build modular, complex multi-agent systems that just work

👉 GitHub: TFrameX
https://github.com/TesslateAI/TFrameX

But we didn’t stop there.

We also built a sleek visual builder to design, deploy, and debug your agent patterns without writing boilerplate!

🧩 Visual Studio for TFrameX: https://github.com/TesslateAI/Studio

If you’re building agent frameworks, LLM tools, or agentic apps, TFrameX gives you the tools to move fast and reason deeply.
replied to their post 26 days ago
view reply

I'm attempting to use a 7B LLM, Llama in this case, with an embedding head stuck on the end instead of the lm_head. I used an LLM to rank a ton of randomly selected pairs of papers based on if they have good connections, and trained the embedding head on triplets mined from those ranked pairs.

The idea is for the embedding head to learn to align features from paper abstracts that complement each other.

this is the first version and yeah, I'm not overly impressed. I think I'm seeing results that kinda vibe with the concept sometimes, but I think the ranking criteria for the dataset were a bit loose. I'm going to try making a new dataset with better, more strict, more nuanced criteria and train a second version of the model from that.

replied to their post 27 days ago
view reply

Thanks for letting me know. I've fixed the issue. Feel free to try again.

posted an update 27 days ago
view post
Post
2173
I Did a Thing!

I made an embedding model to find answers in research papers. It goes deeper than plain "semantic search" by identifying deeply reasoned connections and interdisciplinary insights that might have been overlooked. The goal is to find the solutions that might have been missed and to uncover answers that are already out there.

I’ve set up a demo Space - nomadicsynth/inkling . It’s early days, and I’d love some feedback on the model’s results. Try it out and let me know what you think!

Oh, and if it finds your Nobel-winning answer, I want a cut! 😉
·
replied to their post 27 days ago
view reply

I think we can extract that Harvard knowledge and distribute it in the form of properly open models. Get them chatting with our LLMs and train on the collected knowledge. Mwahaha!

They do it to us, after all.

replied to their post 27 days ago
replied to their post 27 days ago
view reply

it's pretty exciting to see the newer, more powerful hardware coming down the pipeline. with better hardware becoming more prolific I hope we can see expansions of concepts like BOINC and Folding@Home and newer ideas as well. It's impossible for you or I to compete with BigAI, but with federated learning techniques and enough people I think it can be done, at least "good enough". It sounds like the kind of thing you might be interested in, yeah?