AI & ML interests

A central place for AI creators who want to use the different AI tools that Hugging Face provides for their film creations.

Recent Activity

Nymbo 
posted an update 2 days ago
I have a few updates to my MCP server I wanna share: New Memory tool, improvements to web search & speech generation.

# Memory_Manager Tool

We now have a Memory_Manager tool. Ask ChatGPT to write all its memories verbatim, then tell gpt-oss-20b to save each one using the tool, then take them anywhere! It stores memories in a memories.json file in the repo, no external database required.
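A file-backed memory store of this kind can be sketched in a few lines. This is a minimal illustration, not the tool's actual code: the function names `save_memory` and `list_memories` and the entry fields (`id`, `text`, `created_at`) are assumptions; only the `memories.json` file name comes from the post.

```python
import json
import uuid
from datetime import datetime, timezone
from pathlib import Path


def save_memory(text: str, path: Path = Path("memories.json")) -> dict:
    """Append one memory to a JSON file and return the stored entry."""
    # Load the existing list, or start a new one if the file is missing.
    memories = json.loads(path.read_text(encoding="utf-8")) if path.exists() else []
    entry = {
        "id": uuid.uuid4().hex,  # hypothetical field layout
        "text": text,
        "created_at": datetime.now(timezone.utc).isoformat(),
    }
    memories.append(entry)
    path.write_text(json.dumps(memories, indent=2, ensure_ascii=False), encoding="utf-8")
    return entry


def list_memories(path: Path = Path("memories.json")) -> list:
    """Return every stored memory, oldest first."""
    return json.loads(path.read_text(encoding="utf-8")) if path.exists() else []
```

Because everything lives in one plain JSON file, moving the memories to another machine or model is just a matter of copying that file.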

The Memory_Manager tool is currently hidden from the HF space because it's intended for local use. It's enabled by providing an HF_READ_TOKEN in the env secrets, although it doesn't actually use the key for anything. There's probably a cleaner way of ensuring memory is only used locally; I'll come back to this.

# Fetch & Websearch

The Fetch_Webpage tool has been simplified a lot. It now converts the page to Markdown and returns the page with three length settings (Brief, Standard, Full). This is a lot more reliable than the old custom extraction method.
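The length settings could work roughly like the sketch below, which trims already-converted Markdown to a per-mode budget. The character limits and the `apply_length` helper are hypothetical; the post names the three modes but not how they are implemented.

```python
# Hypothetical character budgets; the real tool's limits are not stated in the post.
LENGTH_LIMITS = {"Brief": 1000, "Standard": 4000, "Full": None}


def apply_length(markdown: str, mode: str = "Standard") -> str:
    """Trim Markdown text to the budget for the given mode."""
    if mode not in LENGTH_LIMITS:
        raise ValueError(f"unknown mode: {mode!r}")
    limit = LENGTH_LIMITS[mode]
    if limit is None or len(markdown) <= limit:
        return markdown
    # Cut at the last paragraph break inside the budget when one exists,
    # so the result does not end mid-paragraph.
    cut = markdown.rfind("\n\n", 0, limit)
    if cut <= 0:
        cut = limit
    return markdown[:cut].rstrip() + "\n\n[truncated]"
```

Cutting on a paragraph boundary keeps the returned context tidy for small models, at the cost of sometimes returning less than the full budget.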

The Search_DuckDuckGo tool has a few small improvements. The input is easier for small models to get right, and the output is more readable.

# Speech Generation

I've added the remaining voices for Kokoro-82M. It now supports all 54 voices with all accents/languages.

I also removed the 30-second cap by making sure it computes all chunks in sequence, not just the first. I've tested it on outputs that are ~10 minutes long. Do note that when used as an MCP server, the tool will time out after 1 minute; there's nothing I can do about that right now.
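The "all chunks in sequence" idea can be sketched as follows. This is an assumption-laden illustration rather than the actual implementation: `split_into_chunks`, `synthesize_all`, and the `max_chars` budget are hypothetical, and `synthesize` stands in for whatever call turns one chunk of text into audio samples.

```python
import re


def split_into_chunks(text: str, max_chars: int = 300) -> list[str]:
    """Group sentences into chunks of at most max_chars (one long sentence stays whole)."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks, current = [], ""
    for sentence in sentences:
        if current and len(current) + 1 + len(sentence) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = f"{current} {sentence}".strip()
    if current:
        chunks.append(current)
    return chunks


def synthesize_all(text, synthesize, max_chars=300):
    """Run `synthesize` on every chunk in order and concatenate the samples."""
    audio = []
    for chunk in split_into_chunks(text, max_chars):
        # `synthesize` is a placeholder for the real TTS call.
        audio.extend(synthesize(chunk))
    return audio
```

The earlier 30-second cap would correspond to calling `synthesize` on the first chunk only; looping over every chunk removes it.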

# Other Thoughts

Lots of MCP use cases involve manipulating media (image editing, ASR, etc.). I've avoided adding tools like this so far for two reasons:

1. Most of these solutions would require assigning the space a ZeroGPU slot.
2. The current process of uploading files like images to a Gradio space is still a bit rough. It's doable but requires additional tools.

Both of these points make it a bit painful for local usage. I'm open to suggestions for other tools that rely on text.
Bils 
posted an update 8 days ago
Introducing ShortiFoley 🎵 — an AI tool that transforms short videos into realistic Foley audio.
Built on Tencent’s HunyuanVideo-Foley with SigLIP2 + CLAP, and designed for media automation pipelines like n8n.
✅ Generate Foley from video
✅ Autosave results with metadata
✅ MCP endpoints for workflows
Bils/ShortiFoley
Nymbo 
posted an update 15 days ago
I built a general-use MCP space ~ Fetch webpages, DuckDuckGo search, Python code execution, Kokoro TTS, Image Gen, Video Gen.

# Tools

1. Fetch webpage
2. Web search via DuckDuckGo (very concise, low excess context)
3. Python code executor
4. Kokoro-82M speech generation
5. Image Generation (use any model from HF Inference Providers)
6. Video Generation (use any model from HF Inference Providers)

The first four tools can be used without any API keys whatsoever. DDG search is free, and the code execution and speech gen are done on CPU. Having an HF_READ_TOKEN in the env variables will show all tools. If there isn't a key present, the Image/Video Gen tools are hidden.
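Hiding tools based on whether a token is present might look something like this. It is a sketch under assumptions: the `visible_tools` helper and the two tool groupings are hypothetical; only the `HF_READ_TOKEN` variable name and the tool list come from the post.

```python
import os

# Tool names follow the list above; the grouping into two lists is an assumption.
KEYLESS_TOOLS = ["Fetch webpage", "Web search", "Python code executor", "Speech generation"]
TOKEN_TOOLS = ["Image Generation", "Video Generation"]


def visible_tools(env=None) -> list[str]:
    """Return the tools to expose, hiding token-dependent ones when no token is set."""
    env = os.environ if env is None else env
    if env.get("HF_READ_TOKEN"):
        return KEYLESS_TOOLS + TOKEN_TOOLS
    return list(KEYLESS_TOOLS)
```

Passing the environment in as an argument makes the behavior easy to check without touching real secrets, for example `visible_tools({})` versus `visible_tools({"HF_READ_TOKEN": "..."})`.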

Nymbo/Tools
Nymbo 
posted an update 23 days ago
Anyone using Jan-v1-4B for local MCP-based web search, I highly recommend you try out Intelligent-Internet/II-Search-4B

Very impressed with this lil guy and it deserves more downloads. It's based on the original version of Qwen3-4B, but I find that it questions reality way less often. Jan-v1 seems to think that everything it sees is synthetic data and constantly gaslights me.
Zaesar 
updated a Space about 1 month ago
Zaesar 
updated a collection about 2 months ago
Bils 
posted an update about 2 months ago
I wanted to share a project I’ve been working on: a fully autonomous content creation pipeline that generates and publishes ASMR YouTube Shorts. I've turned it into a workflow template that others can use, and I thought this community would appreciate the technical orchestration behind it.
This project demonstrates how to chain multiple specialized AI models together—moving beyond simple text generation to create complete, multi-modal video content.
https://n8n.partnerlinks.io/bilsimaging
#ASMR #SatisfyingVideos #AIworkflow #ContentAutomation
#n8nCommunity #YouTubeShorts #AItools #Tech