Rakshit Aralimatti PRO

RakshitAralimatti

AI & ML interests

Poor GPU Guy

Organizations

RakshitAralimatti's activity

upvoted a paper 4 days ago

Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI Perspective

Paper • 2503.01933 • Published 7 days ago • 10

reacted to hexgrad's post with 🔥 28 days ago

Post

6869

Wanted: Peak Data. I'm collecting audio data to train another TTS model:
+ AVM data: ChatGPT Advanced Voice Mode audio & text from source
+ Professional audio: Permissive (CC0, Apache, MIT, CC-BY)

This audio should *impress* most native speakers, not just barely pass their audio Turing tests. Professional-caliber means S or A-tier, not your average bloke off the street. Traditional TTS may not make the cut. Absolutely no low-fi microphone recordings like Common Voice.

The bar is much higher than last time, so there are no timelines yet and I expect it may take longer to collect such mythical data. Raising the bar means evicting quite a bit of old data, and voice/language availability may decrease. The theme is *quality* over quantity. I would rather have 1 hour of A/S-tier than 100 hours of mid data.

I have nothing to offer but the north star of a future Apache 2.0 TTS model, so prefer data that you *already have* and costs you *nothing extra* to send. Additionally, *all* the new data may be used to construct public, Apache 2.0 voicepacks, and if that arrangement doesn't work for you, no need to send any audio.

Last time I asked for horses; now I'm asking for unicorns. As of writing this post, I've currently got a few English & Chinese unicorns, but there is plenty of room in the stable. Find me over on Discord at rzvzn: https://discord.gg/QuGxSWBfQy

4 replies

published a model about 1 month ago

RakshitAralimatti/DeepSeek-R1-Distill-Qwen-7B-Q4_K_M-GGUF

Updated Jan 29 • 25

updated a model about 1 month ago

RakshitAralimatti/DeepSeek-R1-Distill-Qwen-7B-Q4_K_M-GGUF

Updated Jan 29 • 25

New activity in mllmTeam/PhoneLM-0.5B 2 months ago

GGUF Models

#1 opened 2 months ago by

RakshitAralimatti

updated 2 models 3 months ago

RakshitAralimatti/Llama-3B-Instruct-RASA-CMD-CALM-Q5_K_M-GGUF

Updated Dec 19, 2024 • 12

RakshitAralimatti/Llama-3B-Instruct-RASA-CMD-CALM

Updated Dec 19, 2024 • 9

New activity in rasa/command-generation-calm-demo-v1 3 months ago

Model Selection doubt

#3 opened 3 months ago by

RakshitAralimatti

updated 2 models 3 months ago

RakshitAralimatti/Qwen2.5-Coder-1.5B-Instruct-RASA-CALM-CMD-GGUF

Updated Dec 18, 2024 • 29

RakshitAralimatti/Qwen2.5-Coder-1.5B-Instruct-RASA-CALM

Text Generation • Updated Dec 13, 2024 • 51 • 1

New activity in RakshitAralimatti/Qwen2.5-Coder-1.5B-Instruct-RASA-CALM 3 months ago

Adding `safetensors` variant of this model

#1 opened 3 months ago by

SFconvertbot

updated a model 3 months ago

RakshitAralimatti/Mistral-7b-Lora-Medical-ChatSupport-Q8_0-GGUF

Question Answering • Updated Dec 3, 2024 • 11

New activity in Samyak29/speaker-segmentation-fine-tuned-hindi 4 months ago

Getting Error

#1 opened 6 months ago by

RakshitAralimatti

New activity in LLM360/TxT360 4 months ago

Size of the Dataset?

#9 opened 4 months ago by

RakshitAralimatti

New activity in mlfoundations/dclm-baseline-1.0 4 months ago

Total Size of dataset?

#14 opened 4 months ago by

RakshitAralimatti

upvoted a paper 5 months ago

SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments

Paper • 2410.11331 • Published Oct 15, 2024 • 8

authored a paper 5 months ago

SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments

Paper • 2410.11331 • Published Oct 15, 2024 • 8

New activity in varma007ut/Indian_Legal_Assitant 5 months ago

Dataset Used for Fine-tuning

#1 opened 5 months ago by

RakshitAralimatti

reacted to bartowski's post with ❤️ 7 months ago

Post

10083

So turns out I've been spreading a bit of misinformation when it comes to imatrix in llama.cpp

It starts true; imatrix runs the model against a corpus of text and tracks the activation of weights to determine which are most important

However what the quantization then does with that information is where I was wrong.

I think I made the accidental connection between imatrix and exllamav2's measuring, where ExLlamaV2 decides how many bits to assign to which weight depending on the goal BPW

Instead, llama.cpp with imatrix attempts to select a scale for each quantization block that most accurately returns the important weights to their original values, i.e. it minimizes the dequantization error weighted by the importance of activations

The mildly surprising part is that it actually just does a relatively brute-force search: it picks a bunch of scales, tries each, and sees which one results in the minimum error for the weights deemed important in the group

But yeah, turns out, the quantization scheme is always the same, it's just that the scaling has a bit more logic to it when you use imatrix

Huge shoutout to @compilade for helping me wrap my head around it - feel free to add/correct as well if I've messed something up

5 replies
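The scale search described in the post above can be sketched roughly as follows. This is a minimal illustration, not llama.cpp's actual code: the function names, the symmetric integer grid (`qmax=7`), the candidate range (`spread`, `steps`), and the squared-error metric are all assumptions made for the example. The point it tries to show is that the integer grid stays the same, and the importance values only influence which scale is picked for the block.

```python
# Illustrative sketch only; every name and parameter here is hypothetical
# and does not correspond to llama.cpp's real implementation.

def quantize_block(weights, scale, qmax=7):
    """Round each weight to a signed integer level in [-qmax - 1, qmax]."""
    if scale == 0.0:
        return [0 for _ in weights]
    return [max(-qmax - 1, min(qmax, round(w / scale))) for w in weights]


def weighted_error(weights, importance, scale, qmax=7):
    """Importance-weighted squared dequantization error for one scale."""
    levels = quantize_block(weights, scale, qmax)
    return sum(imp * (w - q * scale) ** 2
               for w, q, imp in zip(weights, levels, importance))


def search_scale(weights, importance, qmax=7, steps=41, spread=0.5):
    """Brute-force search over candidate scales near the max-abs scale."""
    amax = max(abs(w) for w in weights)
    if amax == 0.0:
        return 0.0
    base = amax / qmax  # naive scale, used as the starting point
    best_scale = base
    best_err = weighted_error(weights, importance, base, qmax)
    for i in range(steps):
        # Candidates span base * (1 - spread) .. base * (1 + spread).
        factor = 1.0 - spread + 2.0 * spread * i / (steps - 1)
        candidate = base * factor
        err = weighted_error(weights, importance, candidate, qmax)
        if err < best_err:
            best_scale, best_err = candidate, err
    return best_scale


if __name__ == "__main__":
    # Made-up block: two weights are marked as far more important.
    block = [0.9, -0.31, 0.12, 0.55, -0.07, 0.26, -0.48, 0.03]
    imp = [0.1, 5.0, 0.1, 0.1, 0.1, 4.0, 0.1, 0.1]
    scale = search_scale(block, imp)
    print("chosen scale:", scale)
    print("levels:", quantize_block(block, scale))
```

Passing uniform importance values turns this into a plain error-minimizing scale search, which is one way to see that the importance data changes only the scale choice and not the quantization format.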

liked a model 7 months ago

SandLogicTechnologies/Meta-Llama-3-8B-Instruct-GGUF

Text Generation • Updated Sep 10, 2024 • 68 • 2