

Aisha Halder

Ahalder · AishoHalder

AI & ML interests

AI & ML, Networking, P2P

Organizations

None yet

Ahalder's collections 16

openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated 27 days ago • 92.1k • 1.19k

Snowflake/snowflake-arctic-embed-l-v2.0

Sentence Similarity • 0.6B • Updated Apr 25 • 165k • 187

OpenGVLab/InternVL2-2B

Image-Text-to-Text • 2B • Updated Mar 25 • 665k • 71

Image generation

UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion

Paper • 2401.13388 • Published Jan 24, 2024 • 12
BootPIG: Bootstrapping Zero-shot Personalized Image Generation Capabilities in Pretrained Diffusion Models

Paper • 2401.13974 • Published Jan 25, 2024 • 14
Runtime error

420

Real ESRGAN

🏃
Vchitect/Vchitect-2.0-2B

Text-to-Video • Updated Mar 25 • 9 • 39

DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter

Paper • 1910.01108 • Published Oct 2, 2019 • 17
distilbert/distilbert-base-uncased-finetuned-sst-2-english

Text Classification • 0.1B • Updated Dec 19, 2023 • 3.24M • 790
FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design

Paper • 2401.14112 • Published Jan 25, 2024 • 21
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

Paper • 2401.04092 • Published Jan 8, 2024 • 22

Genie: Generative Interactive Environments

Paper • 2402.15391 • Published Feb 23, 2024 • 72

Video generation

Running on Zero

42

Vchitect 2.0

🐢

Generate videos from text prompts

stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4 • 73.7k • 1.5k

google/timesfm-2.0-500m-pytorch

Time Series Forecasting • 0.5B • Updated Apr 16 • 5.81k • 182

tensoropera/Fox-1-1.6B

Text Generation • 2B • Updated Nov 21, 2024 • 985 • 33

Image Processing

Scaling Up to Excellence: Practicing Model Scaling for Photo-Realistic Image Restoration In the Wild

Paper • 2401.13627 • Published Jan 24, 2024 • 75
Runtime error

420

Real ESRGAN

🏃
microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 1.4k • 1.68k
NexaAIDev/OmniVLM-968M

0.5B • Updated Dec 17, 2024 • 654 • 520

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29, 2024 • 51
Nfiniteai/product-masks-sample

Viewer • Updated Sep 5, 2024 • 2.71k • 24 • 14
HuggingFaceFV/finevideo

Viewer • Updated Dec 16, 2024 • 39.5k • 3.53k • 317
rulins/MassiveDS-140B

Viewer • Updated Jul 17, 2024 • 3.08M • 1.69k • 7

Speech and Audio

facebook/wav2vec2-base-960h

Automatic Speech Recognition • 0.1B • Updated Nov 14, 2022 • 998k • 357
ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25, 2024 • 61
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

Paper • 2409.10819 • Published Sep 17, 2024 • 20
jadechoghari/openmusic

Text-to-Audio • Updated Oct 10, 2024 • 55 • 67

finegrain/finegrain-box-segmenter

Mask Generation • 0.1B • Updated Sep 11, 2024 • 4.04k • 121
Running on Zero

494

Finegrain Object Cutter

✂

Create HD cutouts from any image with just a prompt

mixedbread-ai/mxbai-colbert-large-v1

0.3B • Updated Mar 13 • 24.9k • 52
jinaai/jina-embeddings-v3

Feature Extraction • 0.6B • Updated Feb 24 • 3.54M • 1.03k
Running

8

Paper Whisperer

📈

Paper Whisperer

Runtime error

81

Dailypapershackernews

📈
Prithvi WxC: Foundation Model for Weather and Climate

Paper • 2409.13598 • Published Sep 20, 2024 • 45
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles

Paper • 2410.05262 • Published Oct 7, 2024 • 11
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant

Paper • 2410.15316 • Published Oct 20, 2024 • 12

