Juan Herrera

juampahc

1 1 23

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

juampahc/LFM2.5-230M-openvino

published a model about 1 month ago

juampahc/LFM2.5-230M-openvino

liked a model 4 months ago

OuteAI/Llama-OuteTTS-1.0-1B

View all activity

Organizations

Collections 3

View 3 collections

models 10

juampahc/bge-m3-m2v-758

juampahc/bge-m3-m2v-256

juampahc/bge-m3-m2v-1024

juampahc/bge-m3-baai-onnx

juampahc/bge-m3-baai-quant-opt

juampahc/bge-m3-baai-quant

datasets 0

None public yet

Juan Herrera

AI & ML interests

Recent Activity

Organizations

Collections 3

CLEX: Continuous Length Extrapolation for Large Language Models

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

Data Engineering for Scaling Language Models to 128K Context

Transformers are Multi-State RNNs

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

CLEX: Continuous Length Extrapolation for Large Language Models

E^2-LLM: Efficient and Extreme Length Extension of Large Language Models

Data Engineering for Scaling Language Models to 128K Context

Transformers are Multi-State RNNs

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

models 10

juampahc/LFM2.5-230M-openvino

juampahc/gliner_multi-v2.1-openvino

juampahc/llama-3.2-3b-openvino

juampahc/gliner_multi-v2.1-onnx

juampahc/bge-m3-m2v-758

juampahc/bge-m3-m2v-256

juampahc/bge-m3-m2v-1024

juampahc/bge-m3-baai-onnx

juampahc/bge-m3-baai-quant-opt

juampahc/bge-m3-baai-quant

datasets 0

Juan Herrera

AI & ML interests

Recent Activity

Organizations

Collections 3

models 10 Sort: Recently updated

datasets 0

models 10