MATTHEW EYESAN

Mackin7

AI & ML interests

Vision models, language models, ML algorithms and solutions, security and data frameworks

Recent Activity

Organizations

None yet

Mackin7's activity

New activity in Mackin7/my-distiset-5d7059ee 2 months ago
New activity in Mackin7/my-distiset-55a6b53b 4 months ago
reacted to fdaudens's post with 🔥 7 months ago
reacted to clem's post with 🔥 8 months ago
Just crossed 200,000 free public AI datasets shared by the community on Hugging Face! Text, image, video, audio, time-series & many more... Thanks everyone!

http://hf.co/datasets
reacted to codelion's post with ❤️ 8 months ago
We recently worked with OpenAI to fine-tune gpt-4o and built the SOTA model for the patched-codes/static-analysis-eval benchmark. All the code and the data (patched-codes/synth-vuln-fixes) showing how we did it are available on their GitHub - https://github.com/openai/build-hours/tree/main/5-4o_fine_tuning.

Here are some tips based on our experience:

→ Establish baseline with "conditioning" / prompting

→ Task-specific datasets are ideal for PEFT; hard to beat gpt-4o on "broad" tasks

→ Add your best system prompt to each example

→ Ensure training data distribution is similar to inference data

→ Shorten instructions with concise prompts; may require more examples

→ Define clear evaluation metrics (seriously, please eval!)
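As a minimal sketch of the "add your best system prompt to each example" tip, the snippet below builds a training file in OpenAI's chat fine-tuning JSONL format, repeating one system prompt in every record. The system prompt and the example pair are illustrative placeholders, not taken from the patched-codes/synth-vuln-fixes dataset.

```python
import json

# Hypothetical system prompt -- in practice, reuse the best prompt
# found while establishing the prompting baseline.
SYSTEM_PROMPT = "You are a security engineer. Fix the vulnerability in the given code."

# Illustrative (prompt, completion) pair; a real dataset would hold many.
examples = [
    {
        "prompt": "cur.execute('SELECT * FROM t WHERE u=' + user)",
        "completion": "cur.execute('SELECT * FROM t WHERE u=%s', (user,))",
    },
]

def to_chat_record(example):
    """Wrap one pair in the chat fine-tuning format, with the same
    system prompt prepended to every training example."""
    return {
        "messages": [
            {"role": "system", "content": SYSTEM_PROMPT},
            {"role": "user", "content": example["prompt"]},
            {"role": "assistant", "content": example["completion"]},
        ]
    }

# One JSON object per line, as the fine-tuning API expects.
with open("train.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(to_chat_record(ex)) + "\n")
```

Keeping the inference-time system prompt in the training records also helps with the "training distribution should match inference distribution" tip above.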

You can see more details on the benchmark and process here - https://www.patched.codes/blog/the-static-analysis-evaluation-benchmark-measuring-llm-performance-in-fixing-software-vulnerabilities