Scaling Intelligence

university

https://scalingintelligence.stanford.edu/

ScalingIntelligence

AI & ML interests

None defined yet.

Recent Activity

a1zhang authored a paper 8 days ago

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

a1zhang authored a paper 8 days ago

KernelBench: Can LLMs Write Efficient GPU Kernels?

a1zhang authored a paper 8 days ago

VideoGameBench: Can Vision-Language Models complete popular video games?

View all activity

ScalingIntelligence's activity

a1zhang

authored 3 papers 8 days ago

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

Paper • 2410.03859 • Published Oct 4, 2024

KernelBench: Can LLMs Write Efficient GPU Kernels?

Paper • 2502.10517 • Published Feb 14 • 3

VideoGameBench: Can Vision-Language Models complete popular video games?

Paper • 2505.18134 • Published 15 days ago • 6

caiacost

updated a dataset 17 days ago

ScalingIntelligence/tpt-gemma2-2b-gsm8k-2k

Viewer • Updated 17 days ago • 2.18k • 65

caiacost

published a dataset 17 days ago

ScalingIntelligence/tpt-gemma2-2b-gsm8k-2k

Viewer • Updated 17 days ago • 2.18k • 65

simonguozirui

authored 2 papers 3 months ago

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

Paper • 2408.08274 • Published Aug 15, 2024 • 13

KernelBench: Can LLMs Write Efficient GPU Kernels?

Paper • 2502.10517 • Published Feb 14 • 3

simarora

authored 5 papers 8 months ago

DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models

Paper • 2306.11698 • Published Jun 20, 2023 • 12

Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT

Paper • 2402.07440 • Published Feb 12, 2024 • 1

Simple linear attention language models balance the recall-throughput tradeoff

Paper • 2402.18668 • Published Feb 28, 2024 • 21

Just read twice: closing the recall gap for recurrent language models

Paper • 2407.05483 • Published Jul 7, 2024

LoLCATs: On Low-Rank Linearizing of Large Language Models

Paper • 2410.10254 • Published Oct 14, 2024

Bradley

authored a paper 10 months ago

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Paper • 2407.21787 • Published Jul 31, 2024 • 13

RylanSchaeffer

authored a paper 12 months ago

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

Paper • 2406.04391 • Published Jun 6, 2024 • 9

danbider

authored a paper about 1 year ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 89

ekellbuch

authored a paper about 1 year ago

Deep Ensembles Work, But Are They Necessary?

Paper • 2202.06985 • Published Feb 14, 2022

sabrieyuboglu

authored a paper about 1 year ago

Zoology: Measuring and Improving Recall in Efficient Language Models

Paper • 2312.04927 • Published Dec 8, 2023 • 2

simarora

authored 3 papers over 1 year ago

On the Opportunities and Risks of Foundation Models

Paper • 2108.07258 • Published Aug 16, 2021

Can Foundation Models Wrangle Your Data?

Paper • 2205.09911 • Published May 20, 2022

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Paper • 2310.12109 • Published Oct 18, 2023 • 1