Lightweight Semantic Mathlib Search โ€” LoRA Adapter (Demo)

What this is: A small, reproducible demo adapter for Qwen/Qwen3-Embedding-0.6B that enables lightweight, LLM-free semantic search over mathlib (Lean 4). Supports both natural language and formula-based search (can be used in combination) for finding Lean theorems or mathematical statements. Does not support LaTeX syntax search yet.

Data is built from ~180k Lean theorems extracted via LeanDojo for Lean 4.20.1 (there may be minor gaps vs. the full mathlib).

Part of the AITP 2025 submission: โ€œTowards Lightweight and LLM-Free Semantic Search for mathlib4.โ€

Contents

  • adapter/adapter_config.json & adapter/adapter_model.safetensors โ€” LoRA weights for the encoder
  • faiss/<*.idx> โ€” HNSW FAISS index (built for cosine/inner-product search)
  • faiss/*public_index.jsonl โ€” public mapping { idx, full_name, statement }

DEMO

https://github.com/IsaacLi74/Lightweight-and-LLM-Free-Semantic-Search-for-mathlib4

Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for Isaac74/qwen3-0.6b-lightweight-semantic-mathlib-search-adapter

Adapter
(3)
this model