Update README.md
Browse files
README.md
CHANGED
@@ -1,3 +1,51 @@
|
|
1 |
-
---
|
2 |
-
license: mit
|
3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: mit
|
3 |
+
base_model:
|
4 |
+
- google-bert/bert-base-uncased
|
5 |
+
pipeline_tag: text-classification
|
6 |
+
datasets:
|
7 |
+
- ChiekoSeren/RWKV-Thinking-problem-classify-v1
|
8 |
+
language:
|
9 |
+
- zh
|
10 |
+
- en
|
11 |
+
- fr
|
12 |
+
- ja
|
13 |
+
- ru
|
14 |
+
---
|
15 |
+
|
16 |
+
# RWKV-Thinking Problem Difficulty Classification
|
17 |
+
|
18 |
+
This model is designed to predict the difficulty of problems within the RWKV-Thinking dataset. This prediction is used to estimate the number of reasoning paths required for multi-path reasoning.
|
19 |
+
|
20 |
+
**Model Overview:**
|
21 |
+
|
22 |
+
This model leverages the `RWKV-Thinking-problem-classify-v1` dataset to classify the difficulty of problems. The difficulty classification is a crucial step in determining the complexity of reasoning required to solve a problem, which directly influences the number of reasoning paths explored during multi-path reasoning.
|
23 |
+
|
24 |
+
**Intended Use:**
|
25 |
+
|
26 |
+
* Predicting the difficulty level of problems in the RWKV-Thinking dataset.
|
27 |
+
* Estimating the number of reasoning paths needed for multi-path reasoning.
|
28 |
+
* Evaluating the performance of language models in understanding and classifying problem complexity.
|
29 |
+
* Supporting research in reasoning, problem-solving, and natural language understanding.
|
30 |
+
|
31 |
+
**Dataset Details:**
|
32 |
+
|
33 |
+
* **Dataset Name:** `RWKV-Thinking-problem-classify-v1`
|
34 |
+
* **Dataset Description:** This dataset assesses the diversity of problem types and the probability of successful problem-solving across various contexts. It includes a range of problem statements, classifications, and associated metadata.
|
35 |
+
* **Dataset Creation:**
|
36 |
+
* **Curation Rationale:** Created to provide a benchmark for evaluating how well models like RWKV can handle diverse problem types and predict solution success.
|
37 |
+
* **Source Data:** Problems may be sourced from synthetic generation, educational materials, or curated problem-solving repositories.
|
38 |
+
* **Preprocessing:** Problems were standardized, categorized, and assigned diversity and success probability scores.
|
39 |
+
* **Annotations:** Manual annotation by domain experts or automated scoring based on predefined criteria. Annotators assessed problem complexity, uniqueness, and solvability.
|
40 |
+
* **Fine-tuning Dataset Size:** 1K < n < 10K
|
41 |
+
|
42 |
+
**Model Training:**
|
43 |
+
|
44 |
+
* **Model Architecture:** BERT
|
45 |
+
* **Training Data:** `RWKV-Thinking-problem-classify-v1` dataset.
|
46 |
+
|
47 |
+
**Ethical Considerations:**
|
48 |
+
|
49 |
+
* **Social Impact:** This model can advance AI research in reasoning and education, potentially aiding in personalized learning systems or automated tutoring tools.
|
50 |
+
* **Biases:** Potential biases may arise from the selection of problem categories or the subjectivity in assigning diversity and success scores. Users should evaluate these factors for their specific use case.
|
51 |
+
* **Limitations:** Limited scope to predefined categories. Success probability may vary based on model capability or user expertise.
|