Update README.md
Browse files
README.md
CHANGED
@@ -1,200 +1,73 @@
|
|
1 |
-
---
|
2 |
-
library_name: transformers
|
3 |
-
tags:
|
4 |
-
- unsloth
|
5 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
|
7 |
# Model Card for Model ID
|
8 |
|
9 |
-
|
10 |
-
|
11 |
-
|
12 |
|
13 |
## Model Details
|
|
|
14 |
|
15 |
-
|
16 |
-
|
17 |
-
<!-- Provide a longer summary of what this model is. -->
|
18 |
-
|
19 |
-
This is the model card of a π€ transformers model that has been pushed on the Hub. This model card has been automatically generated.
|
20 |
|
21 |
-
|
22 |
-
- **Funded by [optional]:** [More Information Needed]
|
23 |
-
- **Shared by [optional]:** [More Information Needed]
|
24 |
-
- **Model type:** [More Information Needed]
|
25 |
-
- **Language(s) (NLP):** [More Information Needed]
|
26 |
-
- **License:** [More Information Needed]
|
27 |
-
- **Finetuned from model [optional]:** [More Information Needed]
|
28 |
|
29 |
-
|
|
|
|
|
|
|
|
|
|
|
30 |
|
31 |
-
|
|
|
|
|
|
|
32 |
|
33 |
-
|
34 |
-
- **
|
35 |
-
- **Demo [optional]:** [More Information Needed]
|
36 |
|
37 |
## Uses
|
38 |
|
39 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
|
|
|
|
|
|
40 |
|
41 |
-
|
42 |
-
|
43 |
-
|
44 |
-
|
45 |
-
[More Information Needed]
|
46 |
-
|
47 |
-
### Downstream Use [optional]
|
48 |
-
|
49 |
-
<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
|
50 |
-
|
51 |
-
[More Information Needed]
|
52 |
|
53 |
-
|
54 |
-
|
55 |
-
|
56 |
-
|
57 |
-
[More Information Needed]
|
58 |
|
59 |
## Bias, Risks, and Limitations
|
60 |
|
61 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
62 |
-
|
63 |
-
|
64 |
-
|
65 |
-
|
66 |
-
|
67 |
-
|
68 |
-
|
69 |
-
|
70 |
-
|
71 |
-
|
72 |
-
|
73 |
-
|
74 |
-
|
75 |
-
|
76 |
-
|
77 |
-
## Training Details
|
78 |
-
|
79 |
-
### Training Data
|
80 |
-
|
81 |
-
<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
|
82 |
-
|
83 |
-
[More Information Needed]
|
84 |
-
|
85 |
-
### Training Procedure
|
86 |
-
|
87 |
-
<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
|
88 |
-
|
89 |
-
#### Preprocessing [optional]
|
90 |
-
|
91 |
-
[More Information Needed]
|
92 |
-
|
93 |
-
|
94 |
-
#### Training Hyperparameters
|
95 |
-
|
96 |
-
- **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
|
97 |
-
|
98 |
-
#### Speeds, Sizes, Times [optional]
|
99 |
-
|
100 |
-
<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
|
101 |
-
|
102 |
-
[More Information Needed]
|
103 |
-
|
104 |
-
## Evaluation
|
105 |
-
|
106 |
-
<!-- This section describes the evaluation protocols and provides the results. -->
|
107 |
-
|
108 |
-
### Testing Data, Factors & Metrics
|
109 |
-
|
110 |
-
#### Testing Data
|
111 |
-
|
112 |
-
<!-- This should link to a Dataset Card if possible. -->
|
113 |
-
|
114 |
-
[More Information Needed]
|
115 |
-
|
116 |
-
#### Factors
|
117 |
-
|
118 |
-
<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
|
119 |
-
|
120 |
-
[More Information Needed]
|
121 |
-
|
122 |
-
#### Metrics
|
123 |
-
|
124 |
-
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
125 |
-
|
126 |
-
[More Information Needed]
|
127 |
-
|
128 |
-
### Results
|
129 |
-
|
130 |
-
[More Information Needed]
|
131 |
-
|
132 |
-
#### Summary
|
133 |
-
|
134 |
-
|
135 |
-
|
136 |
-
## Model Examination [optional]
|
137 |
-
|
138 |
-
<!-- Relevant interpretability work for the model goes here -->
|
139 |
-
|
140 |
-
[More Information Needed]
|
141 |
-
|
142 |
-
## Environmental Impact
|
143 |
-
|
144 |
-
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
|
145 |
-
|
146 |
-
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
147 |
-
|
148 |
-
- **Hardware Type:** [More Information Needed]
|
149 |
-
- **Hours used:** [More Information Needed]
|
150 |
-
- **Cloud Provider:** [More Information Needed]
|
151 |
-
- **Compute Region:** [More Information Needed]
|
152 |
-
- **Carbon Emitted:** [More Information Needed]
|
153 |
-
|
154 |
-
## Technical Specifications [optional]
|
155 |
-
|
156 |
-
### Model Architecture and Objective
|
157 |
-
|
158 |
-
[More Information Needed]
|
159 |
-
|
160 |
-
### Compute Infrastructure
|
161 |
-
|
162 |
-
[More Information Needed]
|
163 |
-
|
164 |
-
#### Hardware
|
165 |
-
|
166 |
-
[More Information Needed]
|
167 |
-
|
168 |
-
#### Software
|
169 |
-
|
170 |
-
[More Information Needed]
|
171 |
-
|
172 |
-
## Citation [optional]
|
173 |
-
|
174 |
-
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
175 |
-
|
176 |
-
**BibTeX:**
|
177 |
-
|
178 |
-
[More Information Needed]
|
179 |
-
|
180 |
-
**APA:**
|
181 |
-
|
182 |
-
[More Information Needed]
|
183 |
-
|
184 |
-
## Glossary [optional]
|
185 |
-
|
186 |
-
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
|
187 |
-
|
188 |
-
[More Information Needed]
|
189 |
-
|
190 |
-
## More Information [optional]
|
191 |
-
|
192 |
-
[More Information Needed]
|
193 |
-
|
194 |
-
## Model Card Authors [optional]
|
195 |
-
|
196 |
-
[More Information Needed]
|
197 |
-
|
198 |
-
## Model Card Contact
|
199 |
-
|
200 |
-
[More Information Needed]
|
|
|
1 |
+
---
|
2 |
+
library_name: transformers
|
3 |
+
tags:
|
4 |
+
- unsloth
|
5 |
+
license: gemma
|
6 |
+
datasets:
|
7 |
+
- SoyunKim/startrup
|
8 |
+
language:
|
9 |
+
- ko
|
10 |
+
base_model:
|
11 |
+
- google/gemma-2-2b
|
12 |
+
---
|
13 |
|
14 |
# Model Card for Model ID
|
15 |
|
16 |
+
**StartBridge** is a machine learning model that helps startups find relevant 2024 government startup support programs by providing quick, personalized answers about funding and opportunities.
|
|
|
|
|
17 |
|
18 |
## Model Details
|
19 |
+
Introducing **StartBridge**, your go-to guide for navigating 2024 government support programs for startups! Designed with fine-tuning machine learning, **StartBridge** bridges the gap between you and the vast world of public funding, making it easier than ever to find the right opportunities.
|
20 |
|
21 |
+
Whether youβre an entrepreneur searching for startup grants or a business in need of early-stage funding, **StartBridge** swiftly delivers clear, relevant, and personalized answers to your questions. With **StartBridge**, no more sifting through countless websites or documentsβsimply ask, and get connected to the right support in seconds. It's not just a tool, it's your personal bridge to growth and innovation!
|
|
|
|
|
|
|
|
|
22 |
|
23 |
+
### Model Description
|
|
|
|
|
|
|
|
|
|
|
|
|
24 |
|
25 |
+
- **Developed by:** Soyun Kim, Su-min Kim
|
26 |
+
- **Activity with:** MLB 2024, Gemma Sprint
|
27 |
+
- **Model type:** Causal Language Model (AutoModelForCausalLM)
|
28 |
+
- **Language(s) (NLP):** Korean
|
29 |
+
- **License:** Gemma Term of Use
|
30 |
+
- **Finetuned from model:** google/gemma2-2b
|
31 |
|
32 |
+
**Model Sources**
|
33 |
+
- **Reference #1:**Β https://github.com/unslothai/unsloth.git
|
34 |
+
- **Reference #2:**Β https://www.sktenterprise.com/bizInsight/blogDetail/dev/9480
|
35 |
+
- **Reference #3:** https://unfinishedgod.netlify.app/2024/06/15/llm-unsloth-gguf/
|
36 |
|
37 |
+
**Data Resource**
|
38 |
+
- **Resource**: https://huggingface.co/datasets/SoyunKim/startrup
|
|
|
39 |
|
40 |
## Uses
|
41 |
|
42 |
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
43 |
+
1. **Intended Use**:
|
44 |
+
- **StartBridge** is designed to help users find relevant government support programs specifically targeted towards startups. It serves as a query-based system where users can ask questions about available programs, eligibility criteria, deadlines, application processes, and any other relevant details.
|
45 |
+
- The model can provide information by matching user queries with existing data about various support programs, simplifying the process for startups seeking government assistance.
|
46 |
|
47 |
+
2. **Foreseeable Users**:
|
48 |
+
- **Startups and Entrepreneurs**: The primary users of the model will be founders, entrepreneurs, or teams looking to access government grants, subsidies, tax reliefs, or any other form of financial or administrative support.
|
49 |
+
- **Business Advisors and Consultants**: Professionals who assist startups in navigating the complex landscape of government support programs may use the model to provide quick and accurate answers to their clients.
|
50 |
+
- **Incubators and Accelerators**: Organizations that provide mentorship and resources to early-stage startups may also use the model to assist their cohorts in understanding available government programs.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
51 |
|
52 |
+
3. **Those Affected by the Model**:
|
53 |
+
- **Entrepreneurs**: With better access to information, entrepreneurs may have a more equitable chance of benefiting from the support programs, potentially influencing startup growth rates and success rates.
|
54 |
+
- **Policymakers**: Insights from the types of questions users ask can help policymakers refine or promote certain programs, ensuring that they meet the needs of startups more effectively.
|
|
|
|
|
55 |
|
56 |
## Bias, Risks, and Limitations
|
57 |
|
58 |
<!-- This section is meant to convey both technical and sociotechnical limitations. -->
|
59 |
+
1. **Bias**:
|
60 |
+
- **StartBridge** has been primarily trained on data from **2024 government startup support programs** focusing specifically on **commercialization support** (μ¬μ
νμ§μ). This creates a potential bias, as the model's responses will be most accurate for questions related to this area.
|
61 |
+
- Programs supporting other aspects of startups, such as funding for research and development (R&D), international expansion, or human resources, may not be covered as thoroughly, leading to gaps in the information provided.
|
62 |
+
- The list of programs currently covered is as follows:
|
63 |
+
λ―Όκ΄κ³΅λμ°½μ
μλ°κ΅΄μ‘μ±μ¬μ
, μλΉμ°½μ
ν¨ν€μ§, μ΄κΈ°μ°½μ
ν¨ν€μ§, μ°½μ
λμ½ν¨ν€μ§, μ΄κ²©μ°¨ μ€ννΈμ
1000+ νλ‘μ νΈ, μκΈ°μ λμ½ μ‘μ±μ¬μ
, μ¬λμ μ±κ³΅ν¨ν€μ§, μ°½μ
μ€μ¬λν, μμ μ΅μ΄ μ²λ
μ°½μ
μ§μμ¬μ
, 곡곡기μ μ°½μ
μ¬μ
ν μ§μμ¬μ
, μ°½μ
μ±κ³΅ν¨ν€μ§(μ²λ
μ°½μ
μ¬κ΄νκ΅), λ‘컬ν¬λ¦¬μμ΄ν° μ‘μ±μ¬μ
, K-Global μ‘μ
λ¬λ μ΄ν° μ‘μ±μ¬μ
, κΈλ‘λ² ICT λ―Έλ μ λμ½ μ‘μ±, λ°μ΄ν° νμ© μ¬μ
ν μ§μμ¬μ
(DATA-Stars), μμ½μ€ννΈμ
μ§μμ¬μ
, λνλ―Όκ΅ λ¬Όμ°μ
νμ μ°½μ
λμ , λ¬Όλλ¦Ό μ¬μ
νμ§μ, μμ κΈ°μ
μ±μ₯ μ§μ, μ€ν¬μΈ μ°μ
μ°½μ
μ§μ, μ€ν¬μΈ μ°μ
μ°½μ
μ€κΈ°(μ‘μ
λ¬λ μ΄ν
) μ§μ, μ½ν
μΈ μμ΄λμ΄ μ¬μ
ν μ§μ, μ½ν
μΈ μ΄κΈ°μ°½μ
μ‘μ± μ§μ, μ½ν
μΈ μ°½μ
λμ½ νλ‘κ·Έλ¨, μ λκΈ°μ
μ°κ³ λλ°μ±μ₯ μ§μ(μ½ν
μΈ μ€νμ΄λ
Έλ² μ΄μ
), μ ν΅λ¬Έν μ²λ
μ°½μ
μ‘μ±μ§μμ¬μ
, κ΄κ΄λ²€μ²μ¬μ
곡λͺ¨μ , κ΄κ΄ μ‘μ
λ¬λ μ΄ν
νλ‘κ·Έλ¨, μ μ¬μ
μ°½μ
μ¬κ΄νκ΅, λμν λ²€μ²μ‘μ±μ§μ, λμν κΈ°μ μ°½μ
μ‘μ
λ¬λ μ΄ν° μ‘μ±μ§μ, λμν κΈ°μ νκ°μ§μ, λμν νλ‘μ§μ, μ λ§ μ°½μ
κΈ°μ
ν¬μμ μΉ μ§μμ¬μ
|
64 |
+
|
65 |
+
2. **Risks**:
|
66 |
+
- **Outdated Information**: Since the model is trained on a specific dataset from 2024, there is a risk that it may provide outdated or incomplete information if not updated regularly, especially as new government programs or changes to existing programs emerge.
|
67 |
+
- **Over-reliance on the Model**: Users may rely solely on the model without verifying the information through official channels, potentially leading to missed opportunities or misunderstandings in program eligibility and application requirements.
|
68 |
+
- **Limited Context**: The model might lack the ability to fully understand complex or nuanced queries that require knowledge beyond the provided dataset, which could result in incorrect or incomplete responses.
|
69 |
+
|
70 |
+
3. **Limitations**:
|
71 |
+
- **Data Scope**: As of now, **StartBridge** is trained only on data related to **commercialization support programs** in 2024. This limits the modelβs effectiveness when users ask about other areas of startup support such as tax incentives, infrastructure support, or legal assistance.
|
72 |
+
- **Language and Terminology Variations**: If users use different terms or phrases that aren't covered in the dataset, the model may struggle to interpret the question correctly.
|
73 |
+
- **Future Updates**: The model is expected to be updated continuously as more data on new government programs and support areas become available. While this will expand its scope, users must be aware that the current version is limited in coverage.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|