File size: 879 Bytes
f634f42
 
 
 
4d5d826
f634f42
 
 
 
 
 
 
 
 
 
 
 
 
 
4d5d826
 
 
 
 
 
 
 
 
 
 
 
 
f634f42
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
---
datasets:
- jaehy12/news3
language:
- ko
base_model:
- ibm-granite/granite-3.1-2b-instruct
library_name: transformers
---
## Model Details

**Granite-3.1-2B-instruct-KR-Summarization**

Granite-3.1-2B-instruct-KR-Summarization is continued pretrained(fully fine-tuned) language model based on Granite-3.1-2B-Instruct.  

This model is trained fully with publicily available resource at HuggingFace dataset hub, preprocessed Korean texts.  

The train was done on A6000 48GB * 4.  

**Model developers** Dongwook Min (mindw96)

**Variations** Granite-3.1-2B-instruct-KR-Summarization comes in one size — 2B.

**Input** Models input text only.

**Output** Models generate text only.

**Model Architecture** Granite 3.1 is an auto-regressive language model that uses an optimized transformer architecture. 

**Model Release Date** 02.01.2025.

**Capabilities**
* Summarization