Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
@@ -10,16 +10,65 @@ task_categories:
|
|
10 |
# gr00t Model - phospho Training Pipeline
|
11 |
|
12 |
|
13 |
-
##
|
|
|
14 |
|
15 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
16 |
|
17 |
|
18 |
## Training parameters:
|
19 |
|
20 |
- **Dataset**: [sebastiandavidlee/pensInHolder-simple](https://huggingface.co/datasets/sebastiandavidlee/pensInHolder-simple)
|
21 |
- **Wandb run URL**: None
|
22 |
-
- **Epochs**:
|
23 |
- **Batch size**: 49
|
24 |
- **Training steps**: None
|
25 |
|
|
|
10 |
# gr00t Model - phospho Training Pipeline
|
11 |
|
12 |
|
13 |
+
## Error Traceback
|
14 |
+
We faced an issue while training your model.
|
15 |
|
16 |
+
```
|
17 |
+
Traceback (most recent call last):
|
18 |
+
File "/opt/conda/lib/python3.11/asyncio/tasks.py", line 500, in wait_for
|
19 |
+
return fut.result()
|
20 |
+
^^^^^^^^^^^^
|
21 |
+
File "/root/phosphobot/am/gr00t.py", line 1093, in read_output
|
22 |
+
async for line in process.stdout:
|
23 |
+
File "/opt/conda/lib/python3.11/asyncio/streams.py", line 765, in __anext__
|
24 |
+
val = await self.readline()
|
25 |
+
^^^^^^^^^^^^^^^^^^^^^
|
26 |
+
File "/opt/conda/lib/python3.11/asyncio/streams.py", line 566, in readline
|
27 |
+
line = await self.readuntil(sep)
|
28 |
+
^^^^^^^^^^^^^^^^^^^^^^^^^
|
29 |
+
File "/opt/conda/lib/python3.11/asyncio/streams.py", line 658, in readuntil
|
30 |
+
await self._wait_for_data('readuntil')
|
31 |
+
File "/opt/conda/lib/python3.11/asyncio/streams.py", line 543, in _wait_for_data
|
32 |
+
await self._waiter
|
33 |
+
asyncio.exceptions.CancelledError
|
34 |
+
|
35 |
+
The above exception was the direct cause of the following exception:
|
36 |
+
|
37 |
+
Traceback (most recent call last):
|
38 |
+
File "/root/phosphobot/am/gr00t.py", line 1104, in run_gr00t_training
|
39 |
+
await asyncio.wait_for(read_output(), timeout=timeout_seconds)
|
40 |
+
File "/opt/conda/lib/python3.11/asyncio/tasks.py", line 502, in wait_for
|
41 |
+
raise exceptions.TimeoutError() from exc
|
42 |
+
TimeoutError
|
43 |
+
|
44 |
+
During handling of the above exception, another exception occurred:
|
45 |
+
|
46 |
+
Traceback (most recent call last):
|
47 |
+
File "/root/src/helper.py", line 166, in predict
|
48 |
+
trainer.train(timeout_seconds=timeout_seconds)
|
49 |
+
File "/root/phosphobot/am/gr00t.py", line 1272, in train
|
50 |
+
asyncio.run(
|
51 |
+
File "/opt/conda/lib/python3.11/asyncio/runners.py", line 190, in run
|
52 |
+
return runner.run(main)
|
53 |
+
^^^^^^^^^^^^^^^^
|
54 |
+
File "/opt/conda/lib/python3.11/asyncio/runners.py", line 118, in run
|
55 |
+
return self._loop.run_until_complete(task)
|
56 |
+
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
57 |
+
File "/opt/conda/lib/python3.11/asyncio/base_events.py", line 654, in run_until_complete
|
58 |
+
return future.result()
|
59 |
+
^^^^^^^^^^^^^^^
|
60 |
+
File "/root/phosphobot/am/gr00t.py", line 1109, in run_gr00t_training
|
61 |
+
raise TimeoutError(
|
62 |
+
TimeoutError: Training process exceeded timeout of 43200 seconds. Please consider lowering the number of epochs and/or batch size.
|
63 |
+
|
64 |
+
```
|
65 |
|
66 |
|
67 |
## Training parameters:
|
68 |
|
69 |
- **Dataset**: [sebastiandavidlee/pensInHolder-simple](https://huggingface.co/datasets/sebastiandavidlee/pensInHolder-simple)
|
70 |
- **Wandb run URL**: None
|
71 |
+
- **Epochs**: 100
|
72 |
- **Batch size**: 49
|
73 |
- **Training steps**: None
|
74 |
|