YuchenLi01 committed on
Commit 4853204 · verified · 1 Parent(s): fc8dad5

Training in progress, step 696

logs/amlt_code_runner.txt CHANGED
@@ -1,13 +1,13 @@
- 2025-04-16 09:33:00,633:amlt-code-runner:INFO - SINGULARITY_LOCATION: centralus
- 2025-04-16 09:33:00,633:amlt-code-runner:INFO - AISC_INSTANCE_TYPE: Singularity.ND96_v4
- 2025-04-16 09:33:02,779:amlt-code-runner:INFO - Not removing AzureML's cd commands from /etc/profile due to an error: [Errno 13] Permission denied: '/etc/profile'
- 2025-04-16 09:33:02,780:amlt-code-runner:WARNING - Environment variable 'NCCL_SOCKET_IFNAME' already set to '=eth0', not changing to '^docker0,lo'
- 2025-04-16 09:33:02,780:amlt-code-runner:INFO - RANK = 0
- 2025-04-16 09:33:02,780:amlt-code-runner:INFO - LOCAL_RANK = None
- 2025-04-16 09:33:02,780:amlt-code-runner:INFO - WORLD_SIZE = 1
- 2025-04-16 09:33:02,780:amlt-code-runner:INFO - MASTER_ADDR = node-0
- 2025-04-16 09:33:02,780:amlt-code-runner:INFO - MASTER_PORT = 9500
- 2025-04-16 09:33:02,781:amlt-code-runner:WARNING - Installing amlt runtime dependencies: ['wrapt', 'azure-identity', 'python-dateutil', 'pytz'] into /tmp/amlt-user-base
- 2025-04-16 09:33:04,342:amlt-code-runner:INFO - Executing ./amlt_setup.sh, ./amlt_run.sh
- 2025-04-16 09:33:04,412:background_dirsync:INFO - Starting directory syncer from '/scratch/amlt_code/outputs' to '/mnt/output/projects/amlt_project/amlt-results/7255445584.55598-b201f5dd-180c-441d-ba26-4f4577aef984', every 30.000000s
- 2025-04-16 09:33:04,413:background_dirsync:INFO - Starting directory syncer from '/scratch/azureml/cr/j/331358e5f88644038530fe8c75097321/exe/wd/logs' to '/scratch/amlt_code/outputs/logs', every 30.000000s
+ 2025-04-16 09:37:37,326:amlt-code-runner:INFO - SINGULARITY_LOCATION: centralus
+ 2025-04-16 09:37:37,327:amlt-code-runner:INFO - AISC_INSTANCE_TYPE: Singularity.ND96_v4
+ 2025-04-16 09:37:40,416:amlt-code-runner:INFO - Not removing AzureML's cd commands from /etc/profile due to an error: [Errno 13] Permission denied: '/etc/profile'
+ 2025-04-16 09:37:40,416:amlt-code-runner:WARNING - Environment variable 'NCCL_SOCKET_IFNAME' already set to '=eth0', not changing to '^docker0,lo'
+ 2025-04-16 09:37:40,416:amlt-code-runner:INFO - RANK = 0
+ 2025-04-16 09:37:40,416:amlt-code-runner:INFO - LOCAL_RANK = None
+ 2025-04-16 09:37:40,416:amlt-code-runner:INFO - WORLD_SIZE = 1
+ 2025-04-16 09:37:40,416:amlt-code-runner:INFO - MASTER_ADDR = node-0
+ 2025-04-16 09:37:40,416:amlt-code-runner:INFO - MASTER_PORT = 9500
+ 2025-04-16 09:37:40,417:amlt-code-runner:WARNING - Installing amlt runtime dependencies: ['wrapt', 'azure-identity', 'python-dateutil', 'pytz'] into /tmp/amlt-user-base
+ 2025-04-16 09:37:41,988:amlt-code-runner:INFO - Executing ./amlt_setup.sh, ./amlt_run.sh
+ 2025-04-16 09:37:42,061:background_dirsync:INFO - Starting directory syncer from '/scratch/amlt_code/outputs' to '/mnt/output/projects/amlt_project/amlt-results/7255445584.54456-fd6f9646-57d9-4aae-9985-e79f5f6e9d15', every 30.000000s
+ 2025-04-16 09:37:42,064:background_dirsync:INFO - Starting directory syncer from '/scratch/azureml/cr/j/9e4869b65a3d443b986070748b2b821d/exe/wd/logs' to '/scratch/amlt_code/outputs/logs', every 30.000000s
model-00001-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2259c74b9a7b1bad34bc1ad3ec40536ca7c902be60162ff3470789f68240da04
+ oid sha256:1953ee06b177965fccca1a71b974e867224711366ae35e6248c8c5439ad633d8
  size 4943162336
model-00002-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:09e5381118a7908a2ceb814559013754ba3cb363eae0bbf54c8f6a4fd1d59d71
+ oid sha256:6cd66ee6396a7edd5a67238f4fa5bcd5a1b4887b7359211a222dc924d058ffe8
  size 4999819336
model-00003-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:ca52ca5af217e2a9f3834297a06b4cf7737d4ec766089fa407f36d4cd606b514
+ oid sha256:53c6b5a2d1e853edc08ce727925578fc83e60e4a397d6998b428ed2decb5ec16
  size 4540516344
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3d050e98d0393f4d55d424678d8906b1f313a1d137f6c384c290b4821ce0ffc9
+ oid sha256:3b8a0c91643e25dc49e84a98435c6cc575963630214c01b46eeb33dff405d0c4
  size 7736
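
The changed binary files in this commit are stored as Git LFS pointers: three text lines giving the spec version, the object's SHA-256 `oid`, and its `size` in bytes. As a rough sketch of how a downloaded file could be checked against its pointer, the snippet below parses a pointer and compares size and hash. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and are not part of Git LFS or the Hugging Face tooling; the pointer text is the new `training_args.bin` entry from the diff above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and hash recorded in the pointer."""
    path = Path(path)
    # Cheap check first: a size mismatch means the hash cannot match.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte shards are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text for training_args.bin as it appears after this commit.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3b8a0c91643e25dc49e84a98435c6cc575963630214c01b46eeb33dff405d0c4
size 7736
"""

pointer = parse_lfs_pointer(POINTER)
```

With a local copy of the file, `matches_pointer("training_args.bin", pointer)` would report whether the download is intact; the local path here is an assumption about where the file was saved.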