|
Training 1/1 epoch (loss 2.2220): 0%| | 0/938 [00:06<?, ?it/s]
Training 1/1 epoch (loss 2.2220): 0%| | 1/938 [00:06<1:38:28, 6.31s/it]
Training 1/1 epoch (loss 2.1348): 0%| | 1/938 [00:09<1:38:28, 6.31s/it]
Training 1/1 epoch (loss 2.1348): 0%| | 2/938 [00:09<1:08:36, 4.40s/it]
Training 1/1 epoch (loss 2.1724): 0%| | 2/938 [00:09<1:08:36, 4.40s/it]
Training 1/1 epoch (loss 2.1724): 0%| | 3/938 [00:09<39:29, 2.53s/it]
Training 1/1 epoch (loss 2.1842): 0%| | 3/938 [00:10<39:29, 2.53s/it]
Training 1/1 epoch (loss 2.1842): 0%| | 4/938 [00:10<26:02, 1.67s/it]
Training 1/1 epoch (loss 2.1653): 0%| | 4/938 [00:10<26:02, 1.67s/it]
Training 1/1 epoch (loss 2.1653): 1%| | 5/938 [00:10<18:40, 1.20s/it]
Training 1/1 epoch (loss 2.1280): 1%| | 5/938 [00:10<18:40, 1.20s/it]
Training 1/1 epoch (loss 2.1280): 1%| | 6/938 [00:10<14:32, 1.07it/s]
Training 1/1 epoch (loss 2.2090): 1%| | 6/938 [00:11<14:32, 1.07it/s]
Training 1/1 epoch (loss 2.2090): 1%| | 7/938 [00:11<11:51, 1.31it/s]
Training 1/1 epoch (loss 2.1927): 1%| | 7/938 [00:11<11:51, 1.31it/s]
Training 1/1 epoch (loss 2.1927): 1%| | 8/938 [00:11<10:23, 1.49it/s]
Training 1/1 epoch (loss 1.9968): 1%| | 8/938 [00:12<10:23, 1.49it/s]
Training 1/1 epoch (loss 1.9968): 1%| | 9/938 [00:12<08:50, 1.75it/s]
Training 1/1 epoch (loss 2.0270): 1%| | 9/938 [00:12<08:50, 1.75it/s]
Training 1/1 epoch (loss 2.0270): 1%| | 10/938 [00:12<07:43, 2.00it/s]
Training 1/1 epoch (loss 2.0844): 1%| | 10/938 [00:12<07:43, 2.00it/s]
Training 1/1 epoch (loss 2.0844): 1%| | 11/938 [00:12<07:09, 2.16it/s]
Training 1/1 epoch (loss 2.1573): 1%| | 11/938 [00:13<07:09, 2.16it/s]
Training 1/1 epoch (loss 2.1573): 1%|β | 12/938 [00:13<07:22, 2.09it/s]
Training 1/1 epoch (loss 2.1605): 1%|β | 12/938 [00:13<07:22, 2.09it/s]
Training 1/1 epoch (loss 2.1605): 1%|β | 13/938 [00:13<07:23, 2.08it/s]
Training 1/1 epoch (loss 1.9138): 1%|β | 13/938 [00:14<07:23, 2.08it/s]
Training 1/1 epoch (loss 1.9138): 1%|β | 14/938 [00:14<06:37, 2.33it/s]
Training 1/1 epoch (loss 2.2113): 1%|β | 14/938 [00:14<06:37, 2.33it/s]
Training 1/1 epoch (loss 2.2113): 2%|β | 15/938 [00:14<06:34, 2.34it/s]
Training 1/1 epoch (loss 2.0852): 2%|β | 15/938 [00:15<06:34, 2.34it/s]
Training 1/1 epoch (loss 2.0852): 2%|β | 16/938 [00:15<07:01, 2.19it/s]
Training 1/1 epoch (loss 1.9559): 2%|β | 16/938 [00:15<07:01, 2.19it/s]
Training 1/1 epoch (loss 1.9559): 2%|β | 17/938 [00:15<06:50, 2.24it/s]
Training 1/1 epoch (loss 2.0714): 2%|β | 17/938 [00:15<06:50, 2.24it/s]
Training 1/1 epoch (loss 2.0714): 2%|β | 18/938 [00:15<06:31, 2.35it/s]
Training 1/1 epoch (loss 1.8474): 2%|β | 18/938 [00:16<06:31, 2.35it/s]
Training 1/1 epoch (loss 1.8474): 2%|β | 19/938 [00:16<06:11, 2.47it/s]
Training 1/1 epoch (loss 1.9205): 2%|β | 19/938 [00:16<06:11, 2.47it/s]
Training 1/1 epoch (loss 1.9205): 2%|β | 20/938 [00:16<06:08, 2.49it/s]
Training 1/1 epoch (loss 2.0388): 2%|β | 20/938 [00:17<06:08, 2.49it/s]
Training 1/1 epoch (loss 2.0388): 2%|β | 21/938 [00:17<06:40, 2.29it/s]
Training 1/1 epoch (loss 2.0611): 2%|β | 21/938 [00:17<06:40, 2.29it/s]
Training 1/1 epoch (loss 2.0611): 2%|β | 22/938 [00:17<06:34, 2.32it/s]
Training 1/1 epoch (loss 1.9040): 2%|β | 22/938 [00:17<06:34, 2.32it/s]
Training 1/1 epoch (loss 1.9040): 2%|β | 23/938 [00:17<06:20, 2.41it/s]
Training 1/1 epoch (loss 2.0298): 2%|β | 23/938 [00:18<06:20, 2.41it/s]
Training 1/1 epoch (loss 2.0298): 3%|β | 24/938 [00:18<06:08, 2.48it/s]
Training 1/1 epoch (loss 2.0659): 3%|β | 24/938 [00:18<06:08, 2.48it/s]
Training 1/1 epoch (loss 2.0659): 3%|β | 25/938 [00:18<06:20, 2.40it/s]
Training 1/1 epoch (loss 1.9792): 3%|β | 25/938 [00:19<06:20, 2.40it/s]
Training 1/1 epoch (loss 1.9792): 3%|β | 26/938 [00:19<06:32, 2.32it/s]
Training 1/1 epoch (loss 1.9643): 3%|β | 26/938 [00:19<06:32, 2.32it/s]
Training 1/1 epoch (loss 1.9643): 3%|β | 27/938 [00:19<06:22, 2.38it/s]
Training 1/1 epoch (loss 1.8965): 3%|β | 27/938 [00:19<06:22, 2.38it/s]
Training 1/1 epoch (loss 1.8965): 3%|β | 28/938 [00:19<06:16, 2.42it/s]
Training 1/1 epoch (loss 2.0121): 3%|β | 28/938 [00:20<06:16, 2.42it/s]
Training 1/1 epoch (loss 2.0121): 3%|β | 29/938 [00:20<06:06, 2.48it/s]
Training 1/1 epoch (loss 2.0208): 3%|β | 29/938 [00:20<06:06, 2.48it/s]
Training 1/1 epoch (loss 2.0208): 3%|β | 30/938 [00:20<06:02, 2.51it/s]
Training 1/1 epoch (loss 1.9586): 3%|β | 30/938 [00:21<06:02, 2.51it/s]
Training 1/1 epoch (loss 1.9586): 3%|β | 31/938 [00:21<06:17, 2.40it/s]
Training 1/1 epoch (loss 1.9993): 3%|β | 31/938 [00:21<06:17, 2.40it/s]
Training 1/1 epoch (loss 1.9993): 3%|β | 32/938 [00:21<06:15, 2.42it/s]
Training 1/1 epoch (loss 1.9812): 3%|β | 32/938 [00:21<06:15, 2.42it/s]
Training 1/1 epoch (loss 1.9812): 4%|β | 33/938 [00:21<06:01, 2.50it/s]
Training 1/1 epoch (loss 1.9201): 4%|β | 33/938 [00:22<06:01, 2.50it/s]
Training 1/1 epoch (loss 1.9201): 4%|β | 34/938 [00:22<05:51, 2.57it/s]
Training 1/1 epoch (loss 1.9739): 4%|β | 34/938 [00:22<05:51, 2.57it/s]
Training 1/1 epoch (loss 1.9739): 4%|β | 35/938 [00:22<05:42, 2.64it/s]
Training 1/1 epoch (loss 1.9911): 4%|β | 35/938 [00:23<05:42, 2.64it/s]
Training 1/1 epoch (loss 1.9911): 4%|β | 36/938 [00:23<05:51, 2.56it/s]
Training 1/1 epoch (loss 2.0473): 4%|β | 36/938 [00:23<05:51, 2.56it/s]
Training 1/1 epoch (loss 2.0473): 4%|β | 37/938 [00:23<06:24, 2.34it/s]
Training 1/1 epoch (loss 1.9885): 4%|β | 37/938 [00:24<06:24, 2.34it/s]
Training 1/1 epoch (loss 1.9885): 4%|β | 38/938 [00:24<06:17, 2.39it/s]
Training 1/1 epoch (loss 1.9089): 4%|β | 38/938 [00:24<06:17, 2.39it/s]
Training 1/1 epoch (loss 1.9089): 4%|β | 39/938 [00:24<06:00, 2.50it/s]
Training 1/1 epoch (loss 2.0994): 4%|β | 39/938 [00:25<06:00, 2.50it/s]
Training 1/1 epoch (loss 2.0994): 4%|β | 40/938 [00:25<07:00, 2.14it/s]
Training 1/1 epoch (loss 1.8468): 4%|β | 40/938 [00:25<07:00, 2.14it/s]
Training 1/1 epoch (loss 1.8468): 4%|β | 41/938 [00:25<06:38, 2.25it/s]
Training 1/1 epoch (loss 1.9319): 4%|β | 41/938 [00:25<06:38, 2.25it/s]
Training 1/1 epoch (loss 1.9319): 4%|β | 42/938 [00:25<06:29, 2.30it/s]
Training 1/1 epoch (loss 1.9209): 4%|β | 42/938 [00:26<06:29, 2.30it/s]
Training 1/1 epoch (loss 1.9209): 5%|β | 43/938 [00:26<06:03, 2.46it/s]
Training 1/1 epoch (loss 1.9627): 5%|β | 43/938 [00:26<06:03, 2.46it/s]
Training 1/1 epoch (loss 1.9627): 5%|β | 44/938 [00:26<05:58, 2.49it/s]
Training 1/1 epoch (loss 1.9554): 5%|β | 44/938 [00:26<05:58, 2.49it/s]
Training 1/1 epoch (loss 1.9554): 5%|β | 45/938 [00:26<05:44, 2.59it/s]
Training 1/1 epoch (loss 1.9003): 5%|β | 45/938 [00:27<05:44, 2.59it/s]
Training 1/1 epoch (loss 1.9003): 5%|β | 46/938 [00:27<06:23, 2.32it/s]
Training 1/1 epoch (loss 1.9916): 5%|β | 46/938 [00:27<06:23, 2.32it/s]
Training 1/1 epoch (loss 1.9916): 5%|β | 47/938 [00:27<05:56, 2.50it/s]
Training 1/1 epoch (loss 2.0183): 5%|β | 47/938 [00:28<05:56, 2.50it/s]
Training 1/1 epoch (loss 2.0183): 5%|β | 48/938 [00:28<05:49, 2.55it/s]
Training 1/1 epoch (loss 2.0447): 5%|β | 48/938 [00:28<05:49, 2.55it/s]
Training 1/1 epoch (loss 2.0447): 5%|β | 49/938 [00:28<05:41, 2.60it/s]
Training 1/1 epoch (loss 1.8472): 5%|β | 49/938 [00:28<05:41, 2.60it/s]
Training 1/1 epoch (loss 1.8472): 5%|β | 50/938 [00:28<05:38, 2.63it/s]
Training 1/1 epoch (loss 1.8300): 5%|β | 50/938 [00:29<05:38, 2.63it/s]
Training 1/1 epoch (loss 1.8300): 5%|β | 51/938 [00:29<05:50, 2.53it/s]
Training 1/1 epoch (loss 1.8982): 5%|β | 51/938 [00:29<05:50, 2.53it/s]
Training 1/1 epoch (loss 1.8982): 6%|β | 52/938 [00:29<06:04, 2.43it/s]
Training 1/1 epoch (loss 1.8491): 6%|β | 52/938 [00:30<06:04, 2.43it/s]
Training 1/1 epoch (loss 1.8491): 6%|β | 53/938 [00:30<06:07, 2.41it/s]
Training 1/1 epoch (loss 1.9811): 6%|β | 53/938 [00:30<06:07, 2.41it/s]
Training 1/1 epoch (loss 1.9811): 6%|β | 54/938 [00:30<06:17, 2.34it/s]
Training 1/1 epoch (loss 2.0053): 6%|β | 54/938 [00:30<06:17, 2.34it/s]
Training 1/1 epoch (loss 2.0053): 6%|β | 55/938 [00:30<05:56, 2.48it/s]
Training 1/1 epoch (loss 1.9305): 6%|β | 55/938 [00:31<05:56, 2.48it/s]
Training 1/1 epoch (loss 1.9305): 6%|β | 56/938 [00:31<06:01, 2.44it/s]
Training 1/1 epoch (loss 1.9169): 6%|β | 56/938 [00:31<06:01, 2.44it/s]
Training 1/1 epoch (loss 1.9169): 6%|β | 57/938 [00:31<06:00, 2.44it/s]
Training 1/1 epoch (loss 1.9080): 6%|β | 57/938 [00:32<06:00, 2.44it/s]
Training 1/1 epoch (loss 1.9080): 6%|β | 58/938 [00:32<05:51, 2.50it/s]
Training 1/1 epoch (loss 1.9597): 6%|β | 58/938 [00:32<05:51, 2.50it/s]
Training 1/1 epoch (loss 1.9597): 6%|β | 59/938 [00:32<05:34, 2.62it/s]
Training 1/1 epoch (loss 2.0467): 6%|β | 59/938 [00:32<05:34, 2.62it/s]
Training 1/1 epoch (loss 2.0467): 6%|β | 60/938 [00:32<05:23, 2.72it/s]
Training 1/1 epoch (loss 2.0173): 6%|β | 60/938 [00:33<05:23, 2.72it/s]
Training 1/1 epoch (loss 2.0173): 7%|β | 61/938 [00:33<05:24, 2.70it/s]
Training 1/1 epoch (loss 2.0024): 7%|β | 61/938 [00:33<05:24, 2.70it/s]
Training 1/1 epoch (loss 2.0024): 7%|β | 62/938 [00:33<05:46, 2.52it/s]
Training 1/1 epoch (loss 1.9954): 7%|β | 62/938 [00:34<05:46, 2.52it/s]
Training 1/1 epoch (loss 1.9954): 7%|β | 63/938 [00:34<05:38, 2.58it/s]
Training 1/1 epoch (loss 1.8226): 7%|β | 63/938 [00:34<05:38, 2.58it/s]
Training 1/1 epoch (loss 1.8226): 7%|β | 64/938 [00:34<05:42, 2.55it/s]
Training 1/1 epoch (loss 1.9375): 7%|β | 64/938 [00:34<05:42, 2.55it/s]
Training 1/1 epoch (loss 1.9375): 7%|β | 65/938 [00:34<06:02, 2.40it/s]
Training 1/1 epoch (loss 2.0100): 7%|β | 65/938 [00:35<06:02, 2.40it/s]
Training 1/1 epoch (loss 2.0100): 7%|β | 66/938 [00:35<06:13, 2.33it/s]
Training 1/1 epoch (loss 1.9682): 7%|β | 66/938 [00:35<06:13, 2.33it/s]
Training 1/1 epoch (loss 1.9682): 7%|β | 67/938 [00:35<06:10, 2.35it/s]
Training 1/1 epoch (loss 1.8981): 7%|β | 67/938 [00:36<06:10, 2.35it/s]
Training 1/1 epoch (loss 1.8981): 7%|β | 68/938 [00:36<05:50, 2.48it/s]
Training 1/1 epoch (loss 1.9856): 7%|β | 68/938 [00:36<05:50, 2.48it/s]
Training 1/1 epoch (loss 1.9856): 7%|β | 69/938 [00:36<05:33, 2.61it/s]
Training 1/1 epoch (loss 2.0554): 7%|β | 69/938 [00:36<05:33, 2.61it/s]
Training 1/1 epoch (loss 2.0554): 7%|β | 70/938 [00:36<05:23, 2.69it/s]
Training 1/1 epoch (loss 1.8820): 7%|β | 70/938 [00:37<05:23, 2.69it/s]
Training 1/1 epoch (loss 1.8820): 8%|β | 71/938 [00:37<05:33, 2.60it/s]
Training 1/1 epoch (loss 1.9553): 8%|β | 71/938 [00:37<05:33, 2.60it/s]
Training 1/1 epoch (loss 1.9553): 8%|β | 72/938 [00:37<05:49, 2.48it/s]
Training 1/1 epoch (loss 1.8976): 8%|β | 72/938 [00:38<05:49, 2.48it/s]
Training 1/1 epoch (loss 1.8976): 8%|β | 73/938 [00:38<05:29, 2.62it/s]
Training 1/1 epoch (loss 1.8380): 8%|β | 73/938 [00:38<05:29, 2.62it/s]
Training 1/1 epoch (loss 1.8380): 8%|β | 74/938 [00:38<05:33, 2.59it/s]
Training 1/1 epoch (loss 1.7591): 8%|β | 74/938 [00:38<05:33, 2.59it/s]
Training 1/1 epoch (loss 1.7591): 8%|β | 75/938 [00:38<05:21, 2.69it/s]
Training 1/1 epoch (loss 1.8664): 8%|β | 75/938 [00:39<05:21, 2.69it/s]
Training 1/1 epoch (loss 1.8664): 8%|β | 76/938 [00:39<05:20, 2.69it/s]
Training 1/1 epoch (loss 1.7734): 8%|β | 76/938 [00:39<05:20, 2.69it/s]
Training 1/1 epoch (loss 1.7734): 8%|β | 77/938 [00:39<05:36, 2.56it/s]
Training 1/1 epoch (loss 2.0039): 8%|β | 77/938 [00:39<05:36, 2.56it/s]
Training 1/1 epoch (loss 2.0039): 8%|β | 78/938 [00:39<05:35, 2.57it/s]
Training 1/1 epoch (loss 1.8925): 8%|β | 78/938 [00:40<05:35, 2.57it/s]
Training 1/1 epoch (loss 1.8925): 8%|β | 79/938 [00:40<05:18, 2.69it/s]
Training 1/1 epoch (loss 1.9892): 8%|β | 79/938 [00:40<05:18, 2.69it/s]
Training 1/1 epoch (loss 1.9892): 9%|β | 80/938 [00:40<05:18, 2.70it/s]
Training 1/1 epoch (loss 2.0106): 9%|β | 80/938 [00:41<05:18, 2.70it/s]
Training 1/1 epoch (loss 2.0106): 9%|β | 81/938 [00:41<05:24, 2.64it/s]
Training 1/1 epoch (loss 1.9003): 9%|β | 81/938 [00:41<05:24, 2.64it/s]
Training 1/1 epoch (loss 1.9003): 9%|β | 82/938 [00:41<05:33, 2.57it/s]
Training 1/1 epoch (loss 1.8614): 9%|β | 82/938 [00:41<05:33, 2.57it/s]
Training 1/1 epoch (loss 1.8614): 9%|β | 83/938 [00:41<05:36, 2.54it/s]
Training 1/1 epoch (loss 1.8251): 9%|β | 83/938 [00:42<05:36, 2.54it/s]
Training 1/1 epoch (loss 1.8251): 9%|β | 84/938 [00:42<05:22, 2.65it/s]
Training 1/1 epoch (loss 1.9154): 9%|β | 84/938 [00:42<05:22, 2.65it/s]
Training 1/1 epoch (loss 1.9154): 9%|β | 85/938 [00:42<05:18, 2.67it/s]
Training 1/1 epoch (loss 1.8866): 9%|β | 85/938 [00:42<05:18, 2.67it/s]
Training 1/1 epoch (loss 1.8866): 9%|β | 86/938 [00:42<05:19, 2.66it/s]
Training 1/1 epoch (loss 1.9024): 9%|β | 86/938 [00:43<05:19, 2.66it/s]
Training 1/1 epoch (loss 1.9024): 9%|β | 87/938 [00:43<05:49, 2.44it/s]
Training 1/1 epoch (loss 1.9140): 9%|β | 87/938 [00:43<05:49, 2.44it/s]
Training 1/1 epoch (loss 1.9140): 9%|β | 88/938 [00:43<05:57, 2.38it/s]
Training 1/1 epoch (loss 1.9120): 9%|β | 88/938 [00:44<05:57, 2.38it/s]
Training 1/1 epoch (loss 1.9120): 9%|β | 89/938 [00:44<05:53, 2.41it/s]
Training 1/1 epoch (loss 1.9215): 9%|β | 89/938 [00:44<05:53, 2.41it/s]
Training 1/1 epoch (loss 1.9215): 10%|β | 90/938 [00:44<05:39, 2.50it/s]
Training 1/1 epoch (loss 1.9926): 10%|β | 90/938 [00:45<05:39, 2.50it/s]
Training 1/1 epoch (loss 1.9926): 10%|β | 91/938 [00:45<06:01, 2.34it/s]
Training 1/1 epoch (loss 1.8381): 10%|β | 91/938 [00:45<06:01, 2.34it/s]
Training 1/1 epoch (loss 1.8381): 10%|β | 92/938 [00:45<05:58, 2.36it/s]
Training 1/1 epoch (loss 2.0584): 10%|β | 92/938 [00:46<05:58, 2.36it/s]
Training 1/1 epoch (loss 2.0584): 10%|β | 93/938 [00:46<06:12, 2.27it/s]
Training 1/1 epoch (loss 1.9441): 10%|β | 93/938 [00:46<06:12, 2.27it/s]
Training 1/1 epoch (loss 1.9441): 10%|β | 94/938 [00:46<05:52, 2.40it/s]
Training 1/1 epoch (loss 1.8392): 10%|β | 94/938 [00:46<05:52, 2.40it/s]
Training 1/1 epoch (loss 1.8392): 10%|β | 95/938 [00:46<05:29, 2.56it/s]
Training 1/1 epoch (loss 1.8216): 10%|β | 95/938 [00:47<05:29, 2.56it/s]
Training 1/1 epoch (loss 1.8216): 10%|β | 96/938 [00:47<05:47, 2.43it/s]
Training 1/1 epoch (loss 1.9710): 10%|β | 96/938 [00:47<05:47, 2.43it/s]
Training 1/1 epoch (loss 1.9710): 10%|β | 97/938 [00:47<06:02, 2.32it/s]
Training 1/1 epoch (loss 1.8447): 10%|β | 97/938 [00:48<06:02, 2.32it/s]
Training 1/1 epoch (loss 1.8447): 10%|β | 98/938 [00:48<05:51, 2.39it/s]
Training 1/1 epoch (loss 1.9581): 10%|β | 98/938 [00:48<05:51, 2.39it/s]
Training 1/1 epoch (loss 1.9581): 11%|β | 99/938 [00:48<05:29, 2.55it/s]
Training 1/1 epoch (loss 1.6821): 11%|β | 99/938 [00:48<05:29, 2.55it/s]
Training 1/1 epoch (loss 1.6821): 11%|β | 100/938 [00:48<05:09, 2.71it/s]
Training 1/1 epoch (loss 1.9115): 11%|β | 100/938 [00:49<05:09, 2.71it/s]
Training 1/1 epoch (loss 1.9115): 11%|β | 101/938 [00:49<05:06, 2.73it/s]
Training 1/1 epoch (loss 1.8876): 11%|β | 101/938 [00:49<05:06, 2.73it/s]
Training 1/1 epoch (loss 1.8876): 11%|β | 102/938 [00:49<04:57, 2.81it/s]
Training 1/1 epoch (loss 1.8352): 11%|β | 102/938 [00:49<04:57, 2.81it/s]
Training 1/1 epoch (loss 1.8352): 11%|β | 103/938 [00:49<05:04, 2.75it/s]
Training 1/1 epoch (loss 1.9042): 11%|β | 103/938 [00:50<05:04, 2.75it/s]
Training 1/1 epoch (loss 1.9042): 11%|β | 104/938 [00:50<05:04, 2.74it/s]
Training 1/1 epoch (loss 1.9621): 11%|β | 104/938 [00:50<05:04, 2.74it/s]
Training 1/1 epoch (loss 1.9621): 11%|β | 105/938 [00:50<05:33, 2.50it/s]
Training 1/1 epoch (loss 1.8808): 11%|β | 105/938 [00:50<05:33, 2.50it/s]
Training 1/1 epoch (loss 1.8808): 11%|ββ | 106/938 [00:50<05:14, 2.64it/s]
Training 1/1 epoch (loss 1.9377): 11%|ββ | 106/938 [00:51<05:14, 2.64it/s]
Training 1/1 epoch (loss 1.9377): 11%|ββ | 107/938 [00:51<05:22, 2.58it/s]
Training 1/1 epoch (loss 2.1005): 11%|ββ | 107/938 [00:51<05:22, 2.58it/s]
Training 1/1 epoch (loss 2.1005): 12%|ββ | 108/938 [00:51<05:40, 2.44it/s]
Training 1/1 epoch (loss 2.0435): 12%|ββ | 108/938 [00:52<05:40, 2.44it/s]
Training 1/1 epoch (loss 2.0435): 12%|ββ | 109/938 [00:52<05:17, 2.61it/s]
Training 1/1 epoch (loss 2.0695): 12%|ββ | 109/938 [00:52<05:17, 2.61it/s]
Training 1/1 epoch (loss 2.0695): 12%|ββ | 110/938 [00:52<05:04, 2.72it/s]
Training 1/1 epoch (loss 1.7382): 12%|ββ | 110/938 [00:52<05:04, 2.72it/s]
Training 1/1 epoch (loss 1.7382): 12%|ββ | 111/938 [00:52<05:09, 2.67it/s]
Training 1/1 epoch (loss 2.0131): 12%|ββ | 111/938 [00:53<05:09, 2.67it/s]
Training 1/1 epoch (loss 2.0131): 12%|ββ | 112/938 [00:53<05:14, 2.63it/s]
Training 1/1 epoch (loss 1.8224): 12%|ββ | 112/938 [00:53<05:14, 2.63it/s]
Training 1/1 epoch (loss 1.8224): 12%|ββ | 113/938 [00:53<05:14, 2.62it/s]
Training 1/1 epoch (loss 1.9035): 12%|ββ | 113/938 [00:54<05:14, 2.62it/s]
Training 1/1 epoch (loss 1.9035): 12%|ββ | 114/938 [00:54<05:40, 2.42it/s]
Training 1/1 epoch (loss 1.8465): 12%|ββ | 114/938 [00:54<05:40, 2.42it/s]
Training 1/1 epoch (loss 1.8465): 12%|ββ | 115/938 [00:54<05:33, 2.47it/s]
Training 1/1 epoch (loss 1.9724): 12%|ββ | 115/938 [00:55<05:33, 2.47it/s]
Training 1/1 epoch (loss 1.9724): 12%|ββ | 116/938 [00:55<05:57, 2.30it/s]
Training 1/1 epoch (loss 1.8001): 12%|ββ | 116/938 [00:55<05:57, 2.30it/s]
Training 1/1 epoch (loss 1.8001): 12%|ββ | 117/938 [00:55<05:47, 2.36it/s]
Training 1/1 epoch (loss 1.9260): 12%|ββ | 117/938 [00:55<05:47, 2.36it/s]
Training 1/1 epoch (loss 1.9260): 13%|ββ | 118/938 [00:55<05:52, 2.33it/s]
Training 1/1 epoch (loss 1.9705): 13%|ββ | 118/938 [00:56<05:52, 2.33it/s]
Training 1/1 epoch (loss 1.9705): 13%|ββ | 119/938 [00:56<05:26, 2.51it/s]
Training 1/1 epoch (loss 1.8801): 13%|ββ | 119/938 [00:56<05:26, 2.51it/s]
Training 1/1 epoch (loss 1.8801): 13%|ββ | 120/938 [00:56<05:26, 2.50it/s]
Training 1/1 epoch (loss 1.9432): 13%|ββ | 120/938 [00:56<05:26, 2.50it/s]
Training 1/1 epoch (loss 1.9432): 13%|ββ | 121/938 [00:56<05:10, 2.63it/s]
Training 1/1 epoch (loss 1.9196): 13%|ββ | 121/938 [00:57<05:10, 2.63it/s]
Training 1/1 epoch (loss 1.9196): 13%|ββ | 122/938 [00:57<05:13, 2.60it/s]
Training 1/1 epoch (loss 1.8550): 13%|ββ | 122/938 [00:57<05:13, 2.60it/s]
Training 1/1 epoch (loss 1.8550): 13%|ββ | 123/938 [00:57<05:12, 2.61it/s]
Training 1/1 epoch (loss 1.9007): 13%|ββ | 123/938 [00:58<05:12, 2.61it/s]
Training 1/1 epoch (loss 1.9007): 13%|ββ | 124/938 [00:58<05:07, 2.64it/s]
Training 1/1 epoch (loss 1.9370): 13%|ββ | 124/938 [00:58<05:07, 2.64it/s]
Training 1/1 epoch (loss 1.9370): 13%|ββ | 125/938 [00:58<05:02, 2.69it/s]
Training 1/1 epoch (loss 1.9887): 13%|ββ | 125/938 [00:58<05:02, 2.69it/s]
Training 1/1 epoch (loss 1.9887): 13%|ββ | 126/938 [00:58<05:00, 2.71it/s]
Training 1/1 epoch (loss 1.7783): 13%|ββ | 126/938 [00:59<05:00, 2.71it/s]
Training 1/1 epoch (loss 1.7783): 14%|ββ | 127/938 [00:59<04:54, 2.75it/s]
Training 1/1 epoch (loss 1.8991): 14%|ββ | 127/938 [00:59<04:54, 2.75it/s]
Training 1/1 epoch (loss 1.8991): 14%|ββ | 128/938 [00:59<05:09, 2.62it/s]
Training 1/1 epoch (loss 1.9085): 14%|ββ | 128/938 [01:00<05:09, 2.62it/s]
Training 1/1 epoch (loss 1.9085): 14%|ββ | 129/938 [01:00<05:51, 2.30it/s]
Training 1/1 epoch (loss 1.8114): 14%|ββ | 129/938 [01:00<05:51, 2.30it/s]
Training 1/1 epoch (loss 1.8114): 14%|ββ | 130/938 [01:00<05:33, 2.42it/s]
Training 1/1 epoch (loss 1.8533): 14%|ββ | 130/938 [01:00<05:33, 2.42it/s]
Training 1/1 epoch (loss 1.8533): 14%|ββ | 131/938 [01:00<05:47, 2.32it/s]
Training 1/1 epoch (loss 1.7540): 14%|ββ | 131/938 [01:01<05:47, 2.32it/s]
Training 1/1 epoch (loss 1.7540): 14%|ββ | 132/938 [01:01<05:44, 2.34it/s]
Training 1/1 epoch (loss 1.9154): 14%|ββ | 132/938 [01:01<05:44, 2.34it/s]
Training 1/1 epoch (loss 1.9154): 14%|ββ | 133/938 [01:01<05:18, 2.53it/s]
Training 1/1 epoch (loss 2.0708): 14%|ββ | 133/938 [01:02<05:18, 2.53it/s]
Training 1/1 epoch (loss 2.0708): 14%|ββ | 134/938 [01:02<05:13, 2.57it/s]
Training 1/1 epoch (loss 1.7834): 14%|ββ | 134/938 [01:02<05:13, 2.57it/s]
Training 1/1 epoch (loss 1.7834): 14%|ββ | 135/938 [01:02<05:08, 2.60it/s]
Training 1/1 epoch (loss 1.8876): 14%|ββ | 135/938 [01:02<05:08, 2.60it/s]
Training 1/1 epoch (loss 1.8876): 14%|ββ | 136/938 [01:02<04:51, 2.75it/s]
Training 1/1 epoch (loss 1.9396): 14%|ββ | 136/938 [01:03<04:51, 2.75it/s]
Training 1/1 epoch (loss 1.9396): 15%|ββ | 137/938 [01:03<04:55, 2.72it/s]
Training 1/1 epoch (loss 1.9331): 15%|ββ | 137/938 [01:03<04:55, 2.72it/s]
Training 1/1 epoch (loss 1.9331): 15%|ββ | 138/938 [01:03<04:44, 2.81it/s]
Training 1/1 epoch (loss 1.8867): 15%|ββ | 138/938 [01:03<04:44, 2.81it/s]
Training 1/1 epoch (loss 1.8867): 15%|ββ | 139/938 [01:03<04:40, 2.85it/s]
Training 1/1 epoch (loss 1.9144): 15%|ββ | 139/938 [01:04<04:40, 2.85it/s]
Training 1/1 epoch (loss 1.9144): 15%|ββ | 140/938 [01:04<04:33, 2.91it/s]
Training 1/1 epoch (loss 1.9752): 15%|ββ | 140/938 [01:04<04:33, 2.91it/s]
Training 1/1 epoch (loss 1.9752): 15%|ββ | 141/938 [01:04<05:13, 2.54it/s]
Training 1/1 epoch (loss 1.9065): 15%|ββ | 141/938 [01:05<05:13, 2.54it/s]
Training 1/1 epoch (loss 1.9065): 15%|ββ | 142/938 [01:05<05:33, 2.39it/s]
Training 1/1 epoch (loss 1.8340): 15%|ββ | 142/938 [01:05<05:33, 2.39it/s]
Training 1/1 epoch (loss 1.8340): 15%|ββ | 143/938 [01:05<05:14, 2.53it/s]
Training 1/1 epoch (loss 1.9820): 15%|ββ | 143/938 [01:05<05:14, 2.53it/s]
Training 1/1 epoch (loss 1.9820): 15%|ββ | 144/938 [01:05<05:14, 2.52it/s]
Training 1/1 epoch (loss 1.8798): 15%|ββ | 144/938 [01:06<05:14, 2.52it/s]
Training 1/1 epoch (loss 1.8798): 15%|ββ | 145/938 [01:06<05:01, 2.63it/s]
Training 1/1 epoch (loss 1.8653): 15%|ββ | 145/938 [01:06<05:01, 2.63it/s]
Training 1/1 epoch (loss 1.8653): 16%|ββ | 146/938 [01:06<04:55, 2.68it/s]
Training 1/1 epoch (loss 1.6954): 16%|ββ | 146/938 [01:07<04:55, 2.68it/s]
Training 1/1 epoch (loss 1.6954): 16%|ββ | 147/938 [01:07<06:22, 2.07it/s]
Training 1/1 epoch (loss 2.0651): 16%|ββ | 147/938 [01:07<06:22, 2.07it/s]
Training 1/1 epoch (loss 2.0651): 16%|ββ | 148/938 [01:07<05:50, 2.25it/s]
Training 1/1 epoch (loss 1.9874): 16%|ββ | 148/938 [01:08<05:50, 2.25it/s]
Training 1/1 epoch (loss 1.9874): 16%|ββ | 149/938 [01:08<06:18, 2.08it/s]
Training 1/1 epoch (loss 1.7731): 16%|ββ | 149/938 [01:08<06:18, 2.08it/s]
Training 1/1 epoch (loss 1.7731): 16%|ββ | 150/938 [01:08<05:51, 2.24it/s]
Training 1/1 epoch (loss 1.9317): 16%|ββ | 150/938 [01:08<05:51, 2.24it/s]
Training 1/1 epoch (loss 1.9317): 16%|ββ | 151/938 [01:08<05:20, 2.46it/s]
Training 1/1 epoch (loss 1.8224): 16%|ββ | 151/938 [01:09<05:20, 2.46it/s]
Training 1/1 epoch (loss 1.8224): 16%|ββ | 152/938 [01:09<05:13, 2.51it/s]
Training 1/1 epoch (loss 1.8990): 16%|ββ | 152/938 [01:09<05:13, 2.51it/s]
Training 1/1 epoch (loss 1.8990): 16%|ββ | 153/938 [01:09<05:00, 2.61it/s]
Training 1/1 epoch (loss 1.7595): 16%|ββ | 153/938 [01:10<05:00, 2.61it/s]
Training 1/1 epoch (loss 1.7595): 16%|ββ | 154/938 [01:10<05:01, 2.60it/s]
Training 1/1 epoch (loss 1.9885): 16%|ββ | 154/938 [01:10<05:01, 2.60it/s]
Training 1/1 epoch (loss 1.9885): 17%|ββ | 155/938 [01:10<04:40, 2.79it/s]
Training 1/1 epoch (loss 1.8749): 17%|ββ | 155/938 [01:10<04:40, 2.79it/s]
Training 1/1 epoch (loss 1.8749): 17%|ββ | 156/938 [01:10<04:46, 2.73it/s]
Training 1/1 epoch (loss 1.9154): 17%|ββ | 156/938 [01:11<04:46, 2.73it/s]
Training 1/1 epoch (loss 1.9154): 17%|ββ | 157/938 [01:11<05:40, 2.29it/s]
Training 1/1 epoch (loss 1.8001): 17%|ββ | 157/938 [01:11<05:40, 2.29it/s]
Training 1/1 epoch (loss 1.8001): 17%|ββ | 158/938 [01:11<06:07, 2.12it/s]
Training 1/1 epoch (loss 1.9466): 17%|ββ | 158/938 [01:12<06:07, 2.12it/s]
Training 1/1 epoch (loss 1.9466): 17%|ββ | 159/938 [01:12<06:04, 2.14it/s]
Training 1/1 epoch (loss 1.9001): 17%|ββ | 159/938 [01:12<06:04, 2.14it/s]
Training 1/1 epoch (loss 1.9001): 17%|ββ | 160/938 [01:12<06:10, 2.10it/s]
Training 1/1 epoch (loss 1.7853): 17%|ββ | 160/938 [01:13<06:10, 2.10it/s]
Training 1/1 epoch (loss 1.7853): 17%|ββ | 161/938 [01:13<06:07, 2.11it/s]
Training 1/1 epoch (loss 1.9007): 17%|ββ | 161/938 [01:13<06:07, 2.11it/s]
Training 1/1 epoch (loss 1.9007): 17%|ββ | 162/938 [01:13<06:23, 2.03it/s]
Training 1/1 epoch (loss 1.9421): 17%|ββ | 162/938 [01:14<06:23, 2.03it/s]
Training 1/1 epoch (loss 1.9421): 17%|ββ | 163/938 [01:14<06:09, 2.10it/s]
Training 1/1 epoch (loss 1.8834): 17%|ββ | 163/938 [01:14<06:09, 2.10it/s]
Training 1/1 epoch (loss 1.8834): 17%|ββ | 164/938 [01:14<05:41, 2.26it/s]
Training 1/1 epoch (loss 1.8991): 17%|ββ | 164/938 [01:15<05:41, 2.26it/s]
Training 1/1 epoch (loss 1.8991): 18%|ββ | 165/938 [01:15<05:51, 2.20it/s]
Training 1/1 epoch (loss 1.8145): 18%|ββ | 165/938 [01:15<05:51, 2.20it/s]
Training 1/1 epoch (loss 1.8145): 18%|ββ | 166/938 [01:15<05:45, 2.23it/s]
Training 1/1 epoch (loss 1.9202): 18%|ββ | 166/938 [01:15<05:45, 2.23it/s]
Training 1/1 epoch (loss 1.9202): 18%|ββ | 167/938 [01:15<05:44, 2.24it/s]
Training 1/1 epoch (loss 1.7263): 18%|ββ | 167/938 [01:16<05:44, 2.24it/s]
Training 1/1 epoch (loss 1.7263): 18%|ββ | 168/938 [01:16<05:28, 2.35it/s]
Training 1/1 epoch (loss 2.0150): 18%|ββ | 168/938 [01:16<05:28, 2.35it/s]
Training 1/1 epoch (loss 2.0150): 18%|ββ | 169/938 [01:16<05:17, 2.42it/s]
Training 1/1 epoch (loss 1.8856): 18%|ββ | 169/938 [01:17<05:17, 2.42it/s]
Training 1/1 epoch (loss 1.8856): 18%|ββ | 170/938 [01:17<05:37, 2.27it/s]
Training 1/1 epoch (loss 1.8593): 18%|ββ | 170/938 [01:17<05:37, 2.27it/s]
Training 1/1 epoch (loss 1.8593): 18%|ββ | 171/938 [01:17<05:27, 2.34it/s]
Training 1/1 epoch (loss 1.7799): 18%|ββ | 171/938 [01:18<05:27, 2.34it/s]
Training 1/1 epoch (loss 1.7799): 18%|ββ | 172/938 [01:18<05:25, 2.36it/s]
Training 1/1 epoch (loss 1.9062): 18%|ββ | 172/938 [01:18<05:25, 2.36it/s]
Training 1/1 epoch (loss 1.9062): 18%|ββ | 173/938 [01:18<05:06, 2.49it/s]
Training 1/1 epoch (loss 2.0347): 18%|ββ | 173/938 [01:18<05:06, 2.49it/s]
Training 1/1 epoch (loss 2.0347): 19%|ββ | 174/938 [01:18<04:57, 2.57it/s]
Training 1/1 epoch (loss 1.9494): 19%|ββ | 174/938 [01:19<04:57, 2.57it/s]
Training 1/1 epoch (loss 1.9494): 19%|ββ | 175/938 [01:19<04:49, 2.64it/s]
Training 1/1 epoch (loss 1.6884): 19%|ββ | 175/938 [01:19<04:49, 2.64it/s]
Training 1/1 epoch (loss 1.6884): 19%|ββ | 176/938 [01:19<04:52, 2.60it/s]
Training 1/1 epoch (loss 1.8310): 19%|ββ | 176/938 [01:19<04:52, 2.60it/s]
Training 1/1 epoch (loss 1.8310): 19%|ββ | 177/938 [01:19<04:46, 2.66it/s]
Training 1/1 epoch (loss 1.8946): 19%|ββ | 177/938 [01:20<04:46, 2.66it/s]
Training 1/1 epoch (loss 1.8946): 19%|ββ | 178/938 [01:20<04:46, 2.65it/s]
Training 1/1 epoch (loss 1.9684): 19%|ββ | 178/938 [01:20<04:46, 2.65it/s]
Training 1/1 epoch (loss 1.9684): 19%|ββ | 179/938 [01:20<04:42, 2.69it/s]
Training 1/1 epoch (loss 1.8251): 19%|ββ | 179/938 [01:20<04:42, 2.69it/s]
Training 1/1 epoch (loss 1.8251): 19%|ββ | 180/938 [01:20<04:43, 2.68it/s]
Training 1/1 epoch (loss 1.8193): 19%|ββ | 180/938 [01:21<04:43, 2.68it/s]
Training 1/1 epoch (loss 1.8193): 19%|ββ | 181/938 [01:21<05:01, 2.51it/s]
Training 1/1 epoch (loss 1.7255): 19%|ββ | 181/938 [01:21<05:01, 2.51it/s]
Training 1/1 epoch (loss 1.7255): 19%|ββ | 182/938 [01:21<05:01, 2.50it/s]
Training 1/1 epoch (loss 1.9876): 19%|ββ | 182/938 [01:22<05:01, 2.50it/s]
Training 1/1 epoch (loss 1.9876): 20%|ββ | 183/938 [01:22<05:01, 2.50it/s]
Training 1/1 epoch (loss 1.8105): 20%|ββ | 183/938 [01:22<05:01, 2.50it/s]
Training 1/1 epoch (loss 1.8105): 20%|ββ | 184/938 [01:22<04:59, 2.52it/s]
Training 1/1 epoch (loss 1.9081): 20%|ββ | 184/938 [01:23<04:59, 2.52it/s]
Training 1/1 epoch (loss 1.9081): 20%|ββ | 185/938 [01:23<04:53, 2.57it/s]
Training 1/1 epoch (loss 1.8328): 20%|ββ | 185/938 [01:23<04:53, 2.57it/s]
Training 1/1 epoch (loss 1.8328): 20%|ββ | 186/938 [01:23<05:16, 2.38it/s]
Training 1/1 epoch (loss 1.6989): 20%|ββ | 186/938 [01:23<05:16, 2.38it/s]
Training 1/1 epoch (loss 1.6989): 20%|ββ | 187/938 [01:23<05:24, 2.32it/s]
Training 1/1 epoch (loss 1.8318): 20%|ββ | 187/938 [01:24<05:24, 2.32it/s]
Training 1/1 epoch (loss 1.8318): 20%|ββ | 188/938 [01:24<05:07, 2.44it/s]
Training 1/1 epoch (loss 1.8736): 20%|ββ | 188/938 [01:24<05:07, 2.44it/s]
Training 1/1 epoch (loss 1.8736): 20%|ββ | 189/938 [01:24<05:10, 2.41it/s]
Training 1/1 epoch (loss 2.1595): 20%|ββ | 189/938 [01:25<05:10, 2.41it/s]
Training 1/1 epoch (loss 2.1595): 20%|ββ | 190/938 [01:25<05:25, 2.30it/s]
Training 1/1 epoch (loss 1.8147): 20%|ββ | 190/938 [01:25<05:25, 2.30it/s]
Training 1/1 epoch (loss 1.8147): 20%|ββ | 191/938 [01:25<05:29, 2.26it/s]
Training 1/1 epoch (loss 1.8938): 20%|ββ | 191/938 [01:26<05:29, 2.26it/s]
Training 1/1 epoch (loss 1.8938): 20%|ββ | 192/938 [01:26<05:23, 2.31it/s]
Training 1/1 epoch (loss 1.8868): 20%|ββ | 192/938 [01:26<05:23, 2.31it/s]
Training 1/1 epoch (loss 1.8868): 21%|ββ | 193/938 [01:26<05:04, 2.44it/s]
Training 1/1 epoch (loss 1.8710): 21%|ββ | 193/938 [01:26<05:04, 2.44it/s]
Training 1/1 epoch (loss 1.8710): 21%|ββ | 194/938 [01:26<05:00, 2.48it/s]
Training 1/1 epoch (loss 1.8852): 21%|ββ | 194/938 [01:27<05:00, 2.48it/s]
Training 1/1 epoch (loss 1.8852): 21%|ββ | 195/938 [01:27<05:05, 2.43it/s]
Training 1/1 epoch (loss 1.8484): 21%|ββ | 195/938 [01:27<05:05, 2.43it/s]
Training 1/1 epoch (loss 1.8484): 21%|ββ | 196/938 [01:27<05:06, 2.42it/s]
Training 1/1 epoch (loss 1.9611): 21%|ββ | 196/938 [01:28<05:06, 2.42it/s]
Training 1/1 epoch (loss 1.9611): 21%|ββ | 197/938 [01:28<04:59, 2.47it/s]
Training 1/1 epoch (loss 1.7724): 21%|ββ | 197/938 [01:28<04:59, 2.47it/s]
Training 1/1 epoch (loss 1.7724): 21%|ββ | 198/938 [01:28<04:43, 2.61it/s]
Training 1/1 epoch (loss 1.8703): 21%|ββ | 198/938 [01:28<04:43, 2.61it/s]
Training 1/1 epoch (loss 1.8703): 21%|ββ | 199/938 [01:28<04:35, 2.68it/s]
Training 1/1 epoch (loss 1.8068): 21%|ββ | 199/938 [01:29<04:35, 2.68it/s]
Training 1/1 epoch (loss 1.8068): 21%|βββ | 200/938 [01:29<04:46, 2.57it/s]
Training 1/1 epoch (loss 1.8575): 21%|βββ | 200/938 [01:29<04:46, 2.57it/s]
Training 1/1 epoch (loss 1.8575): 21%|βββ | 201/938 [01:29<04:50, 2.53it/s]
Training 1/1 epoch (loss 1.9390): 21%|βββ | 201/938 [01:30<04:50, 2.53it/s]
Training 1/1 epoch (loss 1.9390): 22%|βββ | 202/938 [01:30<05:13, 2.34it/s]
Training 1/1 epoch (loss 1.9038): 22%|βββ | 202/938 [01:30<05:13, 2.34it/s]
Training 1/1 epoch (loss 1.9038): 22%|βββ | 203/938 [01:30<04:51, 2.52it/s]
Training 1/1 epoch (loss 1.8200): 22%|βββ | 203/938 [01:30<04:51, 2.52it/s]
Training 1/1 epoch (loss 1.8200): 22%|βββ | 204/938 [01:30<04:37, 2.64it/s]
Training 1/1 epoch (loss 1.8145): 22%|βββ | 204/938 [01:31<04:37, 2.64it/s]
Training 1/1 epoch (loss 1.8145): 22%|βββ | 205/938 [01:31<04:39, 2.62it/s]
Training 1/1 epoch (loss 1.9717): 22%|βββ | 205/938 [01:31<04:39, 2.62it/s]
Training 1/1 epoch (loss 1.9717): 22%|βββ | 206/938 [01:31<04:41, 2.60it/s]
Training 1/1 epoch (loss 1.9190): 22%|βββ | 206/938 [01:31<04:41, 2.60it/s]
Training 1/1 epoch (loss 1.9190): 22%|βββ | 207/938 [01:31<04:40, 2.60it/s]
Training 1/1 epoch (loss 1.9296): 22%|βββ | 207/938 [01:32<04:40, 2.60it/s]
Training 1/1 epoch (loss 1.9296): 22%|βββ | 208/938 [01:32<04:42, 2.58it/s]
Training 1/1 epoch (loss 1.8120): 22%|βββ | 208/938 [01:32<04:42, 2.58it/s]
Training 1/1 epoch (loss 1.8120): 22%|βββ | 209/938 [01:32<04:38, 2.62it/s]
Training 1/1 epoch (loss 1.8017): 22%|βββ | 209/938 [01:33<04:38, 2.62it/s]
Training 1/1 epoch (loss 1.8017): 22%|βββ | 210/938 [01:33<04:33, 2.66it/s]
Training 1/1 epoch (loss 1.9044): 22%|βββ | 210/938 [01:33<04:33, 2.66it/s]
Training 1/1 epoch (loss 1.9044): 22%|βββ | 211/938 [01:33<04:33, 2.66it/s]
Training 1/1 epoch (loss 1.7307): 22%|βββ | 211/938 [01:33<04:33, 2.66it/s]
Training 1/1 epoch (loss 1.7307): 23%|βββ | 212/938 [01:33<04:44, 2.55it/s]
Training 1/1 epoch (loss 1.7741): 23%|βββ | 212/938 [01:34<04:44, 2.55it/s]
Training 1/1 epoch (loss 1.7741): 23%|βββ | 213/938 [01:34<04:36, 2.62it/s]
Training 1/1 epoch (loss 1.8641): 23%|βββ | 213/938 [01:34<04:36, 2.62it/s]
Training 1/1 epoch (loss 1.8641): 23%|βββ | 214/938 [01:34<04:43, 2.56it/s]
Training 1/1 epoch (loss 1.8841): 23%|βββ | 214/938 [01:35<04:43, 2.56it/s]
Training 1/1 epoch (loss 1.8841): 23%|βββ | 215/938 [01:35<05:06, 2.36it/s]
Training 1/1 epoch (loss 1.7418): 23%|βββ | 215/938 [01:35<05:06, 2.36it/s]
Training 1/1 epoch (loss 1.7418): 23%|βββ | 216/938 [01:35<05:21, 2.25it/s]
Training 1/1 epoch (loss 1.9979): 23%|βββ | 216/938 [01:36<05:21, 2.25it/s]
Training 1/1 epoch (loss 1.9979): 23%|βββ | 217/938 [01:36<05:15, 2.29it/s]
Training 1/1 epoch (loss 1.8798): 23%|βββ | 217/938 [01:36<05:15, 2.29it/s]
Training 1/1 epoch (loss 1.8798): 23%|βββ | 218/938 [01:36<04:53, 2.45it/s]
Training 1/1 epoch (loss 1.8203): 23%|βββ | 218/938 [01:36<04:53, 2.45it/s]
Training 1/1 epoch (loss 1.8203): 23%|βββ | 219/938 [01:36<04:42, 2.55it/s]
Training 1/1 epoch (loss 1.7585): 23%|βββ | 219/938 [01:37<04:42, 2.55it/s]
Training 1/1 epoch (loss 1.7585): 23%|βββ | 220/938 [01:37<04:21, 2.74it/s]
Training 1/1 epoch (loss 1.9284): 23%|βββ | 220/938 [01:37<04:21, 2.74it/s]
Training 1/1 epoch (loss 1.9284): 24%|βββ | 221/938 [01:37<04:36, 2.59it/s]
Training 1/1 epoch (loss 1.7878): 24%|βββ | 221/938 [01:37<04:36, 2.59it/s]
Training 1/1 epoch (loss 1.7878): 24%|βββ | 222/938 [01:37<05:02, 2.37it/s]
Training 1/1 epoch (loss 1.9698): 24%|βββ | 222/938 [01:38<05:02, 2.37it/s]
Training 1/1 epoch (loss 1.9698): 24%|βββ | 223/938 [01:38<04:53, 2.44it/s]
Training 1/1 epoch (loss 1.9037): 24%|βββ | 223/938 [01:38<04:53, 2.44it/s]
Training 1/1 epoch (loss 1.9037): 24%|βββ | 224/938 [01:38<04:46, 2.49it/s]
Training 1/1 epoch (loss 1.8851): 24%|βββ | 224/938 [01:39<04:46, 2.49it/s]
Training 1/1 epoch (loss 1.8851): 24%|βββ | 225/938 [01:39<04:40, 2.54it/s]
Training 1/1 epoch (loss 1.9441): 24%|βββ | 225/938 [01:39<04:40, 2.54it/s]
Training 1/1 epoch (loss 1.9441): 24%|βββ | 226/938 [01:39<05:00, 2.37it/s]
Training 1/1 epoch (loss 1.8779): 24%|βββ | 226/938 [01:40<05:00, 2.37it/s]
Training 1/1 epoch (loss 1.8779): 24%|βββ | 227/938 [01:40<05:15, 2.25it/s]
Training 1/1 epoch (loss 1.8950): 24%|βββ | 227/938 [01:40<05:15, 2.25it/s]
Training 1/1 epoch (loss 1.8950): 24%|βββ | 228/938 [01:40<04:57, 2.38it/s]
Training 1/1 epoch (loss 1.8933): 24%|βββ | 228/938 [01:40<04:57, 2.38it/s]
Training 1/1 epoch (loss 1.8933): 24%|βββ | 229/938 [01:40<04:51, 2.43it/s]
Training 1/1 epoch (loss 1.9389): 24%|βββ | 229/938 [01:41<04:51, 2.43it/s]
Training 1/1 epoch (loss 1.9389): 25%|βββ | 230/938 [01:41<05:02, 2.34it/s]
Training 1/1 epoch (loss 1.8613): 25%|βββ | 230/938 [01:41<05:02, 2.34it/s]
Training 1/1 epoch (loss 1.8613): 25%|βββ | 231/938 [01:41<05:21, 2.20it/s]
Training 1/1 epoch (loss 1.8758): 25%|βββ | 231/938 [01:42<05:21, 2.20it/s]
Training 1/1 epoch (loss 1.8758): 25%|βββ | 232/938 [01:42<05:26, 2.16it/s]
Training 1/1 epoch (loss 1.8951): 25%|βββ | 232/938 [01:42<05:26, 2.16it/s]
Training 1/1 epoch (loss 1.8951): 25%|βββ | 233/938 [01:42<05:16, 2.23it/s]
Training 1/1 epoch (loss 1.8631): 25%|βββ | 233/938 [01:43<05:16, 2.23it/s]
Training 1/1 epoch (loss 1.8631): 25%|βββ | 234/938 [01:43<05:21, 2.19it/s]
Training 1/1 epoch (loss 1.8404): 25%|βββ | 234/938 [01:43<05:21, 2.19it/s]
Training 1/1 epoch (loss 1.8404): 25%|βββ | 235/938 [01:43<05:29, 2.13it/s]
Training 1/1 epoch (loss 1.7809): 25%|βββ | 235/938 [01:44<05:29, 2.13it/s]
Training 1/1 epoch (loss 1.7809): 25%|βββ | 236/938 [01:44<05:21, 2.18it/s]
Training 1/1 epoch (loss 1.9851): 25%|βββ | 236/938 [01:44<05:21, 2.18it/s]
Training 1/1 epoch (loss 1.9851): 25%|βββ | 237/938 [01:44<05:06, 2.29it/s]
Training 1/1 epoch (loss 1.7303): 25%|βββ | 237/938 [01:44<05:06, 2.29it/s]
Training 1/1 epoch (loss 1.7303): 25%|βββ | 238/938 [01:44<05:12, 2.24it/s]
Training 1/1 epoch (loss 1.7647): 25%|βββ | 238/938 [01:45<05:12, 2.24it/s]
Training 1/1 epoch (loss 1.7647): 25%|βββ | 239/938 [01:45<05:12, 2.24it/s]
Training 1/1 epoch (loss 1.8432): 25%|βββ | 239/938 [01:46<05:12, 2.24it/s]
Training 1/1 epoch (loss 1.8432): 26%|βββ | 240/938 [01:46<05:39, 2.06it/s]
Training 1/1 epoch (loss 1.9860): 26%|βββ | 240/938 [01:46<05:39, 2.06it/s]
Training 1/1 epoch (loss 1.9860): 26%|βββ | 241/938 [01:46<05:23, 2.16it/s]
Training 1/1 epoch (loss 1.9164): 26%|βββ | 241/938 [01:46<05:23, 2.16it/s]
Training 1/1 epoch (loss 1.9164): 26%|βββ | 242/938 [01:46<05:00, 2.32it/s]
Training 1/1 epoch (loss 1.9111): 26%|βββ | 242/938 [01:47<05:00, 2.32it/s]
Training 1/1 epoch (loss 1.9111): 26%|βββ | 243/938 [01:47<04:53, 2.37it/s]
Training 1/1 epoch (loss 1.7805): 26%|βββ | 243/938 [01:47<04:53, 2.37it/s]
Training 1/1 epoch (loss 1.7805): 26%|βββ | 244/938 [01:47<04:44, 2.44it/s]
Training 1/1 epoch (loss 1.7904): 26%|βββ | 244/938 [01:47<04:44, 2.44it/s]
Training 1/1 epoch (loss 1.7904): 26%|βββ | 245/938 [01:47<04:46, 2.42it/s]
Training 1/1 epoch (loss 1.8602): 26%|βββ | 245/938 [01:48<04:46, 2.42it/s]
Training 1/1 epoch (loss 1.8602): 26%|βββ | 246/938 [01:48<04:51, 2.38it/s]
Training 1/1 epoch (loss 1.8572): 26%|βββ | 246/938 [01:48<04:51, 2.38it/s]
Training 1/1 epoch (loss 1.8572): 26%|βββ | 247/938 [01:48<04:35, 2.51it/s]
Training 1/1 epoch (loss 1.7882): 26%|βββ | 247/938 [01:49<04:35, 2.51it/s]
Training 1/1 epoch (loss 1.7882): 26%|βββ | 248/938 [01:49<04:24, 2.61it/s]
Training 1/1 epoch (loss 1.8923): 26%|βββ | 248/938 [01:49<04:24, 2.61it/s]
Training 1/1 epoch (loss 1.8923): 27%|βββ | 249/938 [01:49<04:17, 2.68it/s]
Training 1/1 epoch (loss 1.6846): 27%|βββ | 249/938 [01:49<04:17, 2.68it/s]
Training 1/1 epoch (loss 1.6846): 27%|βββ | 250/938 [01:49<04:17, 2.67it/s]
Training 1/1 epoch (loss 1.9552): 27%|βββ | 250/938 [01:50<04:17, 2.67it/s]
Training 1/1 epoch (loss 1.9552): 27%|βββ | 251/938 [01:50<04:29, 2.55it/s]
Training 1/1 epoch (loss 1.8999): 27%|βββ | 251/938 [01:50<04:29, 2.55it/s]
Training 1/1 epoch (loss 1.8999): 27%|βββ | 252/938 [01:50<04:28, 2.56it/s]
Training 1/1 epoch (loss 1.8726): 27%|βββ | 252/938 [01:50<04:28, 2.56it/s]
Training 1/1 epoch (loss 1.8726): 27%|βββ | 253/938 [01:50<04:14, 2.69it/s]
Training 1/1 epoch (loss 1.8620): 27%|βββ | 253/938 [01:51<04:14, 2.69it/s]
Training 1/1 epoch (loss 1.8620): 27%|βββ | 254/938 [01:51<04:18, 2.65it/s]
Training 1/1 epoch (loss 1.9170): 27%|βββ | 254/938 [01:51<04:18, 2.65it/s]
Training 1/1 epoch (loss 1.9170): 27%|βββ | 255/938 [01:51<04:16, 2.67it/s]
Training 1/1 epoch (loss 1.8188): 27%|βββ | 255/938 [01:52<04:16, 2.67it/s]
Training 1/1 epoch (loss 1.8188): 27%|βββ | 256/938 [01:52<04:20, 2.62it/s]
Training 1/1 epoch (loss 1.8237): 27%|βββ | 256/938 [01:52<04:20, 2.62it/s]
Training 1/1 epoch (loss 1.8237): 27%|βββ | 257/938 [01:52<04:19, 2.62it/s]
Training 1/1 epoch (loss 1.8898): 27%|βββ | 257/938 [01:52<04:19, 2.62it/s]
Training 1/1 epoch (loss 1.8898): 28%|βββ | 258/938 [01:52<04:19, 2.62it/s]
Training 1/1 epoch (loss 1.9217): 28%|βββ | 258/938 [01:53<04:19, 2.62it/s]
Training 1/1 epoch (loss 1.9217): 28%|βββ | 259/938 [01:53<04:23, 2.57it/s]
Training 1/1 epoch (loss 1.7992): 28%|βββ | 259/938 [01:53<04:23, 2.57it/s]
Training 1/1 epoch (loss 1.7992): 28%|βββ | 260/938 [01:53<04:21, 2.59it/s]
Training 1/1 epoch (loss 1.8333): 28%|βββ | 260/938 [01:54<04:21, 2.59it/s]
Training 1/1 epoch (loss 1.8333): 28%|βββ | 261/938 [01:54<04:47, 2.36it/s]
Training 1/1 epoch (loss 1.8476): 28%|βββ | 261/938 [01:54<04:47, 2.36it/s]
Training 1/1 epoch (loss 1.8476): 28%|βββ | 262/938 [01:54<04:33, 2.48it/s]
Training 1/1 epoch (loss 1.7293): 28%|βββ | 262/938 [01:55<04:33, 2.48it/s]
Training 1/1 epoch (loss 1.7293): 28%|βββ | 263/938 [01:55<04:43, 2.38it/s]
Training 1/1 epoch (loss 2.0098): 28%|βββ | 263/938 [01:55<04:43, 2.38it/s]
Training 1/1 epoch (loss 2.0098): 28%|βββ | 264/938 [01:55<04:45, 2.36it/s]
Training 1/1 epoch (loss 1.8749): 28%|βββ | 264/938 [01:55<04:45, 2.36it/s]
Training 1/1 epoch (loss 1.8749): 28%|βββ | 265/938 [01:55<04:40, 2.40it/s]
Training 1/1 epoch (loss 1.9167): 28%|βββ | 265/938 [01:56<04:40, 2.40it/s]
Training 1/1 epoch (loss 1.9167): 28%|βββ | 266/938 [01:56<05:07, 2.18it/s]
Training 1/1 epoch (loss 1.7122): 28%|βββ | 266/938 [01:56<05:07, 2.18it/s]
Training 1/1 epoch (loss 1.7122): 28%|βββ | 267/938 [01:56<04:41, 2.38it/s]
Training 1/1 epoch (loss 1.7892): 28%|βββ | 267/938 [01:57<04:41, 2.38it/s]
Training 1/1 epoch (loss 1.7892): 29%|βββ | 268/938 [01:57<04:24, 2.53it/s]
Training 1/1 epoch (loss 1.8174): 29%|βββ | 268/938 [01:57<04:24, 2.53it/s]
Training 1/1 epoch (loss 1.8174): 29%|βββ | 269/938 [01:57<04:17, 2.59it/s]
Training 1/1 epoch (loss 1.8428): 29%|βββ | 269/938 [01:57<04:17, 2.59it/s]
Training 1/1 epoch (loss 1.8428): 29%|βββ | 270/938 [01:57<04:10, 2.67it/s]
Training 1/1 epoch (loss 1.9387): 29%|βββ | 270/938 [01:58<04:10, 2.67it/s]
Training 1/1 epoch (loss 1.9387): 29%|βββ | 271/938 [01:58<04:14, 2.62it/s]
Training 1/1 epoch (loss 1.7537): 29%|βββ | 271/938 [01:58<04:14, 2.62it/s]
Training 1/1 epoch (loss 1.7537): 29%|βββ | 272/938 [01:58<04:24, 2.52it/s]
Training 1/1 epoch (loss 1.8947): 29%|βββ | 272/938 [01:58<04:24, 2.52it/s]
Training 1/1 epoch (loss 1.8947): 29%|βββ | 273/938 [01:58<04:13, 2.62it/s]
Training 1/1 epoch (loss 1.9248): 29%|βββ | 273/938 [01:59<04:13, 2.62it/s]
Training 1/1 epoch (loss 1.9248): 29%|βββ | 274/938 [01:59<04:14, 2.61it/s]
Training 1/1 epoch (loss 2.0335): 29%|βββ | 274/938 [01:59<04:14, 2.61it/s]
Training 1/1 epoch (loss 2.0335): 29%|βββ | 275/938 [01:59<04:16, 2.59it/s]
Training 1/1 epoch (loss 1.9129): 29%|βββ | 275/938 [02:00<04:16, 2.59it/s]
Training 1/1 epoch (loss 1.9129): 29%|βββ | 276/938 [02:00<04:33, 2.42it/s]
Training 1/1 epoch (loss 1.9431): 29%|βββ | 276/938 [02:00<04:33, 2.42it/s]
Training 1/1 epoch (loss 1.9431): 30%|βββ | 277/938 [02:00<04:25, 2.49it/s]
Training 1/1 epoch (loss 1.8247): 30%|βββ | 277/938 [02:00<04:25, 2.49it/s]
Training 1/1 epoch (loss 1.8247): 30%|βββ | 278/938 [02:00<04:14, 2.59it/s]
Training 1/1 epoch (loss 1.8989): 30%|βββ | 278/938 [02:01<04:14, 2.59it/s]
Training 1/1 epoch (loss 1.8989): 30%|βββ | 279/938 [02:01<04:08, 2.66it/s]
Training 1/1 epoch (loss 1.7479): 30%|βββ | 279/938 [02:01<04:08, 2.66it/s]
Training 1/1 epoch (loss 1.7479): 30%|βββ | 280/938 [02:01<04:05, 2.68it/s]
Training 1/1 epoch (loss 1.8172): 30%|βββ | 280/938 [02:02<04:05, 2.68it/s]
Training 1/1 epoch (loss 1.8172): 30%|βββ | 281/938 [02:02<04:09, 2.63it/s]
Training 1/1 epoch (loss 1.7444): 30%|βββ | 281/938 [02:02<04:09, 2.63it/s]
Training 1/1 epoch (loss 1.7444): 30%|βββ | 282/938 [02:02<04:22, 2.50it/s]
Training 1/1 epoch (loss 2.0121): 30%|βββ | 282/938 [02:02<04:22, 2.50it/s]
Training 1/1 epoch (loss 2.0121): 30%|βββ | 283/938 [02:02<04:12, 2.60it/s]
Training 1/1 epoch (loss 1.8259): 30%|βββ | 283/938 [02:03<04:12, 2.60it/s]
Training 1/1 epoch (loss 1.8259): 30%|βββ | 284/938 [02:03<04:01, 2.71it/s]
Training 1/1 epoch (loss 1.7686): 30%|βββ | 284/938 [02:03<04:01, 2.71it/s]
Training 1/1 epoch (loss 1.7686): 30%|βββ | 285/938 [02:03<03:57, 2.74it/s]
Training 1/1 epoch (loss 1.8979): 30%|βββ | 285/938 [02:04<03:57, 2.74it/s]
Training 1/1 epoch (loss 1.8979): 30%|βββ | 286/938 [02:04<04:16, 2.54it/s]
Training 1/1 epoch (loss 1.7733): 30%|βββ | 286/938 [02:04<04:16, 2.54it/s]
Training 1/1 epoch (loss 1.7733): 31%|βββ | 287/938 [02:04<04:27, 2.43it/s]
Training 1/1 epoch (loss 1.9544): 31%|βββ | 287/938 [02:04<04:27, 2.43it/s]
Training 1/1 epoch (loss 1.9544): 31%|βββ | 288/938 [02:04<04:37, 2.35it/s]
Training 1/1 epoch (loss 1.9884): 31%|βββ | 288/938 [02:05<04:37, 2.35it/s]
Training 1/1 epoch (loss 1.9884): 31%|βββ | 289/938 [02:05<04:33, 2.37it/s]
Training 1/1 epoch (loss 1.7486): 31%|βββ | 289/938 [02:05<04:33, 2.37it/s]
Training 1/1 epoch (loss 1.7486): 31%|βββ | 290/938 [02:05<04:30, 2.39it/s]
Training 1/1 epoch (loss 1.9147): 31%|βββ | 290/938 [02:06<04:30, 2.39it/s]
Training 1/1 epoch (loss 1.9147): 31%|βββ | 291/938 [02:06<04:28, 2.41it/s]
Training 1/1 epoch (loss 1.8176): 31%|βββ | 291/938 [02:06<04:28, 2.41it/s]
Training 1/1 epoch (loss 1.8176): 31%|βββ | 292/938 [02:06<04:57, 2.17it/s]
Training 1/1 epoch (loss 1.8493): 31%|βββ | 292/938 [02:07<04:57, 2.17it/s]
Training 1/1 epoch (loss 1.8493): 31%|βββ | 293/938 [02:07<04:31, 2.38it/s]
Training 1/1 epoch (loss 1.8101): 31%|βββ | 293/938 [02:07<04:31, 2.38it/s]
Training 1/1 epoch (loss 1.8101): 31%|ββββ | 294/938 [02:07<04:24, 2.44it/s]
Training 1/1 epoch (loss 1.6357): 31%|ββββ | 294/938 [02:07<04:24, 2.44it/s]
Training 1/1 epoch (loss 1.6357): 31%|ββββ | 295/938 [02:07<04:07, 2.60it/s]
Training 1/1 epoch (loss 1.8327): 31%|ββββ | 295/938 [02:08<04:07, 2.60it/s]
Training 1/1 epoch (loss 1.8327): 32%|ββββ | 296/938 [02:08<04:16, 2.51it/s]
Training 1/1 epoch (loss 1.8795): 32%|ββββ | 296/938 [02:08<04:16, 2.51it/s]
Training 1/1 epoch (loss 1.8795): 32%|ββββ | 297/938 [02:08<04:20, 2.46it/s]
Training 1/1 epoch (loss 1.8187): 32%|ββββ | 297/938 [02:08<04:20, 2.46it/s]
Training 1/1 epoch (loss 1.8187): 32%|ββββ | 298/938 [02:08<04:09, 2.56it/s]
Training 1/1 epoch (loss 1.8888): 32%|ββββ | 298/938 [02:09<04:09, 2.56it/s]
Training 1/1 epoch (loss 1.8888): 32%|ββββ | 299/938 [02:09<04:26, 2.40it/s]
Training 1/1 epoch (loss 1.8859): 32%|ββββ | 299/938 [02:09<04:26, 2.40it/s]
Training 1/1 epoch (loss 1.8859): 32%|ββββ | 300/938 [02:09<04:15, 2.50it/s]
Training 1/1 epoch (loss 1.9081): 32%|ββββ | 300/938 [02:10<04:15, 2.50it/s]
Training 1/1 epoch (loss 1.9081): 32%|ββββ | 301/938 [02:10<04:09, 2.55it/s]
Training 1/1 epoch (loss 1.9135): 32%|ββββ | 301/938 [02:10<04:09, 2.55it/s]
Training 1/1 epoch (loss 1.9135): 32%|ββββ | 302/938 [02:10<04:21, 2.43it/s]
Training 1/1 epoch (loss 1.7485): 32%|ββββ | 302/938 [02:11<04:21, 2.43it/s]
Training 1/1 epoch (loss 1.7485): 32%|ββββ | 303/938 [02:11<04:13, 2.51it/s]
Training 1/1 epoch (loss 2.0725): 32%|ββββ | 303/938 [02:11<04:13, 2.51it/s]
Training 1/1 epoch (loss 2.0725): 32%|ββββ | 304/938 [02:11<04:13, 2.50it/s]
Training 1/1 epoch (loss 1.7930): 32%|ββββ | 304/938 [02:11<04:13, 2.50it/s]
Training 1/1 epoch (loss 1.7930): 33%|ββββ | 305/938 [02:11<04:09, 2.53it/s]
Training 1/1 epoch (loss 1.8497): 33%|ββββ | 305/938 [02:12<04:09, 2.53it/s]
Training 1/1 epoch (loss 1.8497): 33%|ββββ | 306/938 [02:12<04:17, 2.46it/s]
Training 1/1 epoch (loss 1.8780): 33%|ββββ | 306/938 [02:12<04:17, 2.46it/s]
Training 1/1 epoch (loss 1.8780): 33%|ββββ | 307/938 [02:12<04:25, 2.38it/s]
Training 1/1 epoch (loss 1.8553): 33%|ββββ | 307/938 [02:13<04:25, 2.38it/s]
Training 1/1 epoch (loss 1.8553): 33%|ββββ | 308/938 [02:13<04:11, 2.51it/s]
Training 1/1 epoch (loss 1.6953): 33%|ββββ | 308/938 [02:13<04:11, 2.51it/s]
Training 1/1 epoch (loss 1.6953): 33%|ββββ | 309/938 [02:13<04:09, 2.52it/s]
Training 1/1 epoch (loss 1.9860): 33%|ββββ | 309/938 [02:13<04:09, 2.52it/s]
Training 1/1 epoch (loss 1.9860): 33%|ββββ | 310/938 [02:13<04:03, 2.58it/s]
Training 1/1 epoch (loss 1.7154): 33%|ββββ | 310/938 [02:14<04:03, 2.58it/s]
Training 1/1 epoch (loss 1.7154): 33%|ββββ | 311/938 [02:14<04:11, 2.50it/s]
Training 1/1 epoch (loss 1.8017): 33%|ββββ | 311/938 [02:14<04:11, 2.50it/s]
Training 1/1 epoch (loss 1.8017): 33%|ββββ | 312/938 [02:14<04:37, 2.26it/s]
Training 1/1 epoch (loss 1.9533): 33%|ββββ | 312/938 [02:15<04:37, 2.26it/s]
Training 1/1 epoch (loss 1.9533): 33%|ββββ | 313/938 [02:15<04:38, 2.24it/s]
Training 1/1 epoch (loss 2.0157): 33%|ββββ | 313/938 [02:15<04:38, 2.24it/s]
Training 1/1 epoch (loss 2.0157): 33%|ββββ | 314/938 [02:15<04:18, 2.41it/s]
Training 1/1 epoch (loss 1.7254): 33%|ββββ | 314/938 [02:15<04:18, 2.41it/s]
Training 1/1 epoch (loss 1.7254): 34%|ββββ | 315/938 [02:15<04:06, 2.53it/s]
Training 1/1 epoch (loss 1.8892): 34%|ββββ | 315/938 [02:16<04:06, 2.53it/s]
Training 1/1 epoch (loss 1.8892): 34%|ββββ | 316/938 [02:16<04:11, 2.47it/s]
Training 1/1 epoch (loss 1.8375): 34%|ββββ | 316/938 [02:16<04:11, 2.47it/s]
Training 1/1 epoch (loss 1.8375): 34%|ββββ | 317/938 [02:16<04:36, 2.24it/s]
Training 1/1 epoch (loss 1.9025): 34%|ββββ | 317/938 [02:17<04:36, 2.24it/s]
Training 1/1 epoch (loss 1.9025): 34%|ββββ | 318/938 [02:17<04:41, 2.20it/s]
Training 1/1 epoch (loss 1.7319): 34%|ββββ | 318/938 [02:17<04:41, 2.20it/s]
Training 1/1 epoch (loss 1.7319): 34%|ββββ | 319/938 [02:17<04:39, 2.22it/s]
Training 1/1 epoch (loss 1.7164): 34%|ββββ | 319/938 [02:18<04:39, 2.22it/s]
Training 1/1 epoch (loss 1.7164): 34%|ββββ | 320/938 [02:18<04:44, 2.17it/s]
Training 1/1 epoch (loss 1.7835): 34%|ββββ | 320/938 [02:18<04:44, 2.17it/s]
Training 1/1 epoch (loss 1.7835): 34%|ββββ | 321/938 [02:18<04:48, 2.14it/s]
Training 1/1 epoch (loss 1.9044): 34%|ββββ | 321/938 [02:19<04:48, 2.14it/s]
Training 1/1 epoch (loss 1.9044): 34%|ββββ | 322/938 [02:19<04:40, 2.19it/s]
Training 1/1 epoch (loss 2.0008): 34%|ββββ | 322/938 [02:19<04:40, 2.19it/s]
Training 1/1 epoch (loss 2.0008): 34%|ββββ | 323/938 [02:19<04:45, 2.16it/s]
Training 1/1 epoch (loss 1.7910): 34%|ββββ | 323/938 [02:20<04:45, 2.16it/s]
Training 1/1 epoch (loss 1.7910): 35%|ββββ | 324/938 [02:20<04:36, 2.22it/s]
Training 1/1 epoch (loss 1.7461): 35%|ββββ | 324/938 [02:20<04:36, 2.22it/s]
Training 1/1 epoch (loss 1.7461): 35%|ββββ | 325/938 [02:20<04:19, 2.36it/s]
Training 1/1 epoch (loss 1.9733): 35%|ββββ | 325/938 [02:20<04:19, 2.36it/s]
Training 1/1 epoch (loss 1.9733): 35%|ββββ | 326/938 [02:20<04:15, 2.40it/s]
Training 1/1 epoch (loss 1.7810): 35%|ββββ | 326/938 [02:21<04:15, 2.40it/s]
Training 1/1 epoch (loss 1.7810): 35%|ββββ | 327/938 [02:21<04:01, 2.53it/s]
Training 1/1 epoch (loss 1.7772): 35%|ββββ | 327/938 [02:21<04:01, 2.53it/s]
Training 1/1 epoch (loss 1.7772): 35%|ββββ | 328/938 [02:21<03:58, 2.56it/s]
Training 1/1 epoch (loss 1.7311): 35%|ββββ | 328/938 [02:22<03:58, 2.56it/s]
Training 1/1 epoch (loss 1.7311): 35%|ββββ | 329/938 [02:22<04:46, 2.13it/s]
Training 1/1 epoch (loss 1.8820): 35%|ββββ | 329/938 [02:22<04:46, 2.13it/s]
Training 1/1 epoch (loss 1.8820): 35%|ββββ | 330/938 [02:22<04:35, 2.21it/s]
Training 1/1 epoch (loss 1.8134): 35%|ββββ | 330/938 [02:23<04:35, 2.21it/s]
Training 1/1 epoch (loss 1.8134): 35%|ββββ | 331/938 [02:23<04:55, 2.06it/s]
Training 1/1 epoch (loss 1.9190): 35%|ββββ | 331/938 [02:23<04:55, 2.06it/s]
Training 1/1 epoch (loss 1.9190): 35%|ββββ | 332/938 [02:23<04:29, 2.25it/s]
Training 1/1 epoch (loss 1.8669): 35%|ββββ | 332/938 [02:23<04:29, 2.25it/s]
Training 1/1 epoch (loss 1.8669): 36%|ββββ | 333/938 [02:23<04:11, 2.41it/s]
Training 1/1 epoch (loss 1.7788): 36%|ββββ | 333/938 [02:24<04:11, 2.41it/s]
Training 1/1 epoch (loss 1.7788): 36%|ββββ | 334/938 [02:24<04:19, 2.33it/s]
Training 1/1 epoch (loss 1.8716): 36%|ββββ | 334/938 [02:24<04:19, 2.33it/s]
Training 1/1 epoch (loss 1.8716): 36%|ββββ | 335/938 [02:24<04:54, 2.05it/s]
Training 1/1 epoch (loss 1.7446): 36%|ββββ | 335/938 [02:25<04:54, 2.05it/s]
Training 1/1 epoch (loss 1.7446): 36%|ββββ | 336/938 [02:25<04:58, 2.02it/s]
Training 1/1 epoch (loss 1.9250): 36%|ββββ | 336/938 [02:25<04:58, 2.02it/s]
Training 1/1 epoch (loss 1.9250): 36%|ββββ | 337/938 [02:25<04:35, 2.18it/s]
Training 1/1 epoch (loss 1.8895): 36%|ββββ | 337/938 [02:26<04:35, 2.18it/s]
Training 1/1 epoch (loss 1.8895): 36%|ββββ | 338/938 [02:26<04:18, 2.32it/s]
Training 1/1 epoch (loss 1.9291): 36%|ββββ | 338/938 [02:26<04:18, 2.32it/s]
Training 1/1 epoch (loss 1.9291): 36%|ββββ | 339/938 [02:26<04:10, 2.39it/s]
Training 1/1 epoch (loss 1.8130): 36%|ββββ | 339/938 [02:27<04:10, 2.39it/s]
Training 1/1 epoch (loss 1.8130): 36%|ββββ | 340/938 [02:27<04:06, 2.43it/s]
Training 1/1 epoch (loss 1.8379): 36%|ββββ | 340/938 [02:27<04:06, 2.43it/s]
Training 1/1 epoch (loss 1.8379): 36%|ββββ | 341/938 [02:27<04:16, 2.33it/s]
Training 1/1 epoch (loss 1.7725): 36%|ββββ | 341/938 [02:27<04:16, 2.33it/s]
Training 1/1 epoch (loss 1.7725): 36%|ββββ | 342/938 [02:27<03:58, 2.50it/s]
Training 1/1 epoch (loss 1.9056): 36%|ββββ | 342/938 [02:28<03:58, 2.50it/s]
Training 1/1 epoch (loss 1.9056): 37%|ββββ | 343/938 [02:28<03:49, 2.59it/s]
Training 1/1 epoch (loss 1.8576): 37%|ββββ | 343/938 [02:28<03:49, 2.59it/s]
Training 1/1 epoch (loss 1.8576): 37%|ββββ | 344/938 [02:28<03:54, 2.53it/s]
Training 1/1 epoch (loss 1.7557): 37%|ββββ | 344/938 [02:28<03:54, 2.53it/s]
Training 1/1 epoch (loss 1.7557): 37%|ββββ | 345/938 [02:28<03:36, 2.74it/s]
Training 1/1 epoch (loss 1.7457): 37%|ββββ | 345/938 [02:29<03:36, 2.74it/s]
Training 1/1 epoch (loss 1.7457): 37%|ββββ | 346/938 [02:29<03:31, 2.80it/s]
Training 1/1 epoch (loss 1.8075): 37%|ββββ | 346/938 [02:29<03:31, 2.80it/s]
Training 1/1 epoch (loss 1.8075): 37%|ββββ | 347/938 [02:29<03:36, 2.73it/s]
Training 1/1 epoch (loss 1.6833): 37%|ββββ | 347/938 [02:30<03:36, 2.73it/s]
Training 1/1 epoch (loss 1.6833): 37%|ββββ | 348/938 [02:30<04:04, 2.41it/s]
Training 1/1 epoch (loss 1.8114): 37%|ββββ | 348/938 [02:30<04:04, 2.41it/s]
Training 1/1 epoch (loss 1.8114): 37%|ββββ | 349/938 [02:30<04:13, 2.32it/s]
Training 1/1 epoch (loss 1.8146): 37%|ββββ | 349/938 [02:30<04:13, 2.32it/s]
Training 1/1 epoch (loss 1.8146): 37%|ββββ | 350/938 [02:30<03:52, 2.53it/s]
Training 1/1 epoch (loss 1.8923): 37%|ββββ | 350/938 [02:31<03:52, 2.53it/s]
Training 1/1 epoch (loss 1.8923): 37%|ββββ | 351/938 [02:31<03:41, 2.65it/s]
Training 1/1 epoch (loss 1.7870): 37%|ββββ | 351/938 [02:31<03:41, 2.65it/s]
Training 1/1 epoch (loss 1.7870): 38%|ββββ | 352/938 [02:31<03:37, 2.69it/s]
Training 1/1 epoch (loss 1.9732): 38%|ββββ | 352/938 [02:31<03:37, 2.69it/s]
Training 1/1 epoch (loss 1.9732): 38%|ββββ | 353/938 [02:31<03:28, 2.80it/s]
Training 1/1 epoch (loss 1.8849): 38%|ββββ | 353/938 [02:32<03:28, 2.80it/s]
Training 1/1 epoch (loss 1.8849): 38%|ββββ | 354/938 [02:32<03:37, 2.69it/s]
Training 1/1 epoch (loss 1.8610): 38%|ββββ | 354/938 [02:32<03:37, 2.69it/s]
Training 1/1 epoch (loss 1.8610): 38%|ββββ | 355/938 [02:32<03:38, 2.67it/s]
Training 1/1 epoch (loss 1.7996): 38%|ββββ | 355/938 [02:33<03:38, 2.67it/s]
Training 1/1 epoch (loss 1.7996): 38%|ββββ | 356/938 [02:33<03:38, 2.66it/s]
Training 1/1 epoch (loss 1.8451): 38%|ββββ | 356/938 [02:33<03:38, 2.66it/s]
Training 1/1 epoch (loss 1.8451): 38%|ββββ | 357/938 [02:33<03:42, 2.61it/s]
Training 1/1 epoch (loss 1.8120): 38%|ββββ | 357/938 [02:33<03:42, 2.61it/s]
Training 1/1 epoch (loss 1.8120): 38%|ββββ | 358/938 [02:33<03:37, 2.67it/s]
Training 1/1 epoch (loss 1.7558): 38%|ββββ | 358/938 [02:34<03:37, 2.67it/s]
Training 1/1 epoch (loss 1.7558): 38%|ββββ | 359/938 [02:34<03:34, 2.70it/s]
Training 1/1 epoch (loss 1.9788): 38%|ββββ | 359/938 [02:34<03:34, 2.70it/s]
Training 1/1 epoch (loss 1.9788): 38%|ββββ | 360/938 [02:34<03:47, 2.54it/s]
Training 1/1 epoch (loss 1.9723): 38%|ββββ | 360/938 [02:35<03:47, 2.54it/s]
Training 1/1 epoch (loss 1.9723): 38%|ββββ | 361/938 [02:35<04:14, 2.26it/s]
Training 1/1 epoch (loss 1.8289): 38%|ββββ | 361/938 [02:35<04:14, 2.26it/s]
Training 1/1 epoch (loss 1.8289): 39%|ββββ | 362/938 [02:35<04:09, 2.31it/s]
Training 1/1 epoch (loss 1.8563): 39%|ββββ | 362/938 [02:36<04:09, 2.31it/s]
Training 1/1 epoch (loss 1.8563): 39%|ββββ | 363/938 [02:36<04:05, 2.34it/s]
Training 1/1 epoch (loss 1.9092): 39%|ββββ | 363/938 [02:36<04:05, 2.34it/s]
Training 1/1 epoch (loss 1.9092): 39%|ββββ | 364/938 [02:36<04:06, 2.33it/s]
Training 1/1 epoch (loss 1.8469): 39%|ββββ | 364/938 [02:36<04:06, 2.33it/s]
Training 1/1 epoch (loss 1.8469): 39%|ββββ | 365/938 [02:36<03:59, 2.40it/s]
Training 1/1 epoch (loss 1.8219): 39%|ββββ | 365/938 [02:37<03:59, 2.40it/s]
Training 1/1 epoch (loss 1.8219): 39%|ββββ | 366/938 [02:37<03:58, 2.40it/s]
Training 1/1 epoch (loss 1.8121): 39%|ββββ | 366/938 [02:37<03:58, 2.40it/s]
Training 1/1 epoch (loss 1.8121): 39%|ββββ | 367/938 [02:37<04:01, 2.37it/s]
Training 1/1 epoch (loss 1.9527): 39%|ββββ | 367/938 [02:38<04:01, 2.37it/s]
Training 1/1 epoch (loss 1.9527): 39%|ββββ | 368/938 [02:38<03:56, 2.41it/s]
Training 1/1 epoch (loss 1.8069): 39%|ββββ | 368/938 [02:38<03:56, 2.41it/s]
Training 1/1 epoch (loss 1.8069): 39%|ββββ | 369/938 [02:38<03:50, 2.47it/s]
Training 1/1 epoch (loss 1.8528): 39%|ββββ | 369/938 [02:38<03:50, 2.47it/s]
Training 1/1 epoch (loss 1.8528): 39%|ββββ | 370/938 [02:38<03:39, 2.59it/s]
Training 1/1 epoch (loss 1.8277): 39%|ββββ | 370/938 [02:39<03:39, 2.59it/s]
Training 1/1 epoch (loss 1.8277): 40%|ββββ | 371/938 [02:39<03:36, 2.62it/s]
Training 1/1 epoch (loss 1.8297): 40%|ββββ | 371/938 [02:39<03:36, 2.62it/s]
Training 1/1 epoch (loss 1.8297): 40%|ββββ | 372/938 [02:39<03:29, 2.70it/s]
Training 1/1 epoch (loss 1.7536): 40%|ββββ | 372/938 [02:39<03:29, 2.70it/s]
Training 1/1 epoch (loss 1.7536): 40%|ββββ | 373/938 [02:39<03:24, 2.76it/s]
Training 1/1 epoch (loss 1.6981): 40%|ββββ | 373/938 [02:40<03:24, 2.76it/s]
Training 1/1 epoch (loss 1.6981): 40%|ββββ | 374/938 [02:40<03:31, 2.67it/s]
Training 1/1 epoch (loss 1.9368): 40%|ββββ | 374/938 [02:40<03:31, 2.67it/s]
Training 1/1 epoch (loss 1.9368): 40%|ββββ | 375/938 [02:40<03:28, 2.71it/s]
Training 1/1 epoch (loss 1.8502): 40%|ββββ | 375/938 [02:41<03:28, 2.71it/s]
Training 1/1 epoch (loss 1.8502): 40%|ββββ | 376/938 [02:41<03:25, 2.73it/s]
Training 1/1 epoch (loss 1.9715): 40%|ββββ | 376/938 [02:41<03:25, 2.73it/s]
Training 1/1 epoch (loss 1.9715): 40%|ββββ | 377/938 [02:41<03:41, 2.53it/s]
Training 1/1 epoch (loss 1.7393): 40%|ββββ | 377/938 [02:41<03:41, 2.53it/s]
Training 1/1 epoch (loss 1.7393): 40%|ββββ | 378/938 [02:41<03:37, 2.57it/s]
Training 1/1 epoch (loss 1.8049): 40%|ββββ | 378/938 [02:42<03:37, 2.57it/s]
Training 1/1 epoch (loss 1.8049): 40%|ββββ | 379/938 [02:42<03:35, 2.60it/s]
Training 1/1 epoch (loss 1.6851): 40%|ββββ | 379/938 [02:42<03:35, 2.60it/s]
Training 1/1 epoch (loss 1.6851): 41%|ββββ | 380/938 [02:42<03:38, 2.55it/s]
Training 1/1 epoch (loss 1.8253): 41%|ββββ | 380/938 [02:42<03:38, 2.55it/s]
Training 1/1 epoch (loss 1.8253): 41%|ββββ | 381/938 [02:42<03:31, 2.64it/s]
Training 1/1 epoch (loss 1.7411): 41%|ββββ | 381/938 [02:43<03:31, 2.64it/s]
Training 1/1 epoch (loss 1.7411): 41%|ββββ | 382/938 [02:43<03:40, 2.52it/s]
Training 1/1 epoch (loss 1.8821): 41%|ββββ | 382/938 [02:43<03:40, 2.52it/s]
Training 1/1 epoch (loss 1.8821): 41%|ββββ | 383/938 [02:43<03:34, 2.59it/s]
Training 1/1 epoch (loss 1.9353): 41%|ββββ | 383/938 [02:44<03:34, 2.59it/s]
Training 1/1 epoch (loss 1.9353): 41%|ββββ | 384/938 [02:44<03:32, 2.61it/s]
Training 1/1 epoch (loss 1.7933): 41%|ββββ | 384/938 [02:44<03:32, 2.61it/s]
Training 1/1 epoch (loss 1.7933): 41%|ββββ | 385/938 [02:44<03:33, 2.59it/s]
Training 1/1 epoch (loss 1.8800): 41%|ββββ | 385/938 [02:45<03:33, 2.59it/s]
Training 1/1 epoch (loss 1.8800): 41%|ββββ | 386/938 [02:45<03:41, 2.50it/s]
Training 1/1 epoch (loss 1.8722): 41%|ββββ | 386/938 [02:45<03:41, 2.50it/s]
Training 1/1 epoch (loss 1.8722): 41%|βββββ | 387/938 [02:45<03:37, 2.53it/s]
Training 1/1 epoch (loss 1.8756): 41%|βββββ | 387/938 [02:45<03:37, 2.53it/s]
Training 1/1 epoch (loss 1.8756): 41%|βββββ | 388/938 [02:45<03:42, 2.47it/s]
Training 1/1 epoch (loss 1.9805): 41%|βββββ | 388/938 [02:46<03:42, 2.47it/s]
Training 1/1 epoch (loss 1.9805): 41%|βββββ | 389/938 [02:46<03:39, 2.50it/s]
Training 1/1 epoch (loss 1.8444): 41%|βββββ | 389/938 [02:46<03:39, 2.50it/s]
Training 1/1 epoch (loss 1.8444): 42%|βββββ | 390/938 [02:46<03:39, 2.49it/s]
Training 1/1 epoch (loss 1.8135): 42%|βββββ | 390/938 [02:46<03:39, 2.49it/s]
Training 1/1 epoch (loss 1.8135): 42%|βββββ | 391/938 [02:46<03:31, 2.59it/s]
Training 1/1 epoch (loss 1.9313): 42%|βββββ | 391/938 [02:47<03:31, 2.59it/s]
Training 1/1 epoch (loss 1.9313): 42%|βββββ | 392/938 [02:47<03:32, 2.57it/s]
Training 1/1 epoch (loss 1.8391): 42%|βββββ | 392/938 [02:47<03:32, 2.57it/s]
Training 1/1 epoch (loss 1.8391): 42%|βββββ | 393/938 [02:47<03:29, 2.60it/s]
Training 1/1 epoch (loss 1.9843): 42%|βββββ | 393/938 [02:48<03:29, 2.60it/s]
Training 1/1 epoch (loss 1.9843): 42%|βββββ | 394/938 [02:48<03:31, 2.57it/s]
Training 1/1 epoch (loss 1.7801): 42%|βββββ | 394/938 [02:48<03:31, 2.57it/s]
Training 1/1 epoch (loss 1.7801): 42%|βββββ | 395/938 [02:48<03:30, 2.58it/s]
Training 1/1 epoch (loss 1.8070): 42%|βββββ | 395/938 [02:48<03:30, 2.58it/s]
Training 1/1 epoch (loss 1.8070): 42%|βββββ | 396/938 [02:48<03:26, 2.62it/s]
Training 1/1 epoch (loss 1.7809): 42%|βββββ | 396/938 [02:49<03:26, 2.62it/s]
Training 1/1 epoch (loss 1.7809): 42%|βββββ | 397/938 [02:49<03:25, 2.64it/s]
Training 1/1 epoch (loss 1.7641): 42%|βββββ | 397/938 [02:49<03:25, 2.64it/s]
Training 1/1 epoch (loss 1.7641): 42%|βββββ | 398/938 [02:49<03:21, 2.68it/s]
Training 1/1 epoch (loss 1.8202): 42%|βββββ | 398/938 [02:49<03:21, 2.68it/s]
Training 1/1 epoch (loss 1.8202): 43%|βββββ | 399/938 [02:49<03:22, 2.66it/s]
Training 1/1 epoch (loss 1.8977): 43%|βββββ | 399/938 [02:50<03:22, 2.66it/s]
Training 1/1 epoch (loss 1.8977): 43%|βββββ | 400/938 [02:50<03:31, 2.55it/s]
Training 1/1 epoch (loss 1.9199): 43%|βββββ | 400/938 [02:50<03:31, 2.55it/s]
Training 1/1 epoch (loss 1.9199): 43%|βββββ | 401/938 [02:50<03:28, 2.58it/s]
Training 1/1 epoch (loss 1.8693): 43%|βββββ | 401/938 [02:51<03:28, 2.58it/s]
Training 1/1 epoch (loss 1.8693): 43%|βββββ | 402/938 [02:51<03:21, 2.66it/s]
Training 1/1 epoch (loss 1.8355): 43%|βββββ | 402/938 [02:51<03:21, 2.66it/s]
Training 1/1 epoch (loss 1.8355): 43%|βββββ | 403/938 [02:51<03:36, 2.47it/s]
Training 1/1 epoch (loss 1.8971): 43%|βββββ | 403/938 [02:51<03:36, 2.47it/s]
Training 1/1 epoch (loss 1.8971): 43%|βββββ | 404/938 [02:51<03:29, 2.55it/s]
Training 1/1 epoch (loss 1.7280): 43%|βββββ | 404/938 [02:52<03:29, 2.55it/s]
Training 1/1 epoch (loss 1.7280): 43%|βββββ | 405/938 [02:52<03:30, 2.54it/s]
Training 1/1 epoch (loss 1.9815): 43%|βββββ | 405/938 [02:52<03:30, 2.54it/s]
Training 1/1 epoch (loss 1.9815): 43%|βββββ | 406/938 [02:52<03:27, 2.56it/s]
Training 1/1 epoch (loss 1.8968): 43%|βββββ | 406/938 [02:53<03:27, 2.56it/s]
Training 1/1 epoch (loss 1.8968): 43%|βββββ | 407/938 [02:53<03:29, 2.53it/s]
Training 1/1 epoch (loss 1.8169): 43%|βββββ | 407/938 [02:53<03:29, 2.53it/s]
Training 1/1 epoch (loss 1.8169): 43%|βββββ | 408/938 [02:53<03:31, 2.51it/s]
Training 1/1 epoch (loss 1.8098): 43%|βββββ | 408/938 [02:53<03:31, 2.51it/s]
Training 1/1 epoch (loss 1.8098): 44%|βββββ | 409/938 [02:53<03:24, 2.59it/s]
Training 1/1 epoch (loss 1.8128): 44%|βββββ | 409/938 [02:54<03:24, 2.59it/s]
Training 1/1 epoch (loss 1.8128): 44%|βββββ | 410/938 [02:54<03:43, 2.37it/s]
Training 1/1 epoch (loss 1.6971): 44%|βββββ | 410/938 [02:55<03:43, 2.37it/s]
Training 1/1 epoch (loss 1.6971): 44%|βββββ | 411/938 [02:55<04:05, 2.14it/s]
Training 1/1 epoch (loss 1.7716): 44%|βββββ | 411/938 [02:55<04:05, 2.14it/s]
Training 1/1 epoch (loss 1.7716): 44%|βββββ | 412/938 [02:55<03:57, 2.22it/s]
Training 1/1 epoch (loss 1.7657): 44%|βββββ | 412/938 [02:55<03:57, 2.22it/s]
Training 1/1 epoch (loss 1.7657): 44%|βββββ | 413/938 [02:55<03:50, 2.27it/s]
Training 1/1 epoch (loss 1.8410): 44%|βββββ | 413/938 [02:56<03:50, 2.27it/s]
Training 1/1 epoch (loss 1.8410): 44%|βββββ | 414/938 [02:56<03:42, 2.36it/s]
Training 1/1 epoch (loss 1.7624): 44%|βββββ | 414/938 [02:56<03:42, 2.36it/s]
Training 1/1 epoch (loss 1.7624): 44%|βββββ | 415/938 [02:56<03:46, 2.31it/s]
Training 1/1 epoch (loss 1.7489): 44%|βββββ | 415/938 [02:57<03:46, 2.31it/s]
Training 1/1 epoch (loss 1.7489): 44%|βββββ | 416/938 [02:57<03:36, 2.41it/s]
Training 1/1 epoch (loss 1.6519): 44%|βββββ | 416/938 [02:57<03:36, 2.41it/s]
Training 1/1 epoch (loss 1.6519): 44%|βββββ | 417/938 [02:57<03:32, 2.45it/s]
Training 1/1 epoch (loss 1.7160): 44%|βββββ | 417/938 [02:57<03:32, 2.45it/s]
Training 1/1 epoch (loss 1.7160): 45%|βββββ | 418/938 [02:57<03:28, 2.50it/s]
Training 1/1 epoch (loss 1.8072): 45%|βββββ | 418/938 [02:58<03:28, 2.50it/s]
Training 1/1 epoch (loss 1.8072): 45%|βββββ | 419/938 [02:58<03:35, 2.41it/s]
Training 1/1 epoch (loss 1.9290): 45%|βββββ | 419/938 [02:58<03:35, 2.41it/s]
Training 1/1 epoch (loss 1.9290): 45%|βββββ | 420/938 [02:58<03:36, 2.40it/s]
Training 1/1 epoch (loss 1.8354): 45%|βββββ | 420/938 [02:59<03:36, 2.40it/s]
Training 1/1 epoch (loss 1.8354): 45%|βββββ | 421/938 [02:59<03:21, 2.57it/s]
Training 1/1 epoch (loss 1.7256): 45%|βββββ | 421/938 [02:59<03:21, 2.57it/s]
Training 1/1 epoch (loss 1.7256): 45%|βββββ | 422/938 [02:59<03:15, 2.64it/s]
Training 1/1 epoch (loss 1.7354): 45%|βββββ | 422/938 [02:59<03:15, 2.64it/s]
Training 1/1 epoch (loss 1.7354): 45%|βββββ | 423/938 [02:59<03:27, 2.48it/s]
Training 1/1 epoch (loss 1.7111): 45%|βββββ | 423/938 [03:00<03:27, 2.48it/s]
Training 1/1 epoch (loss 1.7111): 45%|βββββ | 424/938 [03:00<03:33, 2.41it/s]
Training 1/1 epoch (loss 1.8267): 45%|βββββ | 424/938 [03:00<03:33, 2.41it/s]
Training 1/1 epoch (loss 1.8267): 45%|βββββ | 425/938 [03:00<03:29, 2.45it/s]
Training 1/1 epoch (loss 1.8730): 45%|βββββ | 425/938 [03:00<03:29, 2.45it/s]
Training 1/1 epoch (loss 1.8730): 45%|βββββ | 426/938 [03:00<03:14, 2.63it/s]
Training 1/1 epoch (loss 1.8654): 45%|βββββ | 426/938 [03:01<03:14, 2.63it/s]
Training 1/1 epoch (loss 1.8654): 46%|βββββ | 427/938 [03:01<03:17, 2.59it/s]
Training 1/1 epoch (loss 1.7727): 46%|βββββ | 427/938 [03:01<03:17, 2.59it/s]
Training 1/1 epoch (loss 1.7727): 46%|βββββ | 428/938 [03:01<03:08, 2.70it/s]
Training 1/1 epoch (loss 1.7646): 46%|βββββ | 428/938 [03:02<03:08, 2.70it/s]
Training 1/1 epoch (loss 1.7646): 46%|βββββ | 429/938 [03:02<03:12, 2.65it/s]
Training 1/1 epoch (loss 1.8755): 46%|βββββ | 429/938 [03:02<03:12, 2.65it/s]
Training 1/1 epoch (loss 1.8755): 46%|βββββ | 430/938 [03:02<03:11, 2.66it/s]
Training 1/1 epoch (loss 1.8081): 46%|βββββ | 430/938 [03:02<03:11, 2.66it/s]
Training 1/1 epoch (loss 1.8081): 46%|βββββ | 431/938 [03:02<03:13, 2.62it/s]
Training 1/1 epoch (loss 1.8453): 46%|βββββ | 431/938 [03:03<03:13, 2.62it/s]
Training 1/1 epoch (loss 1.8453): 46%|βββββ | 432/938 [03:03<03:08, 2.69it/s]
Training 1/1 epoch (loss 1.8649): 46%|βββββ | 432/938 [03:03<03:08, 2.69it/s]
Training 1/1 epoch (loss 1.8649): 46%|βββββ | 433/938 [03:03<03:07, 2.69it/s]
Training 1/1 epoch (loss 1.9276): 46%|βββββ | 433/938 [03:04<03:07, 2.69it/s]
Training 1/1 epoch (loss 1.9276): 46%|βββββ | 434/938 [03:04<03:12, 2.62it/s]
Training 1/1 epoch (loss 1.8174): 46%|βββββ | 434/938 [03:04<03:12, 2.62it/s]
Training 1/1 epoch (loss 1.8174): 46%|βββββ | 435/938 [03:04<03:18, 2.53it/s]
Training 1/1 epoch (loss 1.8754): 46%|βββββ | 435/938 [03:04<03:18, 2.53it/s]
Training 1/1 epoch (loss 1.8754): 46%|βββββ | 436/938 [03:04<03:30, 2.38it/s]
Training 1/1 epoch (loss 1.8506): 46%|βββββ | 436/938 [03:05<03:30, 2.38it/s]
Training 1/1 epoch (loss 1.8506): 47%|βββββ | 437/938 [03:05<03:28, 2.41it/s]
Training 1/1 epoch (loss 1.7153): 47%|βββββ | 437/938 [03:05<03:28, 2.41it/s]
Training 1/1 epoch (loss 1.7153): 47%|βββββ | 438/938 [03:05<03:19, 2.50it/s]
Training 1/1 epoch (loss 1.8601): 47%|βββββ | 438/938 [03:06<03:19, 2.50it/s]
Training 1/1 epoch (loss 1.8601): 47%|βββββ | 439/938 [03:06<03:21, 2.48it/s]
Training 1/1 epoch (loss 1.8121): 47%|βββββ | 439/938 [03:06<03:21, 2.48it/s]
Training 1/1 epoch (loss 1.8121): 47%|βββββ | 440/938 [03:06<03:21, 2.48it/s]
Training 1/1 epoch (loss 1.8192): 47%|βββββ | 440/938 [03:06<03:21, 2.48it/s]
Training 1/1 epoch (loss 1.8192): 47%|βββββ | 441/938 [03:06<03:15, 2.54it/s]
Training 1/1 epoch (loss 1.7236): 47%|βββββ | 441/938 [03:07<03:15, 2.54it/s]
Training 1/1 epoch (loss 1.7236): 47%|βββββ | 442/938 [03:07<03:12, 2.58it/s]
Training 1/1 epoch (loss 1.8125): 47%|βββββ | 442/938 [03:07<03:12, 2.58it/s]
Training 1/1 epoch (loss 1.8125): 47%|βββββ | 443/938 [03:07<03:09, 2.62it/s]
Training 1/1 epoch (loss 1.9016): 47%|βββββ | 443/938 [03:07<03:09, 2.62it/s]
Training 1/1 epoch (loss 1.9016): 47%|βββββ | 444/938 [03:07<03:00, 2.74it/s]
Training 1/1 epoch (loss 1.5927): 47%|βββββ | 444/938 [03:08<03:00, 2.74it/s]
Training 1/1 epoch (loss 1.5927): 47%|βββββ | 445/938 [03:08<03:30, 2.34it/s]
Training 1/1 epoch (loss 1.7754): 47%|βββββ | 445/938 [03:08<03:30, 2.34it/s]
Training 1/1 epoch (loss 1.7754): 48%|βββββ | 446/938 [03:08<03:19, 2.47it/s]
Training 1/1 epoch (loss 1.8354): 48%|βββββ | 446/938 [03:09<03:19, 2.47it/s]
Training 1/1 epoch (loss 1.8354): 48%|βββββ | 447/938 [03:09<03:07, 2.62it/s]
Training 1/1 epoch (loss 1.7829): 48%|βββββ | 447/938 [03:09<03:07, 2.62it/s]
Training 1/1 epoch (loss 1.7829): 48%|βββββ | 448/938 [03:09<03:09, 2.58it/s]
Training 1/1 epoch (loss 1.9821): 48%|βββββ | 448/938 [03:09<03:09, 2.58it/s]
Training 1/1 epoch (loss 1.9821): 48%|βββββ | 449/938 [03:09<03:04, 2.66it/s]
Training 1/1 epoch (loss 1.6366): 48%|βββββ | 449/938 [03:10<03:04, 2.66it/s]
Training 1/1 epoch (loss 1.6366): 48%|βββββ | 450/938 [03:10<03:12, 2.53it/s]
Training 1/1 epoch (loss 1.7153): 48%|βββββ | 450/938 [03:10<03:12, 2.53it/s]
Training 1/1 epoch (loss 1.7153): 48%|βββββ | 451/938 [03:10<03:15, 2.49it/s]
Training 1/1 epoch (loss 1.7992): 48%|βββββ | 451/938 [03:11<03:15, 2.49it/s]
Training 1/1 epoch (loss 1.7992): 48%|βββββ | 452/938 [03:11<03:04, 2.64it/s]
Training 1/1 epoch (loss 1.8171): 48%|βββββ | 452/938 [03:11<03:04, 2.64it/s]
Training 1/1 epoch (loss 1.8171): 48%|βββββ | 453/938 [03:11<03:09, 2.56it/s]
Training 1/1 epoch (loss 1.7980): 48%|βββββ | 453/938 [03:11<03:09, 2.56it/s]
Training 1/1 epoch (loss 1.7980): 48%|βββββ | 454/938 [03:11<03:09, 2.55it/s]
Training 1/1 epoch (loss 1.8311): 48%|βββββ | 454/938 [03:12<03:09, 2.55it/s]
Training 1/1 epoch (loss 1.8311): 49%|βββββ | 455/938 [03:12<03:02, 2.65it/s]
Training 1/1 epoch (loss 1.8029): 49%|βββββ | 455/938 [03:12<03:02, 2.65it/s]
Training 1/1 epoch (loss 1.8029): 49%|βββββ | 456/938 [03:12<03:33, 2.26it/s]
Training 1/1 epoch (loss 1.9395): 49%|βββββ | 456/938 [03:13<03:33, 2.26it/s]
Training 1/1 epoch (loss 1.9395): 49%|βββββ | 457/938 [03:13<03:42, 2.16it/s]
Training 1/1 epoch (loss 1.9020): 49%|βββββ | 457/938 [03:13<03:42, 2.16it/s]
Training 1/1 epoch (loss 1.9020): 49%|βββββ | 458/938 [03:13<03:31, 2.27it/s]
Training 1/1 epoch (loss 1.9285): 49%|βββββ | 458/938 [03:14<03:31, 2.27it/s]
Training 1/1 epoch (loss 1.9285): 49%|βββββ | 459/938 [03:14<03:26, 2.32it/s]
Training 1/1 epoch (loss 1.8498): 49%|βββββ | 459/938 [03:14<03:26, 2.32it/s]
Training 1/1 epoch (loss 1.8498): 49%|βββββ | 460/938 [03:14<03:29, 2.28it/s]
Training 1/1 epoch (loss 1.7821): 49%|βββββ | 460/938 [03:15<03:29, 2.28it/s]
Training 1/1 epoch (loss 1.7821): 49%|βββββ | 461/938 [03:15<03:36, 2.20it/s]
Training 1/1 epoch (loss 1.7817): 49%|βββββ | 461/938 [03:15<03:36, 2.20it/s]
Training 1/1 epoch (loss 1.7817): 49%|βββββ | 462/938 [03:15<03:31, 2.25it/s]
Training 1/1 epoch (loss 1.7059): 49%|βββββ | 462/938 [03:15<03:31, 2.25it/s]
Training 1/1 epoch (loss 1.7059): 49%|βββββ | 463/938 [03:15<03:23, 2.33it/s]
Training 1/1 epoch (loss 1.8906): 49%|βββββ | 463/938 [03:16<03:23, 2.33it/s]
Training 1/1 epoch (loss 1.8906): 49%|βββββ | 464/938 [03:16<03:20, 2.36it/s]
Training 1/1 epoch (loss 1.7927): 49%|βββββ | 464/938 [03:16<03:20, 2.36it/s]
Training 1/1 epoch (loss 1.7927): 50%|βββββ | 465/938 [03:16<03:22, 2.34it/s]
Training 1/1 epoch (loss 1.9009): 50%|βββββ | 465/938 [03:17<03:22, 2.34it/s]
Training 1/1 epoch (loss 1.9009): 50%|βββββ | 466/938 [03:17<03:18, 2.38it/s]
Training 1/1 epoch (loss 1.9224): 50%|βββββ | 466/938 [03:17<03:18, 2.38it/s]
Training 1/1 epoch (loss 1.9224): 50%|βββββ | 467/938 [03:17<03:10, 2.48it/s]
Training 1/1 epoch (loss 1.9557): 50%|βββββ | 467/938 [03:17<03:10, 2.48it/s]
Training 1/1 epoch (loss 1.9557): 50%|βββββ | 468/938 [03:17<03:12, 2.44it/s]
Training 1/1 epoch (loss 1.9025): 50%|βββββ | 468/938 [03:18<03:12, 2.44it/s]
Training 1/1 epoch (loss 1.9025): 50%|βββββ | 469/938 [03:18<03:14, 2.41it/s]
Training 1/1 epoch (loss 1.6247): 50%|βββββ | 469/938 [03:18<03:14, 2.41it/s]
Training 1/1 epoch (loss 1.6247): 50%|βββββ | 470/938 [03:18<03:25, 2.27it/s]
Training 1/1 epoch (loss 1.8405): 50%|βββββ | 470/938 [03:19<03:25, 2.27it/s]
Training 1/1 epoch (loss 1.8405): 50%|βββββ | 471/938 [03:19<03:15, 2.39it/s]
Training 1/1 epoch (loss 1.8595): 50%|βββββ | 471/938 [03:19<03:15, 2.39it/s]
Training 1/1 epoch (loss 1.8595): 50%|βββββ | 472/938 [03:19<03:06, 2.50it/s]
Training 1/1 epoch (loss 1.7193): 50%|βββββ | 472/938 [03:20<03:06, 2.50it/s]
Training 1/1 epoch (loss 1.7193): 50%|βββββ | 473/938 [03:20<03:08, 2.46it/s]
Training 1/1 epoch (loss 1.8568): 50%|βββββ | 473/938 [03:20<03:08, 2.46it/s]
Training 1/1 epoch (loss 1.8568): 51%|βββββ | 474/938 [03:20<03:06, 2.49it/s]
Training 1/1 epoch (loss 1.7531): 51%|βββββ | 474/938 [03:20<03:06, 2.49it/s]
Training 1/1 epoch (loss 1.7531): 51%|βββββ | 475/938 [03:20<03:13, 2.40it/s]
Training 1/1 epoch (loss 1.7691): 51%|βββββ | 475/938 [03:21<03:13, 2.40it/s]
Training 1/1 epoch (loss 1.7691): 51%|βββββ | 476/938 [03:21<03:07, 2.47it/s]
Training 1/1 epoch (loss 1.7789): 51%|βββββ | 476/938 [03:21<03:07, 2.47it/s]
Training 1/1 epoch (loss 1.7789): 51%|βββββ | 477/938 [03:21<03:00, 2.55it/s]
Training 1/1 epoch (loss 1.7330): 51%|βββββ | 477/938 [03:21<03:00, 2.55it/s]
Training 1/1 epoch (loss 1.7330): 51%|βββββ | 478/938 [03:21<02:56, 2.61it/s]
Training 1/1 epoch (loss 1.8653): 51%|βββββ | 478/938 [03:22<02:56, 2.61it/s]
Training 1/1 epoch (loss 1.8653): 51%|βββββ | 479/938 [03:22<02:59, 2.56it/s]
Training 1/1 epoch (loss 1.7865): 51%|βββββ | 479/938 [03:22<02:59, 2.56it/s]
Training 1/1 epoch (loss 1.7865): 51%|βββββ | 480/938 [03:22<03:01, 2.52it/s]
Training 1/1 epoch (loss 1.8590): 51%|βββββ | 480/938 [03:23<03:01, 2.52it/s]
Training 1/1 epoch (loss 1.8590): 51%|ββββββ | 481/938 [03:23<03:02, 2.51it/s]
Training 1/1 epoch (loss 1.8127): 51%|ββββββ | 481/938 [03:23<03:02, 2.51it/s]
Training 1/1 epoch (loss 1.8127): 51%|ββββββ | 482/938 [03:23<03:01, 2.51it/s]
Training 1/1 epoch (loss 1.7818): 51%|ββββββ | 482/938 [03:23<03:01, 2.51it/s]
Training 1/1 epoch (loss 1.7818): 51%|ββββββ | 483/938 [03:23<02:52, 2.64it/s]
Training 1/1 epoch (loss 2.0040): 51%|ββββββ | 483/938 [03:24<02:52, 2.64it/s]
Training 1/1 epoch (loss 2.0040): 52%|ββββββ | 484/938 [03:24<03:07, 2.43it/s]
Training 1/1 epoch (loss 1.9618): 52%|ββββββ | 484/938 [03:25<03:07, 2.43it/s]
Training 1/1 epoch (loss 1.9618): 52%|ββββββ | 485/938 [03:25<03:39, 2.07it/s]
Training 1/1 epoch (loss 1.8149): 52%|ββββββ | 485/938 [03:25<03:39, 2.07it/s]
Training 1/1 epoch (loss 1.8149): 52%|ββββββ | 486/938 [03:25<03:22, 2.23it/s]
Training 1/1 epoch (loss 1.7989): 52%|ββββββ | 486/938 [03:25<03:22, 2.23it/s]
Training 1/1 epoch (loss 1.7989): 52%|ββββββ | 487/938 [03:25<03:08, 2.39it/s]
Training 1/1 epoch (loss 1.7737): 52%|ββββββ | 487/938 [03:26<03:08, 2.39it/s]
Training 1/1 epoch (loss 1.7737): 52%|ββββββ | 488/938 [03:26<03:04, 2.44it/s]
Training 1/1 epoch (loss 1.8220): 52%|ββββββ | 488/938 [03:26<03:04, 2.44it/s]
Training 1/1 epoch (loss 1.8220): 52%|ββββββ | 489/938 [03:26<03:08, 2.39it/s]
Training 1/1 epoch (loss 1.8980): 52%|ββββββ | 489/938 [03:27<03:08, 2.39it/s]
Training 1/1 epoch (loss 1.8980): 52%|ββββββ | 490/938 [03:27<03:14, 2.31it/s]
Training 1/1 epoch (loss 1.9144): 52%|ββββββ | 490/938 [03:27<03:14, 2.31it/s]
Training 1/1 epoch (loss 1.9144): 52%|ββββββ | 491/938 [03:27<03:15, 2.29it/s]
Training 1/1 epoch (loss 1.9669): 52%|ββββββ | 491/938 [03:27<03:15, 2.29it/s]
Training 1/1 epoch (loss 1.9669): 52%|ββββββ | 492/938 [03:27<03:02, 2.45it/s]
Training 1/1 epoch (loss 1.8028): 52%|ββββββ | 492/938 [03:28<03:02, 2.45it/s]
Training 1/1 epoch (loss 1.8028): 53%|ββββββ | 493/938 [03:28<02:55, 2.54it/s]
Training 1/1 epoch (loss 1.8117): 53%|ββββββ | 493/938 [03:28<02:55, 2.54it/s]
Training 1/1 epoch (loss 1.8117): 53%|ββββββ | 494/938 [03:28<02:57, 2.50it/s]
Training 1/1 epoch (loss 1.8527): 53%|ββββββ | 494/938 [03:29<02:57, 2.50it/s]
Training 1/1 epoch (loss 1.8527): 53%|ββββββ | 495/938 [03:29<03:00, 2.46it/s]
Training 1/1 epoch (loss 1.6713): 53%|ββββββ | 495/938 [03:29<03:00, 2.46it/s]
Training 1/1 epoch (loss 1.6713): 53%|ββββββ | 496/938 [03:29<02:58, 2.48it/s]
Training 1/1 epoch (loss 1.8962): 53%|ββββββ | 496/938 [03:29<02:58, 2.48it/s]
Training 1/1 epoch (loss 1.8962): 53%|ββββββ | 497/938 [03:29<03:05, 2.38it/s]
Training 1/1 epoch (loss 1.7594): 53%|ββββββ | 497/938 [03:30<03:05, 2.38it/s]
Training 1/1 epoch (loss 1.7594): 53%|ββββββ | 498/938 [03:30<02:53, 2.53it/s]
Training 1/1 epoch (loss 1.7219): 53%|ββββββ | 498/938 [03:30<02:53, 2.53it/s]
Training 1/1 epoch (loss 1.7219): 53%|ββββββ | 499/938 [03:30<02:52, 2.55it/s]
Training 1/1 epoch (loss 1.7358): 53%|ββββββ | 499/938 [03:31<02:52, 2.55it/s]
Training 1/1 epoch (loss 1.7358): 53%|ββββββ | 500/938 [03:31<02:47, 2.61it/s]
Training 1/1 epoch (loss 1.8776): 53%|ββββββ | 500/938 [03:31<02:47, 2.61it/s]
Training 1/1 epoch (loss 1.8776): 53%|ββββββ | 501/938 [03:31<02:45, 2.65it/s]
Training 1/1 epoch (loss 1.9080): 53%|ββββββ | 501/938 [03:31<02:45, 2.65it/s]
Training 1/1 epoch (loss 1.9080): 54%|ββββββ | 502/938 [03:31<02:41, 2.71it/s]
Training 1/1 epoch (loss 1.8147): 54%|ββββββ | 502/938 [03:32<02:41, 2.71it/s]
Training 1/1 epoch (loss 1.8147): 54%|ββββββ | 503/938 [03:32<02:39, 2.73it/s]
Training 1/1 epoch (loss 1.7070): 54%|ββββββ | 503/938 [03:32<02:39, 2.73it/s]
Training 1/1 epoch (loss 1.7070): 54%|ββββββ | 504/938 [03:32<02:40, 2.71it/s]
Training 1/1 epoch (loss 1.8174): 54%|ββββββ | 504/938 [03:32<02:40, 2.71it/s]
Training 1/1 epoch (loss 1.8174): 54%|ββββββ | 505/938 [03:32<02:48, 2.57it/s]
Training 1/1 epoch (loss 1.7841): 54%|ββββββ | 505/938 [03:33<02:48, 2.57it/s]
Training 1/1 epoch (loss 1.7841): 54%|ββββββ | 506/938 [03:33<02:47, 2.58it/s]
Training 1/1 epoch (loss 1.8641): 54%|ββββββ | 506/938 [03:33<02:47, 2.58it/s]
Training 1/1 epoch (loss 1.8641): 54%|ββββββ | 507/938 [03:33<02:46, 2.58it/s]
Training 1/1 epoch (loss 1.8152): 54%|ββββββ | 507/938 [03:34<02:46, 2.58it/s]
Training 1/1 epoch (loss 1.8152): 54%|ββββββ | 508/938 [03:34<02:49, 2.53it/s]
Training 1/1 epoch (loss 1.8179): 54%|ββββββ | 508/938 [03:34<02:49, 2.53it/s]
Training 1/1 epoch (loss 1.8179): 54%|ββββββ | 509/938 [03:34<02:45, 2.59it/s]
Training 1/1 epoch (loss 1.8715): 54%|ββββββ | 509/938 [03:34<02:45, 2.59it/s]
Training 1/1 epoch (loss 1.8715): 54%|ββββββ | 510/938 [03:34<02:55, 2.44it/s]
Training 1/1 epoch (loss 2.0180): 54%|ββββββ | 510/938 [03:35<02:55, 2.44it/s]
Training 1/1 epoch (loss 2.0180): 54%|ββββββ | 511/938 [03:35<03:31, 2.02it/s]
Training 1/1 epoch (loss 1.7367): 54%|ββββββ | 511/938 [03:35<03:31, 2.02it/s]
Training 1/1 epoch (loss 1.7367): 55%|ββββββ | 512/938 [03:35<03:15, 2.18it/s]
Training 1/1 epoch (loss 1.8381): 55%|ββββββ | 512/938 [03:36<03:15, 2.18it/s]
Training 1/1 epoch (loss 1.8381): 55%|ββββββ | 513/938 [03:36<03:33, 1.99it/s]
Training 1/1 epoch (loss 1.8567): 55%|ββββββ | 513/938 [03:36<03:33, 1.99it/s]
Training 1/1 epoch (loss 1.8567): 55%|ββββββ | 514/938 [03:36<03:19, 2.12it/s]
Training 1/1 epoch (loss 1.7653): 55%|ββββββ | 514/938 [03:37<03:19, 2.12it/s]
Training 1/1 epoch (loss 1.7653): 55%|ββββββ | 515/938 [03:37<03:11, 2.20it/s]
Training 1/1 epoch (loss 1.8342): 55%|ββββββ | 515/938 [03:37<03:11, 2.20it/s]
Training 1/1 epoch (loss 1.8342): 55%|ββββββ | 516/938 [03:37<03:02, 2.31it/s]
Training 1/1 epoch (loss 1.8910): 55%|ββββββ | 516/938 [03:38<03:02, 2.31it/s]
Training 1/1 epoch (loss 1.8910): 55%|ββββββ | 517/938 [03:38<02:52, 2.44it/s]
Training 1/1 epoch (loss 1.7969): 55%|ββββββ | 517/938 [03:38<02:52, 2.44it/s]
Training 1/1 epoch (loss 1.7969): 55%|ββββββ | 518/938 [03:38<02:47, 2.51it/s]
Training 1/1 epoch (loss 1.8622): 55%|ββββββ | 518/938 [03:38<02:47, 2.51it/s]
Training 1/1 epoch (loss 1.8622): 55%|ββββββ | 519/938 [03:38<02:54, 2.41it/s]
Training 1/1 epoch (loss 1.9056): 55%|ββββββ | 519/938 [03:39<02:54, 2.41it/s]
Training 1/1 epoch (loss 1.9056): 55%|ββββββ | 520/938 [03:39<03:11, 2.18it/s]
Training 1/1 epoch (loss 1.7265): 55%|ββββββ | 520/938 [03:39<03:11, 2.18it/s]
Training 1/1 epoch (loss 1.7265): 56%|ββββββ | 521/938 [03:39<02:57, 2.35it/s]
Training 1/1 epoch (loss 1.7753): 56%|ββββββ | 521/938 [03:40<02:57, 2.35it/s]
Training 1/1 epoch (loss 1.7753): 56%|ββββββ | 522/938 [03:40<02:52, 2.41it/s]
Training 1/1 epoch (loss 1.7832): 56%|ββββββ | 522/938 [03:40<02:52, 2.41it/s]
Training 1/1 epoch (loss 1.7832): 56%|ββββββ | 523/938 [03:40<02:51, 2.42it/s]
Training 1/1 epoch (loss 1.7025): 56%|ββββββ | 523/938 [03:41<02:51, 2.42it/s]
Training 1/1 epoch (loss 1.7025): 56%|ββββββ | 524/938 [03:41<02:47, 2.48it/s]
Training 1/1 epoch (loss 1.7619): 56%|ββββββ | 524/938 [03:41<02:47, 2.48it/s]
Training 1/1 epoch (loss 1.7619): 56%|ββββββ | 525/938 [03:41<02:45, 2.49it/s]
Training 1/1 epoch (loss 1.8702): 56%|ββββββ | 525/938 [03:41<02:45, 2.49it/s]
Training 1/1 epoch (loss 1.8702): 56%|ββββββ | 526/938 [03:41<02:37, 2.62it/s]
Training 1/1 epoch (loss 1.7951): 56%|ββββββ | 526/938 [03:42<02:37, 2.62it/s]
Training 1/1 epoch (loss 1.7951): 56%|ββββββ | 527/938 [03:42<02:35, 2.64it/s]
Training 1/1 epoch (loss 1.8519): 56%|ββββββ | 527/938 [03:42<02:35, 2.64it/s]
Training 1/1 epoch (loss 1.8519): 56%|ββββββ | 528/938 [03:42<02:35, 2.64it/s]
Training 1/1 epoch (loss 1.7298): 56%|ββββββ | 528/938 [03:43<02:35, 2.64it/s]
Training 1/1 epoch (loss 1.7298): 56%|ββββββ | 529/938 [03:43<02:44, 2.48it/s]
Training 1/1 epoch (loss 1.7603): 56%|ββββββ | 529/938 [03:43<02:44, 2.48it/s]
Training 1/1 epoch (loss 1.7603): 57%|ββββββ | 530/938 [03:43<02:46, 2.46it/s]
Training 1/1 epoch (loss 1.7504): 57%|ββββββ | 530/938 [03:43<02:46, 2.46it/s]
Training 1/1 epoch (loss 1.7504): 57%|ββββββ | 531/938 [03:43<02:42, 2.51it/s]
Training 1/1 epoch (loss 1.9131): 57%|ββββββ | 531/938 [03:44<02:42, 2.51it/s]
Training 1/1 epoch (loss 1.9131): 57%|ββββββ | 532/938 [03:44<02:38, 2.55it/s]
Training 1/1 epoch (loss 1.8213): 57%|ββββββ | 532/938 [03:44<02:38, 2.55it/s]
Training 1/1 epoch (loss 1.8213): 57%|ββββββ | 533/938 [03:44<02:39, 2.55it/s]
Training 1/1 epoch (loss 1.7695): 57%|ββββββ | 533/938 [03:44<02:39, 2.55it/s]
Training 1/1 epoch (loss 1.7695): 57%|ββββββ | 534/938 [03:44<02:33, 2.63it/s]
Training 1/1 epoch (loss 1.7609): 57%|ββββββ | 534/938 [03:45<02:33, 2.63it/s]
Training 1/1 epoch (loss 1.7609): 57%|ββββββ | 535/938 [03:45<02:51, 2.34it/s]
Training 1/1 epoch (loss 1.6901): 57%|ββββββ | 535/938 [03:45<02:51, 2.34it/s]
Training 1/1 epoch (loss 1.6901): 57%|ββββββ | 536/938 [03:45<02:42, 2.47it/s]
Training 1/1 epoch (loss 1.8637): 57%|ββββββ | 536/938 [03:46<02:42, 2.47it/s]
Training 1/1 epoch (loss 1.8637): 57%|ββββββ | 537/938 [03:46<02:38, 2.54it/s]
Training 1/1 epoch (loss 1.7151): 57%|ββββββ | 537/938 [03:46<02:38, 2.54it/s]
Training 1/1 epoch (loss 1.7151): 57%|ββββββ | 538/938 [03:46<02:36, 2.55it/s]
Training 1/1 epoch (loss 1.8987): 57%|ββββββ | 538/938 [03:47<02:36, 2.55it/s]
Training 1/1 epoch (loss 1.8987): 57%|ββββββ | 539/938 [03:47<02:41, 2.47it/s]
Training 1/1 epoch (loss 1.7858): 57%|ββββββ | 539/938 [03:47<02:41, 2.47it/s]
Training 1/1 epoch (loss 1.7858): 58%|ββββββ | 540/938 [03:47<02:35, 2.56it/s]
Training 1/1 epoch (loss 1.6614): 58%|ββββββ | 540/938 [03:47<02:35, 2.56it/s]
Training 1/1 epoch (loss 1.6614): 58%|ββββββ | 541/938 [03:47<02:31, 2.61it/s]
Training 1/1 epoch (loss 1.8097): 58%|ββββββ | 541/938 [03:48<02:31, 2.61it/s]
Training 1/1 epoch (loss 1.8097): 58%|ββββββ | 542/938 [03:48<02:31, 2.62it/s]
Training 1/1 epoch (loss 1.8621): 58%|ββββββ | 542/938 [03:48<02:31, 2.62it/s]
Training 1/1 epoch (loss 1.8621): 58%|ββββββ | 543/938 [03:48<02:34, 2.56it/s]
Training 1/1 epoch (loss 1.7658): 58%|ββββββ | 543/938 [03:48<02:34, 2.56it/s]
Training 1/1 epoch (loss 1.7658): 58%|ββββββ | 544/938 [03:48<02:39, 2.47it/s]
Training 1/1 epoch (loss 1.7019): 58%|ββββββ | 544/938 [03:49<02:39, 2.47it/s]
Training 1/1 epoch (loss 1.7019): 58%|ββββββ | 545/938 [03:49<02:50, 2.30it/s]
Training 1/1 epoch (loss 1.6684): 58%|ββββββ | 545/938 [03:49<02:50, 2.30it/s]
Training 1/1 epoch (loss 1.6684): 58%|ββββββ | 546/938 [03:49<02:51, 2.29it/s]
Training 1/1 epoch (loss 1.7756): 58%|ββββββ | 546/938 [03:50<02:51, 2.29it/s]
Training 1/1 epoch (loss 1.7756): 58%|ββββββ | 547/938 [03:50<02:43, 2.40it/s]
Training 1/1 epoch (loss 1.8715): 58%|ββββββ | 547/938 [03:50<02:43, 2.40it/s]
Training 1/1 epoch (loss 1.8715): 58%|ββββββ | 548/938 [03:50<02:37, 2.48it/s]
Training 1/1 epoch (loss 1.7087): 58%|ββββββ | 548/938 [03:51<02:37, 2.48it/s]
Training 1/1 epoch (loss 1.7087): 59%|ββββββ | 549/938 [03:51<02:39, 2.44it/s]
Training 1/1 epoch (loss 1.8770): 59%|ββββββ | 549/938 [03:51<02:39, 2.44it/s]
Training 1/1 epoch (loss 1.8770): 59%|ββββββ | 550/938 [03:51<02:34, 2.51it/s]
Training 1/1 epoch (loss 1.8351): 59%|ββββββ | 550/938 [03:51<02:34, 2.51it/s]
Training 1/1 epoch (loss 1.8351): 59%|ββββββ | 551/938 [03:51<02:31, 2.55it/s]
Training 1/1 epoch (loss 1.7663): 59%|ββββββ | 551/938 [03:52<02:31, 2.55it/s]
Training 1/1 epoch (loss 1.7663): 59%|ββββββ | 552/938 [03:52<02:30, 2.57it/s]
Training 1/1 epoch (loss 1.8727): 59%|ββββββ | 552/938 [03:52<02:30, 2.57it/s]
Training 1/1 epoch (loss 1.8727): 59%|ββββββ | 553/938 [03:52<02:27, 2.60it/s]
Training 1/1 epoch (loss 1.8385): 59%|ββββββ | 553/938 [03:52<02:27, 2.60it/s]
Training 1/1 epoch (loss 1.8385): 59%|ββββββ | 554/938 [03:52<02:27, 2.60it/s]
Training 1/1 epoch (loss 1.6988): 59%|ββββββ | 554/938 [03:53<02:27, 2.60it/s]
Training 1/1 epoch (loss 1.6988): 59%|ββββββ | 555/938 [03:53<02:33, 2.50it/s]
Training 1/1 epoch (loss 1.8372): 59%|ββββββ | 555/938 [03:53<02:33, 2.50it/s]
Training 1/1 epoch (loss 1.8372): 59%|ββββββ | 556/938 [03:53<02:28, 2.58it/s]
Training 1/1 epoch (loss 1.8541): 59%|ββββββ | 556/938 [03:54<02:28, 2.58it/s]
Training 1/1 epoch (loss 1.8541): 59%|ββββββ | 557/938 [03:54<02:25, 2.61it/s]
Training 1/1 epoch (loss 1.8108): 59%|ββββββ | 557/938 [03:54<02:25, 2.61it/s]
Training 1/1 epoch (loss 1.8108): 59%|ββββββ | 558/938 [03:54<02:33, 2.47it/s]
Training 1/1 epoch (loss 1.8046): 59%|ββββββ | 558/938 [03:55<02:33, 2.47it/s]
Training 1/1 epoch (loss 1.8046): 60%|ββββββ | 559/938 [03:55<02:42, 2.33it/s]
Training 1/1 epoch (loss 1.7254): 60%|ββββββ | 559/938 [03:55<02:42, 2.33it/s]
Training 1/1 epoch (loss 1.7254): 60%|ββββββ | 560/938 [03:55<02:51, 2.20it/s]
Training 1/1 epoch (loss 1.8617): 60%|ββββββ | 560/938 [03:55<02:51, 2.20it/s]
Training 1/1 epoch (loss 1.8617): 60%|ββββββ | 561/938 [03:55<02:40, 2.35it/s]
Training 1/1 epoch (loss 1.9248): 60%|ββββββ | 561/938 [03:56<02:40, 2.35it/s]
Training 1/1 epoch (loss 1.9248): 60%|ββββββ | 562/938 [03:56<02:35, 2.42it/s]
Training 1/1 epoch (loss 1.7124): 60%|ββββββ | 562/938 [03:56<02:35, 2.42it/s]
Training 1/1 epoch (loss 1.7124): 60%|ββββββ | 563/938 [03:56<02:34, 2.42it/s]
Training 1/1 epoch (loss 1.8942): 60%|ββββββ | 563/938 [03:57<02:34, 2.42it/s]
Training 1/1 epoch (loss 1.8942): 60%|ββββββ | 564/938 [03:57<02:32, 2.45it/s]
Training 1/1 epoch (loss 1.8360): 60%|ββββββ | 564/938 [03:57<02:32, 2.45it/s]
Training 1/1 epoch (loss 1.8360): 60%|ββββββ | 565/938 [03:57<02:37, 2.37it/s]
Training 1/1 epoch (loss 1.7816): 60%|ββββββ | 565/938 [03:57<02:37, 2.37it/s]
Training 1/1 epoch (loss 1.7816): 60%|ββββββ | 566/938 [03:57<02:27, 2.52it/s]
Training 1/1 epoch (loss 1.9018): 60%|ββββββ | 566/938 [03:58<02:27, 2.52it/s]
Training 1/1 epoch (loss 1.9018): 60%|ββββββ | 567/938 [03:58<02:28, 2.50it/s]
Training 1/1 epoch (loss 1.8707): 60%|ββββββ | 567/938 [03:58<02:28, 2.50it/s]
Training 1/1 epoch (loss 1.8707): 61%|ββββββ | 568/938 [03:58<02:32, 2.43it/s]
Training 1/1 epoch (loss 1.9798): 61%|ββββββ | 568/938 [03:59<02:32, 2.43it/s]
Training 1/1 epoch (loss 1.9798): 61%|ββββββ | 569/938 [03:59<02:23, 2.57it/s]
Training 1/1 epoch (loss 1.7833): 61%|ββββββ | 569/938 [03:59<02:23, 2.57it/s]
Training 1/1 epoch (loss 1.7833): 61%|ββββββ | 570/938 [03:59<02:31, 2.43it/s]
Training 1/1 epoch (loss 1.8718): 61%|ββββββ | 570/938 [04:00<02:31, 2.43it/s]
Training 1/1 epoch (loss 1.8718): 61%|ββββββ | 571/938 [04:00<02:58, 2.06it/s]
Training 1/1 epoch (loss 1.8195): 61%|ββββββ | 571/938 [04:00<02:58, 2.06it/s]
Training 1/1 epoch (loss 1.8195): 61%|ββββββ | 572/938 [04:00<02:46, 2.19it/s]
Training 1/1 epoch (loss 1.8403): 61%|ββββββ | 572/938 [04:01<02:46, 2.19it/s]
Training 1/1 epoch (loss 1.8403): 61%|ββββββ | 573/938 [04:01<02:38, 2.30it/s]
Training 1/1 epoch (loss 1.8128): 61%|ββββββ | 573/938 [04:01<02:38, 2.30it/s]
Training 1/1 epoch (loss 1.8128): 61%|ββββββ | 574/938 [04:01<02:33, 2.38it/s]
Training 1/1 epoch (loss 1.7883): 61%|ββββββ | 574/938 [04:01<02:33, 2.38it/s]
Training 1/1 epoch (loss 1.7883): 61%|βββββββ | 575/938 [04:01<02:28, 2.45it/s]
Training 1/1 epoch (loss 1.7747): 61%|βββββββ | 575/938 [04:02<02:28, 2.45it/s]
Training 1/1 epoch (loss 1.7747): 61%|βββββββ | 576/938 [04:02<02:22, 2.53it/s]
Training 1/1 epoch (loss 1.7917): 61%|βββββββ | 576/938 [04:02<02:22, 2.53it/s]
Training 1/1 epoch (loss 1.7917): 62%|βββββββ | 577/938 [04:02<02:21, 2.55it/s]
Training 1/1 epoch (loss 1.9325): 62%|βββββββ | 577/938 [04:02<02:21, 2.55it/s]
Training 1/1 epoch (loss 1.9325): 62%|βββββββ | 578/938 [04:02<02:16, 2.64it/s]
Training 1/1 epoch (loss 1.8498): 62%|βββββββ | 578/938 [04:03<02:16, 2.64it/s]
Training 1/1 epoch (loss 1.8498): 62%|βββββββ | 579/938 [04:03<02:12, 2.70it/s]
Training 1/1 epoch (loss 1.8744): 62%|βββββββ | 579/938 [04:03<02:12, 2.70it/s]
Training 1/1 epoch (loss 1.8744): 62%|βββββββ | 580/938 [04:03<02:14, 2.66it/s]
Training 1/1 epoch (loss 1.8184): 62%|βββββββ | 580/938 [04:04<02:14, 2.66it/s]
Training 1/1 epoch (loss 1.8184): 62%|βββββββ | 581/938 [04:04<02:16, 2.61it/s]
Training 1/1 epoch (loss 1.7420): 62%|βββββββ | 581/938 [04:04<02:16, 2.61it/s]
Training 1/1 epoch (loss 1.7420): 62%|βββββββ | 582/938 [04:04<02:13, 2.66it/s]
Training 1/1 epoch (loss 1.8332): 62%|βββββββ | 582/938 [04:04<02:13, 2.66it/s]
Training 1/1 epoch (loss 1.8332): 62%|βββββββ | 583/938 [04:04<02:11, 2.69it/s]
Training 1/1 epoch (loss 1.7103): 62%|βββββββ | 583/938 [04:05<02:11, 2.69it/s]
Training 1/1 epoch (loss 1.7103): 62%|βββββββ | 584/938 [04:05<02:28, 2.39it/s]
Training 1/1 epoch (loss 1.6962): 62%|βββββββ | 584/938 [04:05<02:28, 2.39it/s]
Training 1/1 epoch (loss 1.6962): 62%|βββββββ | 585/938 [04:05<02:28, 2.38it/s]
Training 1/1 epoch (loss 1.8375): 62%|βββββββ | 585/938 [04:06<02:28, 2.38it/s]
Training 1/1 epoch (loss 1.8375): 62%|βββββββ | 586/938 [04:06<02:25, 2.42it/s]
Training 1/1 epoch (loss 1.8611): 62%|βββββββ | 586/938 [04:06<02:25, 2.42it/s]
Training 1/1 epoch (loss 1.8611): 63%|βββββββ | 587/938 [04:06<02:18, 2.54it/s]
Training 1/1 epoch (loss 1.8378): 63%|βββββββ | 587/938 [04:06<02:18, 2.54it/s]
Training 1/1 epoch (loss 1.8378): 63%|βββββββ | 588/938 [04:06<02:19, 2.51it/s]
Training 1/1 epoch (loss 1.8787): 63%|βββββββ | 588/938 [04:07<02:19, 2.51it/s]
Training 1/1 epoch (loss 1.8787): 63%|βββββββ | 589/938 [04:07<02:19, 2.50it/s]
Training 1/1 epoch (loss 1.9118): 63%|βββββββ | 589/938 [04:07<02:19, 2.50it/s]
Training 1/1 epoch (loss 1.9118): 63%|βββββββ | 590/938 [04:07<02:17, 2.53it/s]
Training 1/1 epoch (loss 1.7790): 63%|βββββββ | 590/938 [04:07<02:17, 2.53it/s]
Training 1/1 epoch (loss 1.7790): 63%|βββββββ | 591/938 [04:07<02:14, 2.57it/s]
Training 1/1 epoch (loss 1.7238): 63%|βββββββ | 591/938 [04:08<02:14, 2.57it/s]
Training 1/1 epoch (loss 1.7238): 63%|βββββββ | 592/938 [04:08<02:14, 2.57it/s]
Training 1/1 epoch (loss 1.8666): 63%|βββββββ | 592/938 [04:08<02:14, 2.57it/s]
Training 1/1 epoch (loss 1.8666): 63%|βββββββ | 593/938 [04:08<02:10, 2.64it/s]
Training 1/1 epoch (loss 1.8240): 63%|βββββββ | 593/938 [04:09<02:10, 2.64it/s]
Training 1/1 epoch (loss 1.8240): 63%|βββββββ | 594/938 [04:09<02:10, 2.63it/s]
Training 1/1 epoch (loss 1.7206): 63%|βββββββ | 594/938 [04:09<02:10, 2.63it/s]
Training 1/1 epoch (loss 1.7206): 63%|βββββββ | 595/938 [04:09<02:11, 2.60it/s]
Training 1/1 epoch (loss 1.8650): 63%|βββββββ | 595/938 [04:09<02:11, 2.60it/s]
Training 1/1 epoch (loss 1.8650): 64%|βββββββ | 596/938 [04:09<02:07, 2.68it/s]
Training 1/1 epoch (loss 1.8206): 64%|βββββββ | 596/938 [04:10<02:07, 2.68it/s]
Training 1/1 epoch (loss 1.8206): 64%|βββββββ | 597/938 [04:10<02:12, 2.58it/s]
Training 1/1 epoch (loss 1.9382): 64%|βββββββ | 597/938 [04:10<02:12, 2.58it/s]
Training 1/1 epoch (loss 1.9382): 64%|βββββββ | 598/938 [04:10<02:16, 2.50it/s]
Training 1/1 epoch (loss 1.8202): 64%|βββββββ | 598/938 [04:11<02:16, 2.50it/s]
Training 1/1 epoch (loss 1.8202): 64%|βββββββ | 599/938 [04:11<02:14, 2.52it/s]
Training 1/1 epoch (loss 1.7342): 64%|βββββββ | 599/938 [04:11<02:14, 2.52it/s]
Training 1/1 epoch (loss 1.7342): 64%|βββββββ | 600/938 [04:11<02:12, 2.55it/s]
Training 1/1 epoch (loss 1.8671): 64%|βββββββ | 600/938 [04:11<02:12, 2.55it/s]
Training 1/1 epoch (loss 1.8671): 64%|βββββββ | 601/938 [04:11<02:09, 2.60it/s]
Training 1/1 epoch (loss 1.7049): 64%|βββββββ | 601/938 [04:12<02:09, 2.60it/s]
Training 1/1 epoch (loss 1.7049): 64%|βββββββ | 602/938 [04:12<02:11, 2.56it/s]
Training 1/1 epoch (loss 1.8923): 64%|βββββββ | 602/938 [04:12<02:11, 2.56it/s]
Training 1/1 epoch (loss 1.8923): 64%|βββββββ | 603/938 [04:12<02:08, 2.61it/s]
Training 1/1 epoch (loss 1.8871): 64%|βββββββ | 603/938 [04:12<02:08, 2.61it/s]
Training 1/1 epoch (loss 1.8871): 64%|βββββββ | 604/938 [04:12<02:05, 2.65it/s]
Training 1/1 epoch (loss 1.7846): 64%|βββββββ | 604/938 [04:13<02:05, 2.65it/s]
Training 1/1 epoch (loss 1.7846): 64%|βββββββ | 605/938 [04:13<02:10, 2.56it/s]
Training 1/1 epoch (loss 1.8476): 64%|βββββββ | 605/938 [04:13<02:10, 2.56it/s]
Training 1/1 epoch (loss 1.8476): 65%|βββββββ | 606/938 [04:13<02:09, 2.56it/s]
Training 1/1 epoch (loss 1.7674): 65%|βββββββ | 606/938 [04:14<02:09, 2.56it/s]
Training 1/1 epoch (loss 1.7674): 65%|βββββββ | 607/938 [04:14<02:07, 2.60it/s]
Training 1/1 epoch (loss 1.8798): 65%|βββββββ | 607/938 [04:14<02:07, 2.60it/s]
Training 1/1 epoch (loss 1.8798): 65%|βββββββ | 608/938 [04:14<02:16, 2.42it/s]
Training 1/1 epoch (loss 1.7866): 65%|βββββββ | 608/938 [04:15<02:16, 2.42it/s]
Training 1/1 epoch (loss 1.7866): 65%|βββββββ | 609/938 [04:15<02:15, 2.43it/s]
Training 1/1 epoch (loss 1.7985): 65%|βββββββ | 609/938 [04:15<02:15, 2.43it/s]
Training 1/1 epoch (loss 1.7985): 65%|βββββββ | 610/938 [04:15<02:19, 2.35it/s]
Training 1/1 epoch (loss 1.8666): 65%|βββββββ | 610/938 [04:16<02:19, 2.35it/s]
Training 1/1 epoch (loss 1.8666): 65%|βββββββ | 611/938 [04:16<02:26, 2.24it/s]
Training 1/1 epoch (loss 1.7725): 65%|βββββββ | 611/938 [04:16<02:26, 2.24it/s]
Training 1/1 epoch (loss 1.7725): 65%|βββββββ | 612/938 [04:16<02:17, 2.37it/s]
Training 1/1 epoch (loss 1.5944): 65%|βββββββ | 612/938 [04:16<02:17, 2.37it/s]
Training 1/1 epoch (loss 1.5944): 65%|βββββββ | 613/938 [04:16<02:13, 2.43it/s]
Training 1/1 epoch (loss 1.9424): 65%|βββββββ | 613/938 [04:17<02:13, 2.43it/s]
Training 1/1 epoch (loss 1.9424): 65%|βββββββ | 614/938 [04:17<02:19, 2.33it/s]
Training 1/1 epoch (loss 1.6690): 65%|βββββββ | 614/938 [04:17<02:19, 2.33it/s]
Training 1/1 epoch (loss 1.6690): 66%|βββββββ | 615/938 [04:17<02:12, 2.43it/s]
Training 1/1 epoch (loss 1.8161): 66%|βββββββ | 615/938 [04:18<02:12, 2.43it/s]
Training 1/1 epoch (loss 1.8161): 66%|βββββββ | 616/938 [04:18<02:12, 2.43it/s]
Training 1/1 epoch (loss 1.6681): 66%|βββββββ | 616/938 [04:18<02:12, 2.43it/s]
Training 1/1 epoch (loss 1.6681): 66%|βββββββ | 617/938 [04:18<02:10, 2.46it/s]
Training 1/1 epoch (loss 1.7428): 66%|βββββββ | 617/938 [04:18<02:10, 2.46it/s]
Training 1/1 epoch (loss 1.7428): 66%|βββββββ | 618/938 [04:18<02:04, 2.57it/s]
Training 1/1 epoch (loss 1.8348): 66%|βββββββ | 618/938 [04:19<02:04, 2.57it/s]
Training 1/1 epoch (loss 1.8348): 66%|βββββββ | 619/938 [04:19<02:02, 2.61it/s]
Training 1/1 epoch (loss 1.6880): 66%|βββββββ | 619/938 [04:19<02:02, 2.61it/s]
Training 1/1 epoch (loss 1.6880): 66%|βββββββ | 620/938 [04:19<02:05, 2.53it/s]
Training 1/1 epoch (loss 1.7313): 66%|βββββββ | 620/938 [04:19<02:05, 2.53it/s]
Training 1/1 epoch (loss 1.7313): 66%|βββββββ | 621/938 [04:19<02:05, 2.52it/s]
Training 1/1 epoch (loss 1.9206): 66%|βββββββ | 621/938 [04:20<02:05, 2.52it/s]
Training 1/1 epoch (loss 1.9206): 66%|βββββββ | 622/938 [04:20<02:02, 2.57it/s]
Training 1/1 epoch (loss 1.6799): 66%|βββββββ | 622/938 [04:20<02:02, 2.57it/s]
Training 1/1 epoch (loss 1.6799): 66%|βββββββ | 623/938 [04:20<02:02, 2.57it/s]
Training 1/1 epoch (loss 1.7637): 66%|βββββββ | 623/938 [04:21<02:02, 2.57it/s]
Training 1/1 epoch (loss 1.7637): 67%|βββββββ | 624/938 [04:21<02:08, 2.44it/s]
Training 1/1 epoch (loss 1.5636): 67%|βββββββ | 624/938 [04:21<02:08, 2.44it/s]
Training 1/1 epoch (loss 1.5636): 67%|βββββββ | 625/938 [04:21<02:06, 2.48it/s]
Training 1/1 epoch (loss 1.8208): 67%|βββββββ | 625/938 [04:21<02:06, 2.48it/s]
Training 1/1 epoch (loss 1.8208): 67%|βββββββ | 626/938 [04:21<02:01, 2.57it/s]
Training 1/1 epoch (loss 1.8643): 67%|βββββββ | 626/938 [04:22<02:01, 2.57it/s]
Training 1/1 epoch (loss 1.8643): 67%|βββββββ | 627/938 [04:22<02:02, 2.54it/s]
Training 1/1 epoch (loss 1.7557): 67%|βββββββ | 627/938 [04:22<02:02, 2.54it/s]
Training 1/1 epoch (loss 1.7557): 67%|βββββββ | 628/938 [04:22<01:57, 2.64it/s]
Training 1/1 epoch (loss 1.7701): 67%|βββββββ | 628/938 [04:23<01:57, 2.64it/s]
Training 1/1 epoch (loss 1.7701): 67%|βββββββ | 629/938 [04:23<01:56, 2.66it/s]
Training 1/1 epoch (loss 1.8290): 67%|βββββββ | 629/938 [04:23<01:56, 2.66it/s]
Training 1/1 epoch (loss 1.8290): 67%|βββββββ | 630/938 [04:23<01:53, 2.71it/s]
Training 1/1 epoch (loss 1.7004): 67%|βββββββ | 630/938 [04:23<01:53, 2.71it/s]
Training 1/1 epoch (loss 1.7004): 67%|βββββββ | 631/938 [04:23<01:54, 2.69it/s]
Training 1/1 epoch (loss 1.9113): 67%|βββββββ | 631/938 [04:24<01:54, 2.69it/s]
Training 1/1 epoch (loss 1.9113): 67%|βββββββ | 632/938 [04:24<01:56, 2.62it/s]
Training 1/1 epoch (loss 1.6454): 67%|βββββββ | 632/938 [04:24<01:56, 2.62it/s]
Training 1/1 epoch (loss 1.6454): 67%|βββββββ | 633/938 [04:24<01:59, 2.55it/s]
Training 1/1 epoch (loss 1.7736): 67%|βββββββ | 633/938 [04:25<01:59, 2.55it/s]
Training 1/1 epoch (loss 1.7736): 68%|βββββββ | 634/938 [04:25<02:05, 2.41it/s]
Training 1/1 epoch (loss 1.7384): 68%|βββββββ | 634/938 [04:25<02:05, 2.41it/s]
Training 1/1 epoch (loss 1.7384): 68%|βββββββ | 635/938 [04:25<02:12, 2.29it/s]
Training 1/1 epoch (loss 1.7581): 68%|βββββββ | 635/938 [04:25<02:12, 2.29it/s]
Training 1/1 epoch (loss 1.7581): 68%|βββββββ | 636/938 [04:25<02:06, 2.39it/s]
Training 1/1 epoch (loss 1.9050): 68%|βββββββ | 636/938 [04:26<02:06, 2.39it/s]
Training 1/1 epoch (loss 1.9050): 68%|βββββββ | 637/938 [04:26<02:08, 2.33it/s]
Training 1/1 epoch (loss 1.8082): 68%|βββββββ | 637/938 [04:26<02:08, 2.33it/s]
Training 1/1 epoch (loss 1.8082): 68%|βββββββ | 638/938 [04:26<02:08, 2.34it/s]
Training 1/1 epoch (loss 1.6799): 68%|βββββββ | 638/938 [04:27<02:08, 2.34it/s]
Training 1/1 epoch (loss 1.6799): 68%|βββββββ | 639/938 [04:27<02:07, 2.35it/s]
Training 1/1 epoch (loss 1.8737): 68%|βββββββ | 639/938 [04:27<02:07, 2.35it/s]
Training 1/1 epoch (loss 1.8737): 68%|βββββββ | 640/938 [04:27<02:07, 2.33it/s]
Training 1/1 epoch (loss 1.6740): 68%|βββββββ | 640/938 [04:28<02:07, 2.33it/s]
Training 1/1 epoch (loss 1.6740): 68%|βββββββ | 641/938 [04:28<02:06, 2.36it/s]
Training 1/1 epoch (loss 1.6522): 68%|βββββββ | 641/938 [04:28<02:06, 2.36it/s]
Training 1/1 epoch (loss 1.6522): 68%|βββββββ | 642/938 [04:28<01:59, 2.48it/s]
Training 1/1 epoch (loss 1.8751): 68%|βββββββ | 642/938 [04:28<01:59, 2.48it/s]
Training 1/1 epoch (loss 1.8751): 69%|βββββββ | 643/938 [04:28<01:53, 2.59it/s]
Training 1/1 epoch (loss 1.6355): 69%|βββββββ | 643/938 [04:29<01:53, 2.59it/s]
Training 1/1 epoch (loss 1.6355): 69%|βββββββ | 644/938 [04:29<01:51, 2.65it/s]
Training 1/1 epoch (loss 1.8008): 69%|βββββββ | 644/938 [04:29<01:51, 2.65it/s]
Training 1/1 epoch (loss 1.8008): 69%|βββββββ | 645/938 [04:29<01:44, 2.81it/s]
Training 1/1 epoch (loss 1.8672): 69%|βββββββ | 645/938 [04:29<01:44, 2.81it/s]
Training 1/1 epoch (loss 1.8672): 69%|βββββββ | 646/938 [04:29<01:47, 2.71it/s]
Training 1/1 epoch (loss 1.8779): 69%|βββββββ | 646/938 [04:30<01:47, 2.71it/s]
Training 1/1 epoch (loss 1.8779): 69%|βββββββ | 647/938 [04:30<01:52, 2.59it/s]
Training 1/1 epoch (loss 1.8809): 69%|βββββββ | 647/938 [04:30<01:52, 2.59it/s]
Training 1/1 epoch (loss 1.8809): 69%|βββββββ | 648/938 [04:30<01:57, 2.48it/s]
Training 1/1 epoch (loss 1.8364): 69%|βββββββ | 648/938 [04:31<01:57, 2.48it/s]
Training 1/1 epoch (loss 1.8364): 69%|βββββββ | 649/938 [04:31<01:52, 2.58it/s]
Training 1/1 epoch (loss 1.7834): 69%|βββββββ | 649/938 [04:31<01:52, 2.58it/s]
Training 1/1 epoch (loss 1.7834): 69%|βββββββ | 650/938 [04:31<01:50, 2.61it/s]
Training 1/1 epoch (loss 1.8015): 69%|βββββββ | 650/938 [04:31<01:50, 2.61it/s]
Training 1/1 epoch (loss 1.8015): 69%|βββββββ | 651/938 [04:31<01:49, 2.63it/s]
Training 1/1 epoch (loss 1.6905): 69%|βββββββ | 651/938 [04:32<01:49, 2.63it/s]
Training 1/1 epoch (loss 1.6905): 70%|βββββββ | 652/938 [04:32<01:48, 2.65it/s]
Training 1/1 epoch (loss 1.9489): 70%|βββββββ | 652/938 [04:32<01:48, 2.65it/s]
Training 1/1 epoch (loss 1.9489): 70%|βββββββ | 653/938 [04:32<01:46, 2.68it/s]
Training 1/1 epoch (loss 1.8702): 70%|βββββββ | 653/938 [04:32<01:46, 2.68it/s]
Training 1/1 epoch (loss 1.8702): 70%|βββββββ | 654/938 [04:32<01:39, 2.84it/s]
Training 1/1 epoch (loss 1.8009): 70%|βββββββ | 654/938 [04:33<01:39, 2.84it/s]
Training 1/1 epoch (loss 1.8009): 70%|βββββββ | 655/938 [04:33<01:42, 2.76it/s]
Training 1/1 epoch (loss 1.8503): 70%|βββββββ | 655/938 [04:33<01:42, 2.76it/s]
Training 1/1 epoch (loss 1.8503): 70%|βββββββ | 656/938 [04:33<01:44, 2.71it/s]
Training 1/1 epoch (loss 1.8771): 70%|βββββββ | 656/938 [04:33<01:44, 2.71it/s]
Training 1/1 epoch (loss 1.8771): 70%|βββββββ | 657/938 [04:33<01:41, 2.76it/s]
Training 1/1 epoch (loss 1.7838): 70%|βββββββ | 657/938 [04:34<01:41, 2.76it/s]
Training 1/1 epoch (loss 1.7838): 70%|βββββββ | 658/938 [04:34<01:37, 2.86it/s]
Training 1/1 epoch (loss 1.8617): 70%|βββββββ | 658/938 [04:34<01:37, 2.86it/s]
Training 1/1 epoch (loss 1.8617): 70%|βββββββ | 659/938 [04:34<01:39, 2.79it/s]
Training 1/1 epoch (loss 1.8264): 70%|βββββββ | 659/938 [04:35<01:39, 2.79it/s]
Training 1/1 epoch (loss 1.8264): 70%|βββββββ | 660/938 [04:35<01:47, 2.59it/s]
Training 1/1 epoch (loss 1.8832): 70%|βββββββ | 660/938 [04:35<01:47, 2.59it/s]
Training 1/1 epoch (loss 1.8832): 70%|βββββββ | 661/938 [04:35<01:46, 2.60it/s]
Training 1/1 epoch (loss 1.7966): 70%|βββββββ | 661/938 [04:35<01:46, 2.60it/s]
Training 1/1 epoch (loss 1.7966): 71%|βββββββ | 662/938 [04:35<01:42, 2.70it/s]
Training 1/1 epoch (loss 1.6712): 71%|βββββββ | 662/938 [04:36<01:42, 2.70it/s]
Training 1/1 epoch (loss 1.6712): 71%|βββββββ | 663/938 [04:36<01:48, 2.53it/s]
Training 1/1 epoch (loss 1.8050): 71%|βββββββ | 663/938 [04:36<01:48, 2.53it/s]
Training 1/1 epoch (loss 1.8050): 71%|βββββββ | 664/938 [04:36<01:59, 2.30it/s]
Training 1/1 epoch (loss 1.7886): 71%|βββββββ | 664/938 [04:37<01:59, 2.30it/s]
Training 1/1 epoch (loss 1.7886): 71%|βββββββ | 665/938 [04:37<02:10, 2.09it/s]
Training 1/1 epoch (loss 1.7274): 71%|βββββββ | 665/938 [04:37<02:10, 2.09it/s]
Training 1/1 epoch (loss 1.7274): 71%|βββββββ | 666/938 [04:37<02:07, 2.14it/s]
Training 1/1 epoch (loss 1.7931): 71%|βββββββ | 666/938 [04:38<02:07, 2.14it/s]
Training 1/1 epoch (loss 1.7931): 71%|βββββββ | 667/938 [04:38<02:08, 2.10it/s]
Training 1/1 epoch (loss 1.6403): 71%|βββββββ | 667/938 [04:38<02:08, 2.10it/s]
Training 1/1 epoch (loss 1.6403): 71%|βββββββ | 668/938 [04:38<02:09, 2.09it/s]
Training 1/1 epoch (loss 1.6578): 71%|βββββββ | 668/938 [04:39<02:09, 2.09it/s]
Training 1/1 epoch (loss 1.6578): 71%|ββββββββ | 669/938 [04:39<02:03, 2.18it/s]
Training 1/1 epoch (loss 1.6897): 71%|ββββββββ | 669/938 [04:39<02:03, 2.18it/s]
Training 1/1 epoch (loss 1.6897): 71%|ββββββββ | 670/938 [04:39<01:55, 2.32it/s]
Training 1/1 epoch (loss 1.8503): 71%|ββββββββ | 670/938 [04:40<01:55, 2.32it/s]
Training 1/1 epoch (loss 1.8503): 72%|ββββββββ | 671/938 [04:40<01:54, 2.32it/s]
Training 1/1 epoch (loss 1.9841): 72%|ββββββββ | 671/938 [04:40<01:54, 2.32it/s]
Training 1/1 epoch (loss 1.9841): 72%|ββββββββ | 672/938 [04:40<01:53, 2.35it/s]
Training 1/1 epoch (loss 1.7763): 72%|ββββββββ | 672/938 [04:40<01:53, 2.35it/s]
Training 1/1 epoch (loss 1.7763): 72%|ββββββββ | 673/938 [04:40<01:52, 2.35it/s]
Training 1/1 epoch (loss 1.7631): 72%|ββββββββ | 673/938 [04:41<01:52, 2.35it/s]
Training 1/1 epoch (loss 1.7631): 72%|ββββββββ | 674/938 [04:41<01:52, 2.34it/s]
Training 1/1 epoch (loss 1.7409): 72%|ββββββββ | 674/938 [04:41<01:52, 2.34it/s]
Training 1/1 epoch (loss 1.7409): 72%|ββββββββ | 675/938 [04:41<01:47, 2.44it/s]
Training 1/1 epoch (loss 1.7931): 72%|ββββββββ | 675/938 [04:42<01:47, 2.44it/s]
Training 1/1 epoch (loss 1.7931): 72%|ββββββββ | 676/938 [04:42<01:45, 2.49it/s]
Training 1/1 epoch (loss 1.9395): 72%|ββββββββ | 676/938 [04:42<01:45, 2.49it/s]
Training 1/1 epoch (loss 1.9395): 72%|ββββββββ | 677/938 [04:42<01:41, 2.56it/s]
Training 1/1 epoch (loss 1.7250): 72%|ββββββββ | 677/938 [04:42<01:41, 2.56it/s]
Training 1/1 epoch (loss 1.7250): 72%|ββββββββ | 678/938 [04:42<01:40, 2.58it/s]
Training 1/1 epoch (loss 1.7642): 72%|ββββββββ | 678/938 [04:43<01:40, 2.58it/s]
Training 1/1 epoch (loss 1.7642): 72%|ββββββββ | 679/938 [04:43<01:42, 2.54it/s]
Training 1/1 epoch (loss 1.7624): 72%|ββββββββ | 679/938 [04:43<01:42, 2.54it/s]
Training 1/1 epoch (loss 1.7624): 72%|ββββββββ | 680/938 [04:43<01:49, 2.35it/s]
Training 1/1 epoch (loss 1.7895): 72%|ββββββββ | 680/938 [04:44<01:49, 2.35it/s]
Training 1/1 epoch (loss 1.7895): 73%|ββββββββ | 681/938 [04:44<01:49, 2.35it/s]
Training 1/1 epoch (loss 1.8179): 73%|ββββββββ | 681/938 [04:44<01:49, 2.35it/s]
Training 1/1 epoch (loss 1.8179): 73%|ββββββββ | 682/938 [04:44<01:44, 2.44it/s]
Training 1/1 epoch (loss 1.8383): 73%|ββββββββ | 682/938 [04:44<01:44, 2.44it/s]
Training 1/1 epoch (loss 1.8383): 73%|ββββββββ | 683/938 [04:44<01:44, 2.44it/s]
Training 1/1 epoch (loss 1.8156): 73%|ββββββββ | 683/938 [04:45<01:44, 2.44it/s]
Training 1/1 epoch (loss 1.8156): 73%|ββββββββ | 684/938 [04:45<01:47, 2.37it/s]
Training 1/1 epoch (loss 1.8667): 73%|ββββββββ | 684/938 [04:45<01:47, 2.37it/s]
Training 1/1 epoch (loss 1.8667): 73%|ββββββββ | 685/938 [04:45<01:44, 2.42it/s]
Training 1/1 epoch (loss 1.9215): 73%|ββββββββ | 685/938 [04:46<01:44, 2.42it/s]
Training 1/1 epoch (loss 1.9215): 73%|ββββββββ | 686/938 [04:46<01:39, 2.54it/s]
Training 1/1 epoch (loss 1.7535): 73%|ββββββββ | 686/938 [04:46<01:39, 2.54it/s]
Training 1/1 epoch (loss 1.7535): 73%|ββββββββ | 687/938 [04:46<01:38, 2.54it/s]
Training 1/1 epoch (loss 1.8105): 73%|ββββββββ | 687/938 [04:46<01:38, 2.54it/s]
Training 1/1 epoch (loss 1.8105): 73%|ββββββββ | 688/938 [04:46<01:46, 2.35it/s]
Training 1/1 epoch (loss 1.8560): 73%|ββββββββ | 688/938 [04:47<01:46, 2.35it/s]
Training 1/1 epoch (loss 1.8560): 73%|ββββββββ | 689/938 [04:47<01:47, 2.32it/s]
Training 1/1 epoch (loss 1.8835): 73%|ββββββββ | 689/938 [04:47<01:47, 2.32it/s]
Training 1/1 epoch (loss 1.8835): 74%|ββββββββ | 690/938 [04:47<01:47, 2.30it/s]
Training 1/1 epoch (loss 1.8566): 74%|ββββββββ | 690/938 [04:48<01:47, 2.30it/s]
Training 1/1 epoch (loss 1.8566): 74%|ββββββββ | 691/938 [04:48<01:43, 2.39it/s]
Training 1/1 epoch (loss 1.8473): 74%|ββββββββ | 691/938 [04:48<01:43, 2.39it/s]
Training 1/1 epoch (loss 1.8473): 74%|ββββββββ | 692/938 [04:48<01:40, 2.45it/s]
Training 1/1 epoch (loss 1.7254): 74%|ββββββββ | 692/938 [04:49<01:40, 2.45it/s]
Training 1/1 epoch (loss 1.7254): 74%|ββββββββ | 693/938 [04:49<01:38, 2.49it/s]
Training 1/1 epoch (loss 1.7256): 74%|ββββββββ | 693/938 [04:49<01:38, 2.49it/s]
Training 1/1 epoch (loss 1.7256): 74%|ββββββββ | 694/938 [04:49<01:35, 2.55it/s]
Training 1/1 epoch (loss 1.7889): 74%|ββββββββ | 694/938 [04:49<01:35, 2.55it/s]
Training 1/1 epoch (loss 1.7889): 74%|ββββββββ | 695/938 [04:49<01:35, 2.54it/s]
Training 1/1 epoch (loss 1.6193): 74%|ββββββββ | 695/938 [04:50<01:35, 2.54it/s]
Training 1/1 epoch (loss 1.6193): 74%|ββββββββ | 696/938 [04:50<01:55, 2.09it/s]
Training 1/1 epoch (loss 1.8688): 74%|ββββββββ | 696/938 [04:50<01:55, 2.09it/s]
Training 1/1 epoch (loss 1.8688): 74%|ββββββββ | 697/938 [04:50<01:47, 2.25it/s]
Training 1/1 epoch (loss 1.7537): 74%|ββββββββ | 697/938 [04:51<01:47, 2.25it/s]
Training 1/1 epoch (loss 1.7537): 74%|ββββββββ | 698/938 [04:51<02:04, 1.92it/s]
Training 1/1 epoch (loss 1.6803): 74%|ββββββββ | 698/938 [04:51<02:04, 1.92it/s]
Training 1/1 epoch (loss 1.6803): 75%|ββββββββ | 699/938 [04:51<01:53, 2.10it/s]
Training 1/1 epoch (loss 1.8069): 75%|ββββββββ | 699/938 [04:52<01:53, 2.10it/s]
Training 1/1 epoch (loss 1.8069): 75%|ββββββββ | 700/938 [04:52<01:46, 2.23it/s]
Training 1/1 epoch (loss 1.7211): 75%|ββββββββ | 700/938 [04:52<01:46, 2.23it/s]
Training 1/1 epoch (loss 1.7211): 75%|ββββββββ | 701/938 [04:52<01:46, 2.23it/s]
Training 1/1 epoch (loss 1.7093): 75%|ββββββββ | 701/938 [04:53<01:46, 2.23it/s]
Training 1/1 epoch (loss 1.7093): 75%|ββββββββ | 702/938 [04:53<01:42, 2.31it/s]
Training 1/1 epoch (loss 1.8657): 75%|ββββββββ | 702/938 [04:53<01:42, 2.31it/s]
Training 1/1 epoch (loss 1.8657): 75%|ββββββββ | 703/938 [04:53<01:36, 2.43it/s]
Training 1/1 epoch (loss 1.7381): 75%|ββββββββ | 703/938 [04:53<01:36, 2.43it/s]
Training 1/1 epoch (loss 1.7381): 75%|ββββββββ | 704/938 [04:53<01:37, 2.40it/s]
Training 1/1 epoch (loss 1.7645): 75%|ββββββββ | 704/938 [04:54<01:37, 2.40it/s]
Training 1/1 epoch (loss 1.7645): 75%|ββββββββ | 705/938 [04:54<01:38, 2.37it/s]
Training 1/1 epoch (loss 1.6945): 75%|ββββββββ | 705/938 [04:54<01:38, 2.37it/s]
Training 1/1 epoch (loss 1.6945): 75%|ββββββββ | 706/938 [04:54<01:41, 2.28it/s]
Training 1/1 epoch (loss 1.7883): 75%|ββββββββ | 706/938 [04:55<01:41, 2.28it/s]
Training 1/1 epoch (loss 1.7883): 75%|ββββββββ | 707/938 [04:55<01:40, 2.30it/s]
Training 1/1 epoch (loss 1.7947): 75%|ββββββββ | 707/938 [04:55<01:40, 2.30it/s]
Training 1/1 epoch (loss 1.7947): 75%|ββββββββ | 708/938 [04:55<01:36, 2.37it/s]
Training 1/1 epoch (loss 1.8913): 75%|ββββββββ | 708/938 [04:56<01:36, 2.37it/s]
Training 1/1 epoch (loss 1.8913): 76%|ββββββββ | 709/938 [04:56<01:34, 2.42it/s]
Training 1/1 epoch (loss 1.7108): 76%|ββββββββ | 709/938 [04:56<01:34, 2.42it/s]
Training 1/1 epoch (loss 1.7108): 76%|ββββββββ | 710/938 [04:56<01:32, 2.47it/s]
Training 1/1 epoch (loss 1.8581): 76%|ββββββββ | 710/938 [04:56<01:32, 2.47it/s]
Training 1/1 epoch (loss 1.8581): 76%|ββββββββ | 711/938 [04:56<01:31, 2.49it/s]
Training 1/1 epoch (loss 1.7703): 76%|ββββββββ | 711/938 [04:57<01:31, 2.49it/s]
Training 1/1 epoch (loss 1.7703): 76%|ββββββββ | 712/938 [04:57<01:36, 2.33it/s]
Training 1/1 epoch (loss 1.7567): 76%|ββββββββ | 712/938 [04:57<01:36, 2.33it/s]
Training 1/1 epoch (loss 1.7567): 76%|ββββββββ | 713/938 [04:57<01:34, 2.37it/s]
Training 1/1 epoch (loss 1.7801): 76%|ββββββββ | 713/938 [04:58<01:34, 2.37it/s]
Training 1/1 epoch (loss 1.7801): 76%|ββββββββ | 714/938 [04:58<01:34, 2.38it/s]
Training 1/1 epoch (loss 1.7796): 76%|ββββββββ | 714/938 [04:58<01:34, 2.38it/s]
Training 1/1 epoch (loss 1.7796): 76%|ββββββββ | 715/938 [04:58<01:38, 2.26it/s]
Training 1/1 epoch (loss 1.7178): 76%|ββββββββ | 715/938 [04:58<01:38, 2.26it/s]
Training 1/1 epoch (loss 1.7178): 76%|ββββββββ | 716/938 [04:58<01:33, 2.39it/s]
Training 1/1 epoch (loss 1.7524): 76%|ββββββββ | 716/938 [04:59<01:33, 2.39it/s]
Training 1/1 epoch (loss 1.7524): 76%|ββββββββ | 717/938 [04:59<01:38, 2.25it/s]
Training 1/1 epoch (loss 1.7421): 76%|ββββββββ | 717/938 [05:00<01:38, 2.25it/s]
Training 1/1 epoch (loss 1.7421): 77%|ββββββββ | 718/938 [05:00<01:44, 2.10it/s]
Training 1/1 epoch (loss 1.8539): 77%|ββββββββ | 718/938 [05:00<01:44, 2.10it/s]
Training 1/1 epoch (loss 1.8539): 77%|ββββββββ | 719/938 [05:00<01:43, 2.12it/s]
Training 1/1 epoch (loss 1.7380): 77%|ββββββββ | 719/938 [05:00<01:43, 2.12it/s]
Training 1/1 epoch (loss 1.7380): 77%|ββββββββ | 720/938 [05:00<01:42, 2.13it/s]
Training 1/1 epoch (loss 1.6993): 77%|ββββββββ | 720/938 [05:01<01:42, 2.13it/s]
Training 1/1 epoch (loss 1.6993): 77%|ββββββββ | 721/938 [05:01<01:39, 2.19it/s]
Training 1/1 epoch (loss 1.6829): 77%|ββββββββ | 721/938 [05:01<01:39, 2.19it/s]
Training 1/1 epoch (loss 1.6829): 77%|ββββββββ | 722/938 [05:01<01:37, 2.22it/s]
Training 1/1 epoch (loss 1.5802): 77%|ββββββββ | 722/938 [05:02<01:37, 2.22it/s]
Training 1/1 epoch (loss 1.5802): 77%|ββββββββ | 723/938 [05:02<01:33, 2.30it/s]
Training 1/1 epoch (loss 1.6937): 77%|ββββββββ | 723/938 [05:02<01:33, 2.30it/s]
Training 1/1 epoch (loss 1.6937): 77%|ββββββββ | 724/938 [05:02<01:29, 2.38it/s]
Training 1/1 epoch (loss 1.7786): 77%|ββββββββ | 724/938 [05:03<01:29, 2.38it/s]
Training 1/1 epoch (loss 1.7786): 77%|ββββββββ | 725/938 [05:03<01:30, 2.36it/s]
Training 1/1 epoch (loss 1.7159): 77%|ββββββββ | 725/938 [05:03<01:30, 2.36it/s]
Training 1/1 epoch (loss 1.7159): 77%|ββββββββ | 726/938 [05:03<01:36, 2.19it/s]
Training 1/1 epoch (loss 1.8085): 77%|ββββββββ | 726/938 [05:03<01:36, 2.19it/s]
Training 1/1 epoch (loss 1.8085): 78%|ββββββββ | 727/938 [05:03<01:33, 2.27it/s]
Training 1/1 epoch (loss 1.6172): 78%|ββββββββ | 727/938 [05:04<01:33, 2.27it/s]
Training 1/1 epoch (loss 1.6172): 78%|ββββββββ | 728/938 [05:04<01:34, 2.23it/s]
Training 1/1 epoch (loss 1.6810): 78%|ββββββββ | 728/938 [05:04<01:34, 2.23it/s]
Training 1/1 epoch (loss 1.6810): 78%|ββββββββ | 729/938 [05:04<01:37, 2.15it/s]
Training 1/1 epoch (loss 1.7852): 78%|ββββββββ | 729/938 [05:05<01:37, 2.15it/s]
Training 1/1 epoch (loss 1.7852): 78%|ββββββββ | 730/938 [05:05<01:36, 2.17it/s]
Training 1/1 epoch (loss 1.8237): 78%|ββββββββ | 730/938 [05:05<01:36, 2.17it/s]
Training 1/1 epoch (loss 1.8237): 78%|ββββββββ | 731/938 [05:05<01:36, 2.15it/s]
Training 1/1 epoch (loss 1.7431): 78%|ββββββββ | 731/938 [05:06<01:36, 2.15it/s]
Training 1/1 epoch (loss 1.7431): 78%|ββββββββ | 732/938 [05:06<01:30, 2.28it/s]
Training 1/1 epoch (loss 1.7915): 78%|ββββββββ | 732/938 [05:06<01:30, 2.28it/s]
Training 1/1 epoch (loss 1.7915): 78%|ββββββββ | 733/938 [05:06<01:31, 2.25it/s]
Training 1/1 epoch (loss 1.7975): 78%|ββββββββ | 733/938 [05:07<01:31, 2.25it/s]
Training 1/1 epoch (loss 1.7975): 78%|ββββββββ | 734/938 [05:07<01:25, 2.39it/s]
Training 1/1 epoch (loss 1.8127): 78%|ββββββββ | 734/938 [05:07<01:25, 2.39it/s]
Training 1/1 epoch (loss 1.8127): 78%|ββββββββ | 735/938 [05:07<01:26, 2.35it/s]
Training 1/1 epoch (loss 1.8131): 78%|ββββββββ | 735/938 [05:08<01:26, 2.35it/s]
Training 1/1 epoch (loss 1.8131): 78%|ββββββββ | 736/938 [05:08<01:30, 2.22it/s]
Training 1/1 epoch (loss 1.7628): 78%|ββββββββ | 736/938 [05:08<01:30, 2.22it/s]
Training 1/1 epoch (loss 1.7628): 79%|ββββββββ | 737/938 [05:08<01:31, 2.20it/s]
Training 1/1 epoch (loss 1.7115): 79%|ββββββββ | 737/938 [05:08<01:31, 2.20it/s]
Training 1/1 epoch (loss 1.7115): 79%|ββββββββ | 738/938 [05:08<01:27, 2.29it/s]
Training 1/1 epoch (loss 1.6834): 79%|ββββββββ | 738/938 [05:09<01:27, 2.29it/s]
Training 1/1 epoch (loss 1.6834): 79%|ββββββββ | 739/938 [05:09<01:23, 2.39it/s]
Training 1/1 epoch (loss 1.7998): 79%|ββββββββ | 739/938 [05:09<01:23, 2.39it/s]
Training 1/1 epoch (loss 1.7998): 79%|ββββββββ | 740/938 [05:09<01:22, 2.41it/s]
Training 1/1 epoch (loss 1.7909): 79%|ββββββββ | 740/938 [05:10<01:22, 2.41it/s]
Training 1/1 epoch (loss 1.7909): 79%|ββββββββ | 741/938 [05:10<01:20, 2.46it/s]
Training 1/1 epoch (loss 1.7840): 79%|ββββββββ | 741/938 [05:10<01:20, 2.46it/s]
Training 1/1 epoch (loss 1.7840): 79%|ββββββββ | 742/938 [05:10<01:14, 2.62it/s]
Training 1/1 epoch (loss 1.8485): 79%|ββββββββ | 742/938 [05:10<01:14, 2.62it/s]
Training 1/1 epoch (loss 1.8485): 79%|ββββββββ | 743/938 [05:10<01:11, 2.71it/s]
Training 1/1 epoch (loss 1.7563): 79%|ββββββββ | 743/938 [05:11<01:11, 2.71it/s]
Training 1/1 epoch (loss 1.7563): 79%|ββββββββ | 744/938 [05:11<01:14, 2.61it/s]
Training 1/1 epoch (loss 1.8931): 79%|ββββββββ | 744/938 [05:11<01:14, 2.61it/s]
Training 1/1 epoch (loss 1.8931): 79%|ββββββββ | 745/938 [05:11<01:13, 2.64it/s]
Training 1/1 epoch (loss 1.7380): 79%|ββββββββ | 745/938 [05:12<01:13, 2.64it/s]
Training 1/1 epoch (loss 1.7380): 80%|ββββββββ | 746/938 [05:12<01:24, 2.28it/s]
Training 1/1 epoch (loss 1.9314): 80%|ββββββββ | 746/938 [05:12<01:24, 2.28it/s]
Training 1/1 epoch (loss 1.9314): 80%|ββββββββ | 747/938 [05:12<01:19, 2.39it/s]
Training 1/1 epoch (loss 1.7438): 80%|ββββββββ | 747/938 [05:12<01:19, 2.39it/s]
Training 1/1 epoch (loss 1.7438): 80%|ββββββββ | 748/938 [05:12<01:19, 2.40it/s]
Training 1/1 epoch (loss 1.6624): 80%|ββββββββ | 748/938 [05:13<01:19, 2.40it/s]
Training 1/1 epoch (loss 1.6624): 80%|ββββββββ | 749/938 [05:13<01:25, 2.21it/s]
Training 1/1 epoch (loss 1.7310): 80%|ββββββββ | 749/938 [05:13<01:25, 2.21it/s]
Training 1/1 epoch (loss 1.7310): 80%|ββββββββ | 750/938 [05:13<01:27, 2.16it/s]
Training 1/1 epoch (loss 1.7625): 80%|ββββββββ | 750/938 [05:14<01:27, 2.16it/s]
Training 1/1 epoch (loss 1.7625): 80%|ββββββββ | 751/938 [05:14<01:27, 2.14it/s]
Training 1/1 epoch (loss 1.7623): 80%|ββββββββ | 751/938 [05:14<01:27, 2.14it/s]
Training 1/1 epoch (loss 1.7623): 80%|ββββββββ | 752/938 [05:14<01:22, 2.26it/s]
Training 1/1 epoch (loss 1.8506): 80%|ββββββββ | 752/938 [05:15<01:22, 2.26it/s]
Training 1/1 epoch (loss 1.8506): 80%|ββββββββ | 753/938 [05:15<01:24, 2.20it/s]
Training 1/1 epoch (loss 1.8157): 80%|ββββββββ | 753/938 [05:15<01:24, 2.20it/s]
Training 1/1 epoch (loss 1.8157): 80%|ββββββββ | 754/938 [05:15<01:19, 2.31it/s]
Training 1/1 epoch (loss 1.8044): 80%|ββββββββ | 754/938 [05:16<01:19, 2.31it/s]
Training 1/1 epoch (loss 1.8044): 80%|ββββββββ | 755/938 [05:16<01:19, 2.29it/s]
Training 1/1 epoch (loss 1.7805): 80%|ββββββββ | 755/938 [05:16<01:19, 2.29it/s]
Training 1/1 epoch (loss 1.7805): 81%|ββββββββ | 756/938 [05:16<01:14, 2.44it/s]
Training 1/1 epoch (loss 1.7018): 81%|ββββββββ | 756/938 [05:16<01:14, 2.44it/s]
Training 1/1 epoch (loss 1.7018): 81%|ββββββββ | 757/938 [05:16<01:19, 2.29it/s]
Training 1/1 epoch (loss 1.9108): 81%|ββββββββ | 757/938 [05:17<01:19, 2.29it/s]
Training 1/1 epoch (loss 1.9108): 81%|ββββββββ | 758/938 [05:17<01:16, 2.36it/s]
Training 1/1 epoch (loss 1.7702): 81%|ββββββββ | 758/938 [05:17<01:16, 2.36it/s]
Training 1/1 epoch (loss 1.7702): 81%|ββββββββ | 759/938 [05:17<01:11, 2.49it/s]
Training 1/1 epoch (loss 1.6003): 81%|ββββββββ | 759/938 [05:18<01:11, 2.49it/s]
Training 1/1 epoch (loss 1.6003): 81%|ββββββββ | 760/938 [05:18<01:14, 2.39it/s]
Training 1/1 epoch (loss 1.9598): 81%|ββββββββ | 760/938 [05:18<01:14, 2.39it/s]
Training 1/1 epoch (loss 1.9598): 81%|ββββββββ | 761/938 [05:18<01:09, 2.53it/s]
Training 1/1 epoch (loss 1.7128): 81%|ββββββββ | 761/938 [05:18<01:09, 2.53it/s]
Training 1/1 epoch (loss 1.7128): 81%|ββββββββ | 762/938 [05:18<01:09, 2.54it/s]
Training 1/1 epoch (loss 1.8537): 81%|ββββββββ | 762/938 [05:19<01:09, 2.54it/s]
Training 1/1 epoch (loss 1.8537): 81%|βββββββββ | 763/938 [05:19<01:07, 2.58it/s]
Training 1/1 epoch (loss 1.6021): 81%|βββββββββ | 763/938 [05:19<01:07, 2.58it/s]
Training 1/1 epoch (loss 1.6021): 81%|βββββββββ | 764/938 [05:19<01:05, 2.67it/s]
Training 1/1 epoch (loss 1.8614): 81%|βββββββββ | 764/938 [05:19<01:05, 2.67it/s]
Training 1/1 epoch (loss 1.8614): 82%|βββββββββ | 765/938 [05:19<01:04, 2.69it/s]
Training 1/1 epoch (loss 1.7819): 82%|βββββββββ | 765/938 [05:20<01:04, 2.69it/s]
Training 1/1 epoch (loss 1.7819): 82%|βββββββββ | 766/938 [05:20<01:04, 2.66it/s]
Training 1/1 epoch (loss 1.8829): 82%|βββββββββ | 766/938 [05:20<01:04, 2.66it/s]
Training 1/1 epoch (loss 1.8829): 82%|βββββββββ | 767/938 [05:20<01:05, 2.60it/s]
Training 1/1 epoch (loss 1.7464): 82%|βββββββββ | 767/938 [05:21<01:05, 2.60it/s]
Training 1/1 epoch (loss 1.7464): 82%|βββββββββ | 768/938 [05:21<01:04, 2.63it/s]
Training 1/1 epoch (loss 1.9586): 82%|βββββββββ | 768/938 [05:21<01:04, 2.63it/s]
Training 1/1 epoch (loss 1.9586): 82%|βββββββββ | 769/938 [05:21<01:05, 2.60it/s]
Training 1/1 epoch (loss 1.8062): 82%|βββββββββ | 769/938 [05:21<01:05, 2.60it/s]
Training 1/1 epoch (loss 1.8062): 82%|βββββββββ | 770/938 [05:21<01:07, 2.48it/s]
Training 1/1 epoch (loss 1.6327): 82%|βββββββββ | 770/938 [05:22<01:07, 2.48it/s]
Training 1/1 epoch (loss 1.6327): 82%|βββββββββ | 771/938 [05:22<01:06, 2.52it/s]
Training 1/1 epoch (loss 1.7138): 82%|βββββββββ | 771/938 [05:22<01:06, 2.52it/s]
Training 1/1 epoch (loss 1.7138): 82%|βββββββββ | 772/938 [05:22<01:04, 2.56it/s]
Training 1/1 epoch (loss 1.7044): 82%|βββββββββ | 772/938 [05:23<01:04, 2.56it/s]
Training 1/1 epoch (loss 1.7044): 82%|βββββββββ | 773/938 [05:23<01:02, 2.63it/s]
Training 1/1 epoch (loss 1.6410): 82%|βββββββββ | 773/938 [05:23<01:02, 2.63it/s]
Training 1/1 epoch (loss 1.6410): 83%|βββββββββ | 774/938 [05:23<01:03, 2.59it/s]
Training 1/1 epoch (loss 1.7410): 83%|βββββββββ | 774/938 [05:23<01:03, 2.59it/s]
Training 1/1 epoch (loss 1.7410): 83%|βββββββββ | 775/938 [05:23<01:03, 2.57it/s]
Training 1/1 epoch (loss 1.8240): 83%|βββββββββ | 775/938 [05:24<01:03, 2.57it/s]
Training 1/1 epoch (loss 1.8240): 83%|βββββββββ | 776/938 [05:24<01:11, 2.27it/s]
Training 1/1 epoch (loss 1.8468): 83%|βββββββββ | 776/938 [05:24<01:11, 2.27it/s]
Training 1/1 epoch (loss 1.8468): 83%|βββββββββ | 777/938 [05:24<01:11, 2.26it/s]
Training 1/1 epoch (loss 1.7189): 83%|βββββββββ | 777/938 [05:25<01:11, 2.26it/s]
Training 1/1 epoch (loss 1.7189): 83%|βββββββββ | 778/938 [05:25<01:06, 2.40it/s]
Training 1/1 epoch (loss 1.6140): 83%|βββββββββ | 778/938 [05:25<01:06, 2.40it/s]
Training 1/1 epoch (loss 1.6140): 83%|βββββββββ | 779/938 [05:25<01:07, 2.35it/s]
Training 1/1 epoch (loss 1.8544): 83%|βββββββββ | 779/938 [05:26<01:07, 2.35it/s]
Training 1/1 epoch (loss 1.8544): 83%|βββββββββ | 780/938 [05:26<01:07, 2.35it/s]
Training 1/1 epoch (loss 1.8179): 83%|βββββββββ | 780/938 [05:26<01:07, 2.35it/s]
Training 1/1 epoch (loss 1.8179): 83%|βββββββββ | 781/938 [05:26<01:04, 2.42it/s]
Training 1/1 epoch (loss 1.7204): 83%|βββββββββ | 781/938 [05:26<01:04, 2.42it/s]
Training 1/1 epoch (loss 1.7204): 83%|βββββββββ | 782/938 [05:26<01:03, 2.48it/s]
Training 1/1 epoch (loss 1.7963): 83%|βββββββββ | 782/938 [05:27<01:03, 2.48it/s]
Training 1/1 epoch (loss 1.7963): 83%|βββββββββ | 783/938 [05:27<01:01, 2.51it/s]
Training 1/1 epoch (loss 1.7817): 83%|βββββββββ | 783/938 [05:27<01:01, 2.51it/s]
Training 1/1 epoch (loss 1.7817): 84%|βββββββββ | 784/938 [05:27<01:00, 2.54it/s]
Training 1/1 epoch (loss 1.7286): 84%|βββββββββ | 784/938 [05:27<01:00, 2.54it/s]
Training 1/1 epoch (loss 1.7286): 84%|βββββββββ | 785/938 [05:27<00:59, 2.57it/s]
Training 1/1 epoch (loss 1.7685): 84%|βββββββββ | 785/938 [05:28<00:59, 2.57it/s]
Training 1/1 epoch (loss 1.7685): 84%|βββββββββ | 786/938 [05:28<01:01, 2.48it/s]
Training 1/1 epoch (loss 1.7540): 84%|βββββββββ | 786/938 [05:28<01:01, 2.48it/s]
Training 1/1 epoch (loss 1.7540): 84%|βββββββββ | 787/938 [05:28<01:01, 2.47it/s]
Training 1/1 epoch (loss 1.8726): 84%|βββββββββ | 787/938 [05:29<01:01, 2.47it/s]
Training 1/1 epoch (loss 1.8726): 84%|βββββββββ | 788/938 [05:29<01:00, 2.49it/s]
Training 1/1 epoch (loss 1.6244): 84%|βββββββββ | 788/938 [05:29<01:00, 2.49it/s]
Training 1/1 epoch (loss 1.6244): 84%|βββββββββ | 789/938 [05:29<01:00, 2.48it/s]
Training 1/1 epoch (loss 1.6154): 84%|βββββββββ | 789/938 [05:30<01:00, 2.48it/s]
Training 1/1 epoch (loss 1.6154): 84%|βββββββββ | 790/938 [05:30<01:06, 2.22it/s]
Training 1/1 epoch (loss 1.8052): 84%|βββββββββ | 790/938 [05:30<01:06, 2.22it/s]
Training 1/1 epoch (loss 1.8052): 84%|βββββββββ | 791/938 [05:30<01:02, 2.34it/s]
Training 1/1 epoch (loss 1.6999): 84%|βββββββββ | 791/938 [05:30<01:02, 2.34it/s]
Training 1/1 epoch (loss 1.6999): 84%|βββββββββ | 792/938 [05:30<01:02, 2.35it/s]
Training 1/1 epoch (loss 1.8149): 84%|βββββββββ | 792/938 [05:31<01:02, 2.35it/s]
Training 1/1 epoch (loss 1.8149): 85%|βββββββββ | 793/938 [05:31<00:59, 2.44it/s]
Training 1/1 epoch (loss 1.8890): 85%|βββββββββ | 793/938 [05:31<00:59, 2.44it/s]
Training 1/1 epoch (loss 1.8890): 85%|βββββββββ | 794/938 [05:31<00:55, 2.57it/s]
Training 1/1 epoch (loss 1.7094): 85%|βββββββββ | 794/938 [05:32<00:55, 2.57it/s]
Training 1/1 epoch (loss 1.7094): 85%|βββββββββ | 795/938 [05:32<00:56, 2.53it/s]
Training 1/1 epoch (loss 1.7627): 85%|βββββββββ | 795/938 [05:32<00:56, 2.53it/s]
Training 1/1 epoch (loss 1.7627): 85%|βββββββββ | 796/938 [05:32<00:59, 2.39it/s]
Training 1/1 epoch (loss 1.7221): 85%|βββββββββ | 796/938 [05:32<00:59, 2.39it/s]
Training 1/1 epoch (loss 1.7221): 85%|βββββββββ | 797/938 [05:32<00:55, 2.53it/s]
Training 1/1 epoch (loss 1.8568): 85%|βββββββββ | 797/938 [05:33<00:55, 2.53it/s]
Training 1/1 epoch (loss 1.8568): 85%|βββββββββ | 798/938 [05:33<00:56, 2.48it/s]
Training 1/1 epoch (loss 1.8190): 85%|βββββββββ | 798/938 [05:33<00:56, 2.48it/s]
Training 1/1 epoch (loss 1.8190): 85%|βββββββββ | 799/938 [05:33<00:53, 2.60it/s]
Training 1/1 epoch (loss 1.7564): 85%|βββββββββ | 799/938 [05:34<00:53, 2.60it/s]
Training 1/1 epoch (loss 1.7564): 85%|βββββββββ | 800/938 [05:34<00:54, 2.53it/s]
Training 1/1 epoch (loss 1.8411): 85%|βββββββββ | 800/938 [05:34<00:54, 2.53it/s]
Training 1/1 epoch (loss 1.8411): 85%|βββββββββ | 801/938 [05:34<00:52, 2.61it/s]
Training 1/1 epoch (loss 1.8774): 85%|βββββββββ | 801/938 [05:34<00:52, 2.61it/s]
Training 1/1 epoch (loss 1.8774): 86%|βββββββββ | 802/938 [05:34<00:55, 2.44it/s]
Training 1/1 epoch (loss 1.7558): 86%|βββββββββ | 802/938 [05:35<00:55, 2.44it/s]
Training 1/1 epoch (loss 1.7558): 86%|βββββββββ | 803/938 [05:35<00:55, 2.44it/s]
Training 1/1 epoch (loss 1.7190): 86%|βββββββββ | 803/938 [05:35<00:55, 2.44it/s]
Training 1/1 epoch (loss 1.7190): 86%|βββββββββ | 804/938 [05:35<00:55, 2.42it/s]
Training 1/1 epoch (loss 1.8004): 86%|βββββββββ | 804/938 [05:36<00:55, 2.42it/s]
Training 1/1 epoch (loss 1.8004): 86%|βββββββββ | 805/938 [05:36<00:51, 2.58it/s]
Training 1/1 epoch (loss 1.7891): 86%|βββββββββ | 805/938 [05:36<00:51, 2.58it/s]
Training 1/1 epoch (loss 1.7891): 86%|βββββββββ | 806/938 [05:36<00:51, 2.56it/s]
Training 1/1 epoch (loss 1.7212): 86%|βββββββββ | 806/938 [05:36<00:51, 2.56it/s]
Training 1/1 epoch (loss 1.7212): 86%|βββββββββ | 807/938 [05:36<00:51, 2.55it/s]
Training 1/1 epoch (loss 1.8408): 86%|βββββββββ | 807/938 [05:37<00:51, 2.55it/s]
Training 1/1 epoch (loss 1.8408): 86%|βββββββββ | 808/938 [05:37<00:49, 2.63it/s]
Training 1/1 epoch (loss 1.6715): 86%|βββββββββ | 808/938 [05:37<00:49, 2.63it/s]
Training 1/1 epoch (loss 1.6715): 86%|βββββββββ | 809/938 [05:37<00:50, 2.56it/s]
Training 1/1 epoch (loss 1.8022): 86%|βββββββββ | 809/938 [05:38<00:50, 2.56it/s]
Training 1/1 epoch (loss 1.8022): 86%|βββββββββ | 810/938 [05:38<00:49, 2.58it/s]
Training 1/1 epoch (loss 1.6514): 86%|βββββββββ | 810/938 [05:38<00:49, 2.58it/s]
Training 1/1 epoch (loss 1.6514): 86%|βββββββββ | 811/938 [05:38<00:50, 2.50it/s]
Training 1/1 epoch (loss 1.7558): 86%|βββββββββ | 811/938 [05:38<00:50, 2.50it/s]
Training 1/1 epoch (loss 1.7558): 87%|βββββββββ | 812/938 [05:38<00:47, 2.65it/s]
Training 1/1 epoch (loss 1.7892): 87%|βββββββββ | 812/938 [05:39<00:47, 2.65it/s]
Training 1/1 epoch (loss 1.7892): 87%|βββββββββ | 813/938 [05:39<00:46, 2.72it/s]
Training 1/1 epoch (loss 1.7492): 87%|βββββββββ | 813/938 [05:39<00:46, 2.72it/s]
Training 1/1 epoch (loss 1.7492): 87%|βββββββββ | 814/938 [05:39<00:45, 2.71it/s]
Training 1/1 epoch (loss 1.7564): 87%|βββββββββ | 814/938 [05:39<00:45, 2.71it/s]
Training 1/1 epoch (loss 1.7564): 87%|βββββββββ | 815/938 [05:39<00:46, 2.67it/s]
Training 1/1 epoch (loss 1.8272): 87%|βββββββββ | 815/938 [05:40<00:46, 2.67it/s]
Training 1/1 epoch (loss 1.8272): 87%|βββββββββ | 816/938 [05:40<00:47, 2.57it/s]
Training 1/1 epoch (loss 1.8149): 87%|βββββββββ | 816/938 [05:40<00:47, 2.57it/s]
Training 1/1 epoch (loss 1.8149): 87%|βββββββββ | 817/938 [05:40<00:48, 2.51it/s]
Training 1/1 epoch (loss 1.6836): 87%|βββββββββ | 817/938 [05:41<00:48, 2.51it/s]
Training 1/1 epoch (loss 1.6836): 87%|βββββββββ | 818/938 [05:41<00:46, 2.59it/s]
Training 1/1 epoch (loss 1.9632): 87%|βββββββββ | 818/938 [05:41<00:46, 2.59it/s]
Training 1/1 epoch (loss 1.9632): 87%|βββββββββ | 819/938 [05:41<00:44, 2.65it/s]
Training 1/1 epoch (loss 1.6670): 87%|βββββββββ | 819/938 [05:41<00:44, 2.65it/s]
Training 1/1 epoch (loss 1.6670): 87%|βββββββββ | 820/938 [05:41<00:43, 2.74it/s]
Training 1/1 epoch (loss 1.8066): 87%|βββββββββ | 820/938 [05:42<00:43, 2.74it/s]
Training 1/1 epoch (loss 1.8066): 88%|βββββββββ | 821/938 [05:42<00:43, 2.67it/s]
Training 1/1 epoch (loss 1.9611): 88%|βββββββββ | 821/938 [05:42<00:43, 2.67it/s]
Training 1/1 epoch (loss 1.9611): 88%|βββββββββ | 822/938 [05:42<00:43, 2.64it/s]
Training 1/1 epoch (loss 1.7414): 88%|βββββββββ | 822/938 [05:42<00:43, 2.64it/s]
Training 1/1 epoch (loss 1.7414): 88%|βββββββββ | 823/938 [05:42<00:44, 2.58it/s]
Training 1/1 epoch (loss 1.7383): 88%|βββββββββ | 823/938 [05:43<00:44, 2.58it/s]
Training 1/1 epoch (loss 1.7383): 88%|βββββββββ | 824/938 [05:43<00:45, 2.48it/s]
Training 1/1 epoch (loss 1.6793): 88%|βββββββββ | 824/938 [05:43<00:45, 2.48it/s]
Training 1/1 epoch (loss 1.6793): 88%|βββββββββ | 825/938 [05:43<00:44, 2.56it/s]
Training 1/1 epoch (loss 1.8439): 88%|βββββββββ | 825/938 [05:44<00:44, 2.56it/s]
Training 1/1 epoch (loss 1.8439): 88%|βββββββββ | 826/938 [05:44<00:43, 2.55it/s]
Training 1/1 epoch (loss 1.7987): 88%|βββββββββ | 826/938 [05:44<00:43, 2.55it/s]
Training 1/1 epoch (loss 1.7987): 88%|βββββββββ | 827/938 [05:44<00:41, 2.69it/s]
Training 1/1 epoch (loss 1.8889): 88%|βββββββββ | 827/938 [05:44<00:41, 2.69it/s]
Training 1/1 epoch (loss 1.8889): 88%|βββββββββ | 828/938 [05:44<00:41, 2.64it/s]
Training 1/1 epoch (loss 1.7409): 88%|βββββββββ | 828/938 [05:45<00:41, 2.64it/s]
Training 1/1 epoch (loss 1.7409): 88%|βββββββββ | 829/938 [05:45<00:44, 2.43it/s]
Training 1/1 epoch (loss 1.8424): 88%|βββββββββ | 829/938 [05:45<00:44, 2.43it/s]
Training 1/1 epoch (loss 1.8424): 88%|βββββββββ | 830/938 [05:45<00:44, 2.45it/s]
Training 1/1 epoch (loss 1.8842): 88%|βββββββββ | 830/938 [05:46<00:44, 2.45it/s]
Training 1/1 epoch (loss 1.8842): 89%|βββββββββ | 831/938 [05:46<00:43, 2.47it/s]
Training 1/1 epoch (loss 1.7711): 89%|βββββββββ | 831/938 [05:46<00:43, 2.47it/s]
Training 1/1 epoch (loss 1.7711): 89%|βββββββββ | 832/938 [05:46<00:42, 2.48it/s]
Training 1/1 epoch (loss 1.7423): 89%|βββββββββ | 832/938 [05:47<00:42, 2.48it/s]
Training 1/1 epoch (loss 1.7423): 89%|βββββββββ | 833/938 [05:47<00:43, 2.43it/s]
Training 1/1 epoch (loss 1.7761): 89%|βββββββββ | 833/938 [05:47<00:43, 2.43it/s]
Training 1/1 epoch (loss 1.7761): 89%|βββββββββ | 834/938 [05:47<00:40, 2.56it/s]
Training 1/1 epoch (loss 1.7496): 89%|βββββββββ | 834/938 [05:47<00:40, 2.56it/s]
Training 1/1 epoch (loss 1.7496): 89%|βββββββββ | 835/938 [05:47<00:38, 2.64it/s]
Training 1/1 epoch (loss 1.6326): 89%|βββββββββ | 835/938 [05:48<00:38, 2.64it/s]
Training 1/1 epoch (loss 1.6326): 89%|βββββββββ | 836/938 [05:48<00:39, 2.61it/s]
Training 1/1 epoch (loss 1.9417): 89%|βββββββββ | 836/938 [05:48<00:39, 2.61it/s]
Training 1/1 epoch (loss 1.9417): 89%|βββββββββ | 837/938 [05:48<00:36, 2.74it/s]
Training 1/1 epoch (loss 1.7552): 89%|βββββββββ | 837/938 [05:48<00:36, 2.74it/s]
Training 1/1 epoch (loss 1.7552): 89%|βββββββββ | 838/938 [05:48<00:37, 2.69it/s]
Training 1/1 epoch (loss 1.6731): 89%|βββββββββ | 838/938 [05:49<00:37, 2.69it/s]
Training 1/1 epoch (loss 1.6731): 89%|βββββββββ | 839/938 [05:49<00:36, 2.74it/s]
Training 1/1 epoch (loss 1.6738): 89%|βββββββββ | 839/938 [05:49<00:36, 2.74it/s]
Training 1/1 epoch (loss 1.6738): 90%|βββββββββ | 840/938 [05:49<00:35, 2.76it/s]
Training 1/1 epoch (loss 1.8437): 90%|βββββββββ | 840/938 [05:49<00:35, 2.76it/s]
Training 1/1 epoch (loss 1.8437): 90%|βββββββββ | 841/938 [05:49<00:35, 2.75it/s]
Training 1/1 epoch (loss 1.7364): 90%|βββββββββ | 841/938 [05:50<00:35, 2.75it/s]
Training 1/1 epoch (loss 1.7364): 90%|βββββββββ | 842/938 [05:50<00:36, 2.66it/s]
Training 1/1 epoch (loss 1.8672): 90%|βββββββββ | 842/938 [05:50<00:36, 2.66it/s]
Training 1/1 epoch (loss 1.8672): 90%|βββββββββ | 843/938 [05:50<00:39, 2.38it/s]
Training 1/1 epoch (loss 1.7398): 90%|βββββββββ | 843/938 [05:51<00:39, 2.38it/s]
Training 1/1 epoch (loss 1.7398): 90%|βββββββββ | 844/938 [05:51<00:38, 2.44it/s]
Training 1/1 epoch (loss 1.9123): 90%|βββββββββ | 844/938 [05:51<00:38, 2.44it/s]
Training 1/1 epoch (loss 1.9123): 90%|βββββββββ | 845/938 [05:51<00:36, 2.56it/s]
Training 1/1 epoch (loss 1.8992): 90%|βββββββββ | 845/938 [05:51<00:36, 2.56it/s]
Training 1/1 epoch (loss 1.8992): 90%|βββββββββ | 846/938 [05:51<00:36, 2.53it/s]
Training 1/1 epoch (loss 1.7776): 90%|βββββββββ | 846/938 [05:52<00:36, 2.53it/s]
Training 1/1 epoch (loss 1.7776): 90%|βββββββββ | 847/938 [05:52<00:35, 2.57it/s]
Training 1/1 epoch (loss 1.6970): 90%|βββββββββ | 847/938 [05:52<00:35, 2.57it/s]
Training 1/1 epoch (loss 1.6970): 90%|βββββββββ | 848/938 [05:52<00:35, 2.55it/s]
Training 1/1 epoch (loss 1.8049): 90%|βββββββββ | 848/938 [05:53<00:35, 2.55it/s]
Training 1/1 epoch (loss 1.8049): 91%|βββββββββ | 849/938 [05:53<00:36, 2.47it/s]
Training 1/1 epoch (loss 1.6701): 91%|βββββββββ | 849/938 [05:53<00:36, 2.47it/s]
Training 1/1 epoch (loss 1.6701): 91%|βββββββββ | 850/938 [05:53<00:34, 2.55it/s]
Training 1/1 epoch (loss 1.8959): 91%|βββββββββ | 850/938 [05:53<00:34, 2.55it/s]
Training 1/1 epoch (loss 1.8959): 91%|βββββββββ | 851/938 [05:53<00:33, 2.61it/s]
Training 1/1 epoch (loss 1.8638): 91%|βββββββββ | 851/938 [05:54<00:33, 2.61it/s]
Training 1/1 epoch (loss 1.8638): 91%|βββββββββ | 852/938 [05:54<00:34, 2.48it/s]
Training 1/1 epoch (loss 1.8371): 91%|βββββββββ | 852/938 [05:54<00:34, 2.48it/s]
Training 1/1 epoch (loss 1.8371): 91%|βββββββββ | 853/938 [05:54<00:35, 2.36it/s]
Training 1/1 epoch (loss 1.7596): 91%|βββββββββ | 853/938 [05:55<00:35, 2.36it/s]
Training 1/1 epoch (loss 1.7596): 91%|βββββββββ | 854/938 [05:55<00:34, 2.43it/s]
Training 1/1 epoch (loss 1.8294): 91%|βββββββββ | 854/938 [05:55<00:34, 2.43it/s]
Training 1/1 epoch (loss 1.8294): 91%|βββββββββ | 855/938 [05:55<00:33, 2.45it/s]
Training 1/1 epoch (loss 1.7713): 91%|βββββββββ | 855/938 [05:56<00:33, 2.45it/s]
Training 1/1 epoch (loss 1.7713): 91%|ββββββββββ| 856/938 [05:56<00:35, 2.32it/s]
Training 1/1 epoch (loss 1.8513): 91%|ββββββββββ| 856/938 [05:56<00:35, 2.32it/s]
Training 1/1 epoch (loss 1.8513): 91%|ββββββββββ| 857/938 [05:56<00:33, 2.41it/s]
Training 1/1 epoch (loss 1.8562): 91%|ββββββββββ| 857/938 [05:56<00:33, 2.41it/s]
Training 1/1 epoch (loss 1.8562): 91%|ββββββββββ| 858/938 [05:56<00:32, 2.43it/s]
Training 1/1 epoch (loss 1.7970): 91%|ββββββββββ| 858/938 [05:57<00:32, 2.43it/s]
Training 1/1 epoch (loss 1.7970): 92%|ββββββββββ| 859/938 [05:57<00:32, 2.41it/s]
Training 1/1 epoch (loss 1.6890): 92%|ββββββββββ| 859/938 [05:57<00:32, 2.41it/s]
Training 1/1 epoch (loss 1.6890): 92%|ββββββββββ| 860/938 [05:57<00:30, 2.55it/s]
Training 1/1 epoch (loss 1.7829): 92%|ββββββββββ| 860/938 [05:57<00:30, 2.55it/s]
Training 1/1 epoch (loss 1.7829): 92%|ββββββββββ| 861/938 [05:57<00:30, 2.56it/s]
Training 1/1 epoch (loss 1.8923): 92%|ββββββββββ| 861/938 [05:58<00:30, 2.56it/s]
Training 1/1 epoch (loss 1.8923): 92%|ββββββββββ| 862/938 [05:58<00:28, 2.64it/s]
Training 1/1 epoch (loss 1.9231): 92%|ββββββββββ| 862/938 [05:58<00:28, 2.64it/s]
Training 1/1 epoch (loss 1.9231): 92%|ββββββββββ| 863/938 [05:58<00:27, 2.72it/s]
Training 1/1 epoch (loss 1.7840): 92%|ββββββββββ| 863/938 [05:59<00:27, 2.72it/s]
Training 1/1 epoch (loss 1.7840): 92%|ββββββββββ| 864/938 [05:59<00:28, 2.63it/s]
Training 1/1 epoch (loss 1.8900): 92%|ββββββββββ| 864/938 [05:59<00:28, 2.63it/s]
Training 1/1 epoch (loss 1.8900): 92%|ββββββββββ| 865/938 [05:59<00:27, 2.70it/s]
Training 1/1 epoch (loss 1.6919): 92%|ββββββββββ| 865/938 [05:59<00:27, 2.70it/s]
Training 1/1 epoch (loss 1.6919): 92%|ββββββββββ| 866/938 [05:59<00:27, 2.65it/s]
Training 1/1 epoch (loss 1.5714): 92%|ββββββββββ| 866/938 [06:00<00:27, 2.65it/s]
Training 1/1 epoch (loss 1.5714): 92%|ββββββββββ| 867/938 [06:00<00:28, 2.48it/s]
Training 1/1 epoch (loss 1.7632): 92%|ββββββββββ| 867/938 [06:00<00:28, 2.48it/s]
Training 1/1 epoch (loss 1.7632): 93%|ββββββββββ| 868/938 [06:00<00:28, 2.47it/s]
Training 1/1 epoch (loss 1.7563): 93%|ββββββββββ| 868/938 [06:01<00:28, 2.47it/s]
Training 1/1 epoch (loss 1.7563): 93%|ββββββββββ| 869/938 [06:01<00:28, 2.41it/s]
Training 1/1 epoch (loss 1.8699): 93%|ββββββββββ| 869/938 [06:01<00:28, 2.41it/s]
Training 1/1 epoch (loss 1.8699): 93%|ββββββββββ| 870/938 [06:01<00:27, 2.46it/s]
Training 1/1 epoch (loss 1.7150): 93%|ββββββββββ| 870/938 [06:01<00:27, 2.46it/s]
Training 1/1 epoch (loss 1.7150): 93%|ββββββββββ| 871/938 [06:01<00:26, 2.51it/s]
Training 1/1 epoch (loss 1.8086): 93%|ββββββββββ| 871/938 [06:02<00:26, 2.51it/s]
Training 1/1 epoch (loss 1.8086): 93%|ββββββββββ| 872/938 [06:02<00:26, 2.50it/s]
Training 1/1 epoch (loss 1.9054): 93%|ββββββββββ| 872/938 [06:02<00:26, 2.50it/s]
Training 1/1 epoch (loss 1.9054): 93%|ββββββββββ| 873/938 [06:02<00:25, 2.59it/s]
Training 1/1 epoch (loss 1.8263): 93%|ββββββββββ| 873/938 [06:03<00:25, 2.59it/s]
Training 1/1 epoch (loss 1.8263): 93%|ββββββββββ| 874/938 [06:03<00:24, 2.59it/s]
Training 1/1 epoch (loss 1.8884): 93%|ββββββββββ| 874/938 [06:03<00:24, 2.59it/s]
Training 1/1 epoch (loss 1.8884): 93%|ββββββββββ| 875/938 [06:03<00:25, 2.51it/s]
Training 1/1 epoch (loss 1.7388): 93%|ββββββββββ| 875/938 [06:03<00:25, 2.51it/s]
Training 1/1 epoch (loss 1.7388): 93%|ββββββββββ| 876/938 [06:03<00:24, 2.49it/s]
Training 1/1 epoch (loss 1.7422): 93%|ββββββββββ| 876/938 [06:04<00:24, 2.49it/s]
Training 1/1 epoch (loss 1.7422): 93%|ββββββββββ| 877/938 [06:04<00:25, 2.41it/s]
Training 1/1 epoch (loss 1.7161): 93%|ββββββββββ| 877/938 [06:04<00:25, 2.41it/s]
Training 1/1 epoch (loss 1.7161): 94%|ββββββββββ| 878/938 [06:04<00:28, 2.11it/s]
Training 1/1 epoch (loss 1.7426): 94%|ββββββββββ| 878/938 [06:05<00:28, 2.11it/s]
Training 1/1 epoch (loss 1.7426): 94%|ββββββββββ| 879/938 [06:05<00:28, 2.06it/s]
Training 1/1 epoch (loss 1.9577): 94%|ββββββββββ| 879/938 [06:05<00:28, 2.06it/s]
Training 1/1 epoch (loss 1.9577): 94%|ββββββββββ| 880/938 [06:05<00:26, 2.20it/s]
Training 1/1 epoch (loss 1.7794): 94%|ββββββββββ| 880/938 [06:06<00:26, 2.20it/s]
Training 1/1 epoch (loss 1.7794): 94%|ββββββββββ| 881/938 [06:06<00:26, 2.19it/s]
Training 1/1 epoch (loss 1.8031): 94%|ββββββββββ| 881/938 [06:06<00:26, 2.19it/s]
Training 1/1 epoch (loss 1.8031): 94%|ββββββββββ| 882/938 [06:06<00:24, 2.29it/s]
Training 1/1 epoch (loss 1.8970): 94%|ββββββββββ| 882/938 [06:07<00:24, 2.29it/s]
Training 1/1 epoch (loss 1.8970): 94%|ββββββββββ| 883/938 [06:07<00:25, 2.13it/s]
Training 1/1 epoch (loss 1.9051): 94%|ββββββββββ| 883/938 [06:07<00:25, 2.13it/s]
Training 1/1 epoch (loss 1.9051): 94%|ββββββββββ| 884/938 [06:07<00:23, 2.25it/s]
Training 1/1 epoch (loss 1.8238): 94%|ββββββββββ| 884/938 [06:08<00:23, 2.25it/s]
Training 1/1 epoch (loss 1.8238): 94%|ββββββββββ| 885/938 [06:08<00:26, 2.00it/s]
Training 1/1 epoch (loss 1.7562): 94%|ββββββββββ| 885/938 [06:08<00:26, 2.00it/s]
Training 1/1 epoch (loss 1.7562): 94%|ββββββββββ| 886/938 [06:08<00:23, 2.17it/s]
Training 1/1 epoch (loss 1.7998): 94%|ββββββββββ| 886/938 [06:08<00:23, 2.17it/s]
Training 1/1 epoch (loss 1.7998): 95%|ββββββββββ| 887/938 [06:08<00:21, 2.33it/s]
Training 1/1 epoch (loss 1.7576): 95%|ββββββββββ| 887/938 [06:09<00:21, 2.33it/s]
Training 1/1 epoch (loss 1.7576): 95%|ββββββββββ| 888/938 [06:09<00:20, 2.42it/s]
Training 1/1 epoch (loss 1.8177): 95%|ββββββββββ| 888/938 [06:09<00:20, 2.42it/s]
Training 1/1 epoch (loss 1.8177): 95%|ββββββββββ| 889/938 [06:09<00:20, 2.45it/s]
Training 1/1 epoch (loss 1.6451): 95%|ββββββββββ| 889/938 [06:10<00:20, 2.45it/s]
Training 1/1 epoch (loss 1.6451): 95%|ββββββββββ| 890/938 [06:10<00:19, 2.46it/s]
Training 1/1 epoch (loss 1.8047): 95%|ββββββββββ| 890/938 [06:10<00:19, 2.46it/s]
Training 1/1 epoch (loss 1.8047): 95%|ββββββββββ| 891/938 [06:10<00:18, 2.49it/s]
Training 1/1 epoch (loss 1.7994): 95%|ββββββββββ| 891/938 [06:10<00:18, 2.49it/s]
Training 1/1 epoch (loss 1.7994): 95%|ββββββββββ| 892/938 [06:10<00:17, 2.58it/s]
Training 1/1 epoch (loss 1.9425): 95%|ββββββββββ| 892/938 [06:11<00:17, 2.58it/s]
Training 1/1 epoch (loss 1.9425): 95%|ββββββββββ| 893/938 [06:11<00:16, 2.70it/s]
Training 1/1 epoch (loss 1.7254): 95%|ββββββββββ| 893/938 [06:11<00:16, 2.70it/s]
Training 1/1 epoch (loss 1.7254): 95%|ββββββββββ| 894/938 [06:11<00:17, 2.48it/s]
Training 1/1 epoch (loss 1.7886): 95%|ββββββββββ| 894/938 [06:12<00:17, 2.48it/s]
Training 1/1 epoch (loss 1.7886): 95%|ββββββββββ| 895/938 [06:12<00:17, 2.45it/s]
Training 1/1 epoch (loss 1.7447): 95%|ββββββββββ| 895/938 [06:12<00:17, 2.45it/s]
Training 1/1 epoch (loss 1.7447): 96%|ββββββββββ| 896/938 [06:12<00:16, 2.53it/s]
Training 1/1 epoch (loss 1.6706): 96%|ββββββββββ| 896/938 [06:12<00:16, 2.53it/s]
Training 1/1 epoch (loss 1.6706): 96%|ββββββββββ| 897/938 [06:12<00:16, 2.51it/s]
Training 1/1 epoch (loss 1.7704): 96%|ββββββββββ| 897/938 [06:13<00:16, 2.51it/s]
Training 1/1 epoch (loss 1.7704): 96%|ββββββββββ| 898/938 [06:13<00:15, 2.54it/s]
Training 1/1 epoch (loss 1.7929): 96%|ββββββββββ| 898/938 [06:13<00:15, 2.54it/s]
Training 1/1 epoch (loss 1.7929): 96%|ββββββββββ| 899/938 [06:13<00:17, 2.20it/s]
Training 1/1 epoch (loss 1.7835): 96%|ββββββββββ| 899/938 [06:14<00:17, 2.20it/s]
Training 1/1 epoch (loss 1.7835): 96%|ββββββββββ| 900/938 [06:14<00:18, 2.08it/s]
Training 1/1 epoch (loss 1.6787): 96%|ββββββββββ| 900/938 [06:14<00:18, 2.08it/s]
Training 1/1 epoch (loss 1.6787): 96%|ββββββββββ| 901/938 [06:14<00:16, 2.28it/s]
Training 1/1 epoch (loss 1.6417): 96%|ββββββββββ| 901/938 [06:15<00:16, 2.28it/s]
Training 1/1 epoch (loss 1.6417): 96%|ββββββββββ| 902/938 [06:15<00:16, 2.25it/s]
Training 1/1 epoch (loss 1.7719): 96%|ββββββββββ| 902/938 [06:15<00:16, 2.25it/s]
Training 1/1 epoch (loss 1.7719): 96%|ββββββββββ| 903/938 [06:15<00:16, 2.15it/s]
Training 1/1 epoch (loss 1.8317): 96%|ββββββββββ| 903/938 [06:16<00:16, 2.15it/s]
Training 1/1 epoch (loss 1.8317): 96%|ββββββββββ| 904/938 [06:16<00:15, 2.19it/s]
Training 1/1 epoch (loss 1.8734): 96%|ββββββββββ| 904/938 [06:16<00:15, 2.19it/s]
Training 1/1 epoch (loss 1.8734): 96%|ββββββββββ| 905/938 [06:16<00:15, 2.19it/s]
Training 1/1 epoch (loss 1.8100): 96%|ββββββββββ| 905/938 [06:17<00:15, 2.19it/s]
Training 1/1 epoch (loss 1.8100): 97%|ββββββββββ| 906/938 [06:17<00:15, 2.08it/s]
Training 1/1 epoch (loss 1.6960): 97%|ββββββββββ| 906/938 [06:17<00:15, 2.08it/s]
Training 1/1 epoch (loss 1.6960): 97%|ββββββββββ| 907/938 [06:17<00:14, 2.14it/s]
Training 1/1 epoch (loss 1.7494): 97%|ββββββββββ| 907/938 [06:18<00:14, 2.14it/s]
Training 1/1 epoch (loss 1.7494): 97%|ββββββββββ| 908/938 [06:18<00:13, 2.16it/s]
Training 1/1 epoch (loss 1.7398): 97%|ββββββββββ| 908/938 [06:18<00:13, 2.16it/s]
Training 1/1 epoch (loss 1.7398): 97%|ββββββββββ| 909/938 [06:18<00:13, 2.21it/s]
Training 1/1 epoch (loss 1.8397): 97%|ββββββββββ| 909/938 [06:18<00:13, 2.21it/s]
Training 1/1 epoch (loss 1.8397): 97%|ββββββββββ| 910/938 [06:18<00:11, 2.42it/s]
Training 1/1 epoch (loss 1.7759): 97%|ββββββββββ| 910/938 [06:19<00:11, 2.42it/s]
Training 1/1 epoch (loss 1.7759): 97%|ββββββββββ| 911/938 [06:19<00:10, 2.53it/s]
Training 1/1 epoch (loss 1.7842): 97%|ββββββββββ| 911/938 [06:19<00:10, 2.53it/s]
Training 1/1 epoch (loss 1.7842): 97%|ββββββββββ| 912/938 [06:19<00:09, 2.61it/s]
Training 1/1 epoch (loss 1.8190): 97%|ββββββββββ| 912/938 [06:19<00:09, 2.61it/s]
Training 1/1 epoch (loss 1.8190): 97%|ββββββββββ| 913/938 [06:19<00:09, 2.55it/s]
Training 1/1 epoch (loss 1.8116): 97%|ββββββββββ| 913/938 [06:20<00:09, 2.55it/s]
Training 1/1 epoch (loss 1.8116): 97%|ββββββββββ| 914/938 [06:20<00:09, 2.53it/s]
Training 1/1 epoch (loss 1.7522): 97%|ββββββββββ| 914/938 [06:20<00:09, 2.53it/s]
Training 1/1 epoch (loss 1.7522): 98%|ββββββββββ| 915/938 [06:20<00:09, 2.52it/s]
Training 1/1 epoch (loss 1.8421): 98%|ββββββββββ| 915/938 [06:21<00:09, 2.52it/s]
Training 1/1 epoch (loss 1.8421): 98%|ββββββββββ| 916/938 [06:21<00:08, 2.70it/s]
Training 1/1 epoch (loss 1.7289): 98%|ββββββββββ| 916/938 [06:21<00:08, 2.70it/s]
Training 1/1 epoch (loss 1.7289): 98%|ββββββββββ| 917/938 [06:21<00:08, 2.61it/s]
Training 1/1 epoch (loss 1.7255): 98%|ββββββββββ| 917/938 [06:21<00:08, 2.61it/s]
Training 1/1 epoch (loss 1.7255): 98%|ββββββββββ| 918/938 [06:21<00:08, 2.48it/s]
Training 1/1 epoch (loss 1.7002): 98%|ββββββββββ| 918/938 [06:22<00:08, 2.48it/s]
Training 1/1 epoch (loss 1.7002): 98%|ββββββββββ| 919/938 [06:22<00:07, 2.44it/s]
Training 1/1 epoch (loss 1.7690): 98%|ββββββββββ| 919/938 [06:22<00:07, 2.44it/s]
Training 1/1 epoch (loss 1.7690): 98%|ββββββββββ| 920/938 [06:22<00:07, 2.43it/s]
Training 1/1 epoch (loss 1.7258): 98%|ββββββββββ| 920/938 [06:23<00:07, 2.43it/s]
Training 1/1 epoch (loss 1.7258): 98%|ββββββββββ| 921/938 [06:23<00:06, 2.51it/s]
Training 1/1 epoch (loss 1.7297): 98%|ββββββββββ| 921/938 [06:23<00:06, 2.51it/s]
Training 1/1 epoch (loss 1.7297): 98%|ββββββββββ| 922/938 [06:23<00:06, 2.64it/s]
Training 1/1 epoch (loss 1.6881): 98%|ββββββββββ| 922/938 [06:23<00:06, 2.64it/s]
Training 1/1 epoch (loss 1.6881): 98%|ββββββββββ| 923/938 [06:23<00:05, 2.61it/s]
Training 1/1 epoch (loss 1.7419): 98%|ββββββββββ| 923/938 [06:24<00:05, 2.61it/s]
Training 1/1 epoch (loss 1.7419): 99%|ββββββββββ| 924/938 [06:24<00:05, 2.57it/s]
Training 1/1 epoch (loss 1.8167): 99%|ββββββββββ| 924/938 [06:24<00:05, 2.57it/s]
Training 1/1 epoch (loss 1.8167): 99%|ββββββββββ| 925/938 [06:24<00:05, 2.55it/s]
Training 1/1 epoch (loss 1.6783): 99%|ββββββββββ| 925/938 [06:25<00:05, 2.55it/s]
Training 1/1 epoch (loss 1.6783): 99%|ββββββββββ| 926/938 [06:25<00:04, 2.46it/s]
Training 1/1 epoch (loss 1.8280): 99%|ββββββββββ| 926/938 [06:25<00:04, 2.46it/s]
Training 1/1 epoch (loss 1.8280): 99%|ββββββββββ| 927/938 [06:25<00:04, 2.52it/s]
Training 1/1 epoch (loss 1.7765): 99%|ββββββββββ| 927/938 [06:25<00:04, 2.52it/s]
Training 1/1 epoch (loss 1.7765): 99%|ββββββββββ| 928/938 [06:25<00:03, 2.51it/s]
Training 1/1 epoch (loss 1.6783): 99%|ββββββββββ| 928/938 [06:26<00:03, 2.51it/s]
Training 1/1 epoch (loss 1.6783): 99%|ββββββββββ| 929/938 [06:26<00:03, 2.46it/s]
Training 1/1 epoch (loss 1.7923): 99%|ββββββββββ| 929/938 [06:26<00:03, 2.46it/s]
Training 1/1 epoch (loss 1.7923): 99%|ββββββββββ| 930/938 [06:26<00:03, 2.47it/s]
Training 1/1 epoch (loss 1.6811): 99%|ββββββββββ| 930/938 [06:27<00:03, 2.47it/s]
Training 1/1 epoch (loss 1.6811): 99%|ββββββββββ| 931/938 [06:27<00:02, 2.48it/s]
Training 1/1 epoch (loss 1.7944): 99%|ββββββββββ| 931/938 [06:27<00:02, 2.48it/s]
Training 1/1 epoch (loss 1.7944): 99%|ββββββββββ| 932/938 [06:27<00:02, 2.51it/s]
Training 1/1 epoch (loss 1.7992): 99%|ββββββββββ| 932/938 [06:27<00:02, 2.51it/s]
Training 1/1 epoch (loss 1.7992): 99%|ββββββββββ| 933/938 [06:27<00:01, 2.60it/s]
Training 1/1 epoch (loss 1.7764): 99%|ββββββββββ| 933/938 [06:28<00:01, 2.60it/s]
Training 1/1 epoch (loss 1.7764): 100%|ββββββββββ| 934/938 [06:28<00:01, 2.50it/s]
Training 1/1 epoch (loss 1.9447): 100%|ββββββββββ| 934/938 [06:28<00:01, 2.50it/s]
Training 1/1 epoch (loss 1.9447): 100%|ββββββββββ| 935/938 [06:28<00:01, 2.53it/s]
Training 1/1 epoch (loss 1.8155): 100%|ββββββββββ| 935/938 [06:29<00:01, 2.53it/s]
Training 1/1 epoch (loss 1.8155): 100%|ββββββββββ| 936/938 [06:29<00:00, 2.55it/s]
Training 1/1 epoch (loss 1.7864): 100%|ββββββββββ| 936/938 [06:29<00:00, 2.55it/s]
Training 1/1 epoch (loss 1.7864): 100%|ββββββββββ| 937/938 [06:29<00:00, 2.49it/s]
Training 1/1 epoch (loss 1.8080): 100%|ββββββββββ| 937/938 [06:29<00:00, 2.49it/s]
Training 1/1 epoch (loss 1.8080): 100%|ββββββββββ| 938/938 [06:29<00:00, 2.47it/s]
Training 1/1 epoch (loss 1.8080): 100%|ββββββββββ| 938/938 [06:29<00:00, 2.41it/s] |