Image classification using ResNet9

Epoch [0], last_lr: 0.00381, train_loss: 0.7395, val_loss: 4.0981, val_acc: 0.6237

Exception ignored in: <bound method _MultiProcessingDataLoaderIter.__del__ of <torch.utils.data.dataloader._MultiProcessingDataLoaderIter object at 0x7f6b8018d630>> Traceback (most recent call last): File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 1203, in __del__ self._shutdown_workers() File "/usr/local/lib/python3.6/dist-packages/torch/utils/data/dataloader.py", line 1177, in _shutdown_workers w.join(timeout=_utils.MP_STATUS_CHECK_INTERVAL) File "/usr/lib/python3.6/multiprocessing/process.py", line 122, in join assert self._parent_pid == os.getpid(), 'can only join a child process' AssertionError: can only join a child process Traceback (most recent call last): File "/usr/lib/python3.6/multiprocessing/queues.py", line 240, in _feed send_bytes(obj) File "/usr/lib/python3.6/multiprocessing/connection.py", line 200, in send_bytes self._send_bytes(m[offset:offset + size]) File "/usr/lib/python3.6/multiprocessing/connection.py", line 404, in _send_bytes self._send(header + buf) File "/usr/lib/python3.6/multiprocessing/connection.py", line 368, in _send n = write(self._handle, buf) OSError: [Errno 9] Bad file descriptor Traceback (most recent call last): File "/usr/lib/python3.6/multiprocessing/queues.py", line 230, in _feed close() File "/usr/lib/python3.6/multiprocessing/connection.py", line 177, in close self._close() File "/usr/lib/python3.6/multiprocessing/connection.py", line 361, in _close _close(self._handle) OSError: [Errno 9] Bad file descriptor

Epoch [1], last_lr: 0.00934, train_loss: 0.8963, val_loss: 20.2365, val_acc: 0.3823 Epoch [2], last_lr: 0.00972, train_loss: 0.7808, val_loss: 5.0843, val_acc: 0.4982 Epoch [3], last_lr: 0.00812, train_loss: 0.3994, val_loss: 0.5071, val_acc: 0.8319 Epoch [4], last_lr: 0.00556, train_loss: 0.2540, val_loss: 0.3765, val_acc: 0.8946 Epoch [5], last_lr: 0.00283, train_loss: 0.1985, val_loss: 0.7331, val_acc: 0.7945 Epoch [6], last_lr: 0.00077, train_loss: 0.1050, val_loss: 0.1670, val_acc: 0.9277 Epoch [7], last_lr: 0.00000, train_loss: 0.0783, val_loss: 0.1415, val_acc: 0.9477 CPU times: user 8.18 s, sys: 6.08 s, total: 14.3 s Wall time: 7min 25s

Can someone tell me why am I getting :point_up_2: this strange error after epoch[0].
Even though all other epochs are giving right output

Here is link to my notebook in case you need :point_right: notebook link