What’s the best approach to apply different learning rates to early layers using PyTorch?

What’s the best approach to apply different learning rates to early layers using PyTorch? How to unfreeze the last layers when using transfer learning?

1 Like

You can check out for differential learning implementations. Also fastai has a implementation too for reference. - @PrajwalPrashanth