
התחל במצב לא מקוון עם האפליקציה Player FM !
torch.nn
Manage episode 294363178 series 2921809
What goes into the implementation of torch.nn? Why do NN modules exist in the first place? What's the function of Parameter? How do modules actually track all the parameters in question? What is all of the goop in the top level NN module class? What are some new developments in torch.nn modules? What are some open problems with our modules?
Further reading:
- Implementation of nn.Module https://github.com/pytorch/pytorch/blob/master/torch/nn/modules/module.py
- nn.Module is complicated and that means its sometimes a bit slow. Some analysis at https://dev-discuss.pytorch.org/t/overhead-in-nn-module-causing-massive-slowdowns-compared-to-raw-cublas-or-torchscript/110
- Lazy modules PR https://github.com/pytorch/pytorch/pull/44538 and factory kwargs https://github.com/pytorch/pytorch/pull/54508
Liner notes:
- python for hackability (c++ is reimplemented)
- parameters
- parameter collection (for optimization)
- buffers: not considered optimizable
- modules
- functorial operation (_apply)
- jit script: staged computation (init is not scripted)
-
__call__
to forward (extra instrumentation) - serialization / state_dict
- new stuff: device kwarg (joel schlosser)
- new stuff: lazy modules (emcastillo)
- open problems: parameter initialization
83 פרקים
Manage episode 294363178 series 2921809
What goes into the implementation of torch.nn? Why do NN modules exist in the first place? What's the function of Parameter? How do modules actually track all the parameters in question? What is all of the goop in the top level NN module class? What are some new developments in torch.nn modules? What are some open problems with our modules?
Further reading:
- Implementation of nn.Module https://github.com/pytorch/pytorch/blob/master/torch/nn/modules/module.py
- nn.Module is complicated and that means its sometimes a bit slow. Some analysis at https://dev-discuss.pytorch.org/t/overhead-in-nn-module-causing-massive-slowdowns-compared-to-raw-cublas-or-torchscript/110
- Lazy modules PR https://github.com/pytorch/pytorch/pull/44538 and factory kwargs https://github.com/pytorch/pytorch/pull/54508
Liner notes:
- python for hackability (c++ is reimplemented)
- parameters
- parameter collection (for optimization)
- buffers: not considered optimizable
- modules
- functorial operation (_apply)
- jit script: staged computation (init is not scripted)
-
__call__
to forward (extra instrumentation) - serialization / state_dict
- new stuff: device kwarg (joel schlosser)
- new stuff: lazy modules (emcastillo)
- open problems: parameter initialization
83 פרקים
Tất cả các tập
×ברוכים הבאים אל Player FM!
Player FM סורק את האינטרנט עבור פודקאסטים באיכות גבוהה בשבילכם כדי שתהנו מהם כרגע. זה יישום הפודקאסט הטוב ביותר והוא עובד על אנדרואיד, iPhone ואינטרנט. הירשמו לסנכרון מנויים במכשירים שונים.