Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout

Neural Information Processing Systems 

The vast majority of deep models use multiple gradient signals, typically corresponding to a sum of multiple loss terms, to update a shared set of trainable weights.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found