Goto

Collaborating Authors

 map


Revisiting Multi-Task Learning with ROCK: a Deep Residual Auxiliary Block for Visual Detection

Taylor Mordan, Nicolas THOME, Gilles Henaff, Matthieu Cord

Neural Information Processing Systems

Extensiveexperiments on NYUv2 dataset (object detection with scene classification, depth prediction, and surface normal estimation as auxiliary tasks) validate the relevance of the approach and its superiority to flat MTL approaches.





Supplementary: CharacterizingGeneralizationunder Out-Of-DistributionShiftsinDeepMetricLearning

Neural Information Processing Systems

Subsequently, we select train-test splits from the same iteration steps. These settings are used throughout our study. For the few-shot experiments, the same pipeline parameters were utilized with changes noted in the respectivesection. However,thefactthatFIDscores are relatively close to another despite large semantic differences between datasets may indicate that FID based on our utilised FID estimator (Sec. Beyond these limits, generic representations learned byself-supervised learning may offerbetter zero-shot generalization,asalsodiscussedonSec.


c622c085c04eadc473f08541b255320e-Supplemental.pdf

Neural Information Processing Systems

The positive with the lowest rankx1 has a gradient in the good direction, since it leads to increasex1'sscore because the correct ordering is not reached (the negativeinstance WecanseeinFig.2bthatthis change enables tohavegradients inthecorrect directions forthetwopositiveinstancesx1 and x2 (tending to increase their scores), and for the negative instancex3 (tending to decrease its score). However there is still vanishing gradients. Overall, LSupAP has all the desired properties: i) A correct gradient flow during training, ii) No vanishing gradients while the correct ranking isnot reached, iii)Being anupper bound onthe AP lossLAP. We now write that each positive instance that respects the constraint ofLcalibr. A.3 Choiceofδ In the main paper we introduceδ in Eq. (4) to defineH .


Text-InfusedAttentionandForeground-Aware ModelingforZero-ShotTemporalActionDetection

Neural Information Processing Systems

Our simple approach results insuperior performance compared toprevious methods. Despite this improvement, we further identify a common-action bias issue that the cross-modal baseline over-focus on common sub-actions due to a lack of ability todiscriminate text-related visual parts.


Bootstrap Your Object Detector via Mixed Training

Neural Information Processing Systems

We introduce MixTraining, a new training paradigm for object detection that can improve the performance of existing detectors for free. MixTraining enhances data augmentation by utilizing augmentations of different strengths while excluding the strong augmentations of certain training samples that may be detrimental to training. In addition, it addresses localization noise and missing labels in human annotations by incorporating pseudo boxes that can compensate for these errors. Both of these MixTraining capabilities are made possible through bootstrapping on the detector, which can be used to predict the difficulty of training on a strong augmentation, as well as to generate reliable pseudo boxes thanks to the robustness of neural networks to labeling error. MixTraining is found to bring consistent improvements across various detectors on the COCO dataset.


Inclusion of Role into Named Entity Recognition and Ranking

Shukla, Neelesh Kumar, Singh, Sanasam Ranbir

arXiv.org Artificial Intelligence

Most of the Natural Language Processing systems are involved in entity-based processing for several tasks like Information Extraction, Question-Answering, Text-Summarization and so on. A new challenge comes when entities play roles according to their act or attributes in certain context. Entity Role Detection is the task of assigning such roles to the entities. Usually real-world entities are of types: person, location and organization etc. Roles could be considered as domain-dependent subtypes of these types. In the cases, where retrieving a subset of entities based on their roles is needed, poses the problem of defining the role and entities having those roles. This paper presents the study of study of solving Entity Role Detection problem by modeling it as Named Entity Recognition (NER) and Entity Retrieval/Ranking task. In NER, these roles could be considered as mutually exclusive classes and standard NER methods like sequence tagging could be used. For Entity Retrieval, Roles could be formulated as Query and entities as Collection on which the query needs to be executed. The aspect of Entity Retrieval task, which is different than document retrieval task is that the entities and roles against which they need to be retrieved are indirectly described. We have formulated automated ways of learning representative words and phrases and building representations of roles and entities using them. We have also explored different contexts like sentence and document. Since the roles depend upon context, so it is not always possible to have large domain-specific dataset or knowledge bases for learning purposes, so we have tried to exploit the information from small dataset in domain-agnostic way.