AITopics | second stage

HumanLiker: AHuman-like Object Detector to Model the Manual Labeling Process

Neural Information Processing SystemsApr-24-2026, 14:55:31 GMT

Popular object detection models generate bounding boxes in a different way than we humans. As an example, modern detectors yield object box either upon the regression of its center and width/height (center-guided detector), or by grouping paired estimated corners (corner-guided detector). However, that is not the pattern we manually label an object due to high degrees of freedom in searching centers or low efficiency of grouping corners. Empirically, humans run two steps to locate an object bounding box manually: 1) click the mouse at the top-left corner of object, and then drag the mouse to the bottom-right corner; 2) refine the corner positions to make the bounding box more precisely, if necessary. Inspired by this manual labeling process, we propose a novel human-like detector, termed as HumanLiker, which is devised as a two-stage end-to-end detector to simulate the two aforementioned. Like we humans in manual labeling, HumanLiker can effectively avert both the thorny center searching and heuristic corner grouping. Different from the mainstream detector branches, i.e., the center/corner-guided methods, the HumanLiker provides a new paradigm which integrates the advantages of both branches to balance the detection efficiency and bounding box quality. On MS-COCO test-dev set, HumanLiker can achieve 50.2%/51.6%

artificial intelligence, humanliker, machine learning, (14 more...)

Neural Information Processing Systems

Genre: Research Report (0.68)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

We provide a simple pseudo-2

Neural Information Processing SystemsFeb-19-2026, 03:54:34 GMT

We thank all the reviewers for their constructive comments. We will provide details in the final draft. MCUNet shows consistent improvement across different devices (F746, H743) and tasks (classification, detection). R1: Whether the overall network topology brings major improvement. R2: Why the auto-tuning in TVM fails to work on MCUs.

artificial intelligence, procedure, search space, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence (0.58)
Information Technology > Communications (0.36)

Add feedback

61960fdfda4d4e95fa1c1f6e64bfe8bc-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 16:12:55 GMT

artificial intelligence, machine learning, variation, (16 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Vision (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Add feedback

481c70828a4ff20d31a646cc6cc95f3d-Paper-Conference.pdf

Neural Information Processing SystemsFeb-12-2026, 09:22:57 GMT

artificial intelligence, machine learning, natural language, (15 more...)

Neural Information Processing Systems

Country: Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(4 more...)

Add feedback

621461af90cadfdaf0e8d4cc25129f91-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-12-2026, 09:12:28 GMT

final version, hinge loss, second stage, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.33)

Add feedback

TrashorTreasure?AnInteractiveDual-Stream StrategyforSingleImageReflectionSeparation

Neural Information Processing SystemsFeb-11-2026, 06:08:56 GMT

Existing deep learning based solutions typically restore the target layers individually, or with some concerns at the end of the output, barely taking into account the interaction across thetwostreams/branches. Inorder toutilize information more efficiently, this work presents a general yet simple interactive strategy, namely your trash is my treasure(YTMT), for constructing dual-stream decomposition networks.

artificial intelligence, incvpr, machine learning, (18 more...)

Neural Information Processing Systems

Country:

Asia > China (0.05)
North America > United States > Washington > King County > Seattle (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

EfficientCAPER: AnEnd-to-End FrameworkforFast andRobustCategory-LevelArticulatedObjectPose Estimation

Neural Information Processing SystemsFeb-11-2026, 05:58:01 GMT

Human life is populated with articulated objects.

artificial intelligence, efficientcaper, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > China > Anhui Province > Hefei (0.04)
Europe > United Kingdom > Wales > Swansea (0.04)
(3 more...)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

9793671e4be9858a69a32545204d59d1-Supplemental-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 22:27:20 GMT

configuration, objective, scenario, (12 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

95f03faf3763e1b1ce2c3de62da8f090-Paper-Conference.pdf

Neural Information Processing SystemsFeb-10-2026, 21:32:53 GMT

binaural audio, diffusion model, information, (14 more...)

Neural Information Processing Systems

Country:

Asia > Middle East > Israel (0.04)
Asia > China (0.04)
Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.41)

Add feedback

MoVQ: Modulating QuantizedVectorsforHigh-FidelityImage Generation ADiscussiononMaskedImageReconstruction

Neural Information Processing SystemsFeb-10-2026, 20:45:29 GMT

Inothercolumns, werandomly masksome tokens (first row), and we sample the invisible tokens based on the visible tokens for the second stage. Here, we show top-1 results in 1 step (second row), and random results in 8 steps (third row),respectively. Interestingly, our model with 95% masked tokens (i.e., 12 tokens are visible among 256 tokens in each channel) is able to generate pluralistic images in only one step by selecting the top 1 token. More importantly, the corresponding results reflect identity attributes of original unmaskedinputs. When the tokens are totally masked (i.e., 100% mask ratio), the model generates plausible and diversity results byrandomly sampling tokens inmultiple steps.

artificial intelligence, movq, thisisanextensionoffig, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

Filters

Collaborating Authors

second stage

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

HumanLiker: AHuman-like Object Detector to Model the Manual Labeling Process

We provide a simple pseudo-2

61960fdfda4d4e95fa1c1f6e64bfe8bc-Supplemental-Conference.pdf

481c70828a4ff20d31a646cc6cc95f3d-Paper-Conference.pdf

621461af90cadfdaf0e8d4cc25129f91-AuthorFeedback.pdf

TrashorTreasure?AnInteractiveDual-Stream StrategyforSingleImageReflectionSeparation

EfficientCAPER: AnEnd-to-End FrameworkforFast andRobustCategory-LevelArticulatedObjectPose Estimation

9793671e4be9858a69a32545204d59d1-Supplemental-Conference.pdf

95f03faf3763e1b1ce2c3de62da8f090-Paper-Conference.pdf

MoVQ: Modulating QuantizedVectorsforHigh-FidelityImage Generation ADiscussiononMaskedImageReconstruction