conquer
RL without TD learning
In this post, I'll introduce a reinforcement learning (RL) algorithm based on an "alternative" paradigm: divide and conquer We can do Reinforcement Learning (RL) based on divide and conquer, instead of temporal difference (TD) learning. There are two classes of algorithms in RL: on-policy RL and off-policy RL. On-policy RL means we can use fresh data collected by the current policy. In other words, we have to throw away old data each time we update the policy. Algorithms like PPO and GRPO (and policy gradient methods in general) belong to this category.
ConQuER: Modular Architectures for Control and Bias Mitigation in IQP Quantum Generative Models
Zou, Xiaocheng, Duan, Shijin, Fleming, Charles, Liu, Gaowen, Kompella, Ramana Rao, Ren, Shaolei, Xu, Xiaolin
Quantum generative models based on instantaneous quantum polynomial (IQP) circuits show great promise in learning complex distributions while maintaining classical train-ability. However, current implementations suffer from two key limitations: lack of controllability over generated outputs and severe generation bias towards certain expected patterns. We present a Controllable Quantum Generative Framework, ConQuER, which addresses both challenges through a modular circuit architecture. ConQuER embeds a lightweight controller circuit that can be directly combined with pre-trained IQP circuits to precisely control the output distribution without full retraining. Leveraging the advantages of IQP, our scheme enables precise control over properties such as the Hamming Weight distribution with minimal parameter and gate overhead. In addition, inspired by the controller design, we extend this modular approach through data-driven optimization to embed implicit control paths in the underlying IQP architecture, significantly reducing generation bias on structured datasets. ConQuER retains efficient classical training properties and high scalability.
ConQuer: A Framework for Concept-Based Quiz Generation
Fu, Yicheng, Wang, Zikui, Yang, Liuxin, Huo, Meiqing, Dai, Zhongdongming
Quizzes play a crucial role in education by reinforcing students' understanding of key concepts and encouraging self-directed exploration. However, compiling high-quality quizzes can be challenging and require deep expertise and insight into specific subject matter. Although LLMs have greatly enhanced the efficiency of quiz generation, concerns remain regarding the quality of these AI-generated quizzes and their educational impact on students. To address these issues, we introduce ConQuer, a concept-based quiz generation framework that leverages external knowledge sources. We employ comprehensive evaluation dimensions to assess the quality of the generated quizzes, using LLMs as judges. Our experiment results demonstrate a 4.8% improvement in evaluation scores and a 77.52% win rate in pairwise comparisons against baseline quiz sets. Ablation studies further underscore the effectiveness of each component in our framework. Code available at https://github.com/sofyc/ConQuer.
Devious humour and painful puns: will the cryptic crossword remain the last thing AI can't conquer?
The Times hosts an annual crossword-solving competition and it remains, until such time as the Guardian has its own version, the gold standard. This year's competitors included a dog. Rather, an AI represented as a jolly coffee-drinking dog named Ross (a name hidden in "crossword"), and who is embedded on the Crossword Genius smartphone app. The human competitors at the event โ which took place at Times' parent company News UK's London headquarters, in the shadow of the Shard โ were, as usual, bafflingly fast: pondering the next clue while scribbling the letters of the previous. An AI can conceivably "think" about multiple puzzles at once: so did it outwit us mortals?
On Image Search in Histopathology
Tizhoosh, H. R., Pantanowitz, Liron
Pathology images of histopathology can be acquired from camera-mounted microscopes or whole slide scanners. Utilizing similarity calculations to match patients based on these images holds significant potential in research and clinical contexts. Recent advancements in search technologies allow for nuanced quantification of cellular structures across diverse tissue types, facilitating comparisons and enabling inferences about diagnosis, prognosis, and predictions for new patients when compared against a curated database of diagnosed and treated cases. In this paper, we comprehensively review the latest developments in image search technologies for histopathology, offering a concise overview tailored for computational pathology researchers seeking effective, fast and efficient image search methods in their work.
UniChest: Conquer-and-Divide Pre-training for Multi-Source Chest X-Ray Classification
Dai, Tianjie, Zhang, Ruipeng, Hong, Feng, Yao, Jiangchao, Zhang, Ya, Wang, Yanfeng
Vision-Language Pre-training (VLP) that utilizes the multi-modal information to promote the training efficiency and effectiveness, has achieved great success in vision recognition of natural domains and shown promise in medical imaging diagnosis for the Chest X-Rays (CXRs). However, current works mainly pay attention to the exploration on single dataset of CXRs, which locks the potential of this powerful paradigm on larger hybrid of multi-source CXRs datasets. We identify that although blending samples from the diverse sources offers the advantages to improve the model generalization, it is still challenging to maintain the consistent superiority for the task of each source due to the existing heterogeneity among sources. To handle this dilemma, we design a Conquer-and-Divide pre-training framework, termed as UniChest, aiming to make full use of the collaboration benefit of multiple sources of CXRs while reducing the negative influence of the source heterogeneity. Specially, the ``Conquer" stage in UniChest encourages the model to sufficiently capture multi-source common patterns, and the ``Divide" stage helps squeeze personalized patterns into different small experts (query networks). We conduct thorough experiments on many benchmarks, e.g., ChestX-ray14, CheXpert, Vindr-CXR, Shenzhen, Open-I and SIIM-ACR Pneumothorax, verifying the effectiveness of UniChest over a range of baselines, and release our codes and pre-training models at https://github.com/Elfenreigen/UniChest.
Chinese robot combines wheels and legs to conquer any terrain
A new Chinese robot W1 by LimX Dynamics combines wheels and legs to conquer any terrain. Have you ever wondered what would happen if you combined a dog and a car? Well, you might get something like W1, a wheeled quadruped robot that can switch between walking and rolling modes depending on the terrain. W1 is the first product of LimX Dynamics, a Chinese company that specializes in legged robotics technology. It is part of a growing trend and demand for legged robots, especially in China, where the government and the industry are investing heavily in robotics and artificial intelligence.
Artificial Intelligence is Forcing Us to Answer Some Very Human Questions
Chris Dixon, who invested early in companies ranging from Warby Parker to Kickstarter, once wrote that the next big thing always starts out looking like a toy. That's certainly true of artificial intelligence, which started out playing games like chess, go and playing humans on the game show Jeopardy! Yet today, AI has become so pervasive we often don't even recognize it anymore. Besides enabling us to speak to our phones and get answers back, intelligent algorithms are often working in the background, providing things like predictive maintenance for machinery and automating basic software tasks. As the technology becomes more powerful, it's also forcing us to ask some uncomfortable questions that were once more in the realm of science fiction or late-night dorm room discussions.
The Perfect Text Editor for Jupyter: A Complete Python IDE
This article is part of a series. Check out the full series: Part I, Part II, Part III. Over the past few days, we've been building a complete Python IDE inside Jupyter. In this article, we will add the final touches and package everything in a Docker image to create a portable working environment for data scientists and Machine Learning engineers. It's not even an IPython UI, as many may think.
ConQueR: Query Contrast Voxel-DETR for 3D Object Detection
Zhu, Benjin, Wang, Zhe, Shi, Shaoshuai, Xu, Hang, Hong, Lanqing, Li, Hongsheng
Although DETR-based 3D detectors can simplify the detection pipeline and achieve direct sparse predictions, their performance still lags behind dense detectors with post-processing for 3D object detection from point clouds. DETRs usually adopt a larger number of queries than GTs (e.g., 300 queries v.s. 40 objects in Waymo) in a scene, which inevitably incur many false positives during inference. In this paper, we propose a simple yet effective sparse 3D detector, named Query Contrast Voxel-DETR (ConQueR), to eliminate the challenging false positives, and achieve more accurate and sparser predictions. We observe that most false positives are highly overlapping in local regions, caused by the lack of explicit supervision to discriminate locally similar queries. We thus propose a Query Contrast mechanism to explicitly enhance queries towards their best-matched GTs over all unmatched query predictions. This is achieved by the construction of positive and negative GT-query pairs for each GT, and a contrastive loss to enhance positive GT-query pairs against negative ones based on feature similarities. ConQueR closes the gap of sparse and dense 3D detectors, and reduces up to ~60% false positives. Our single-frame ConQueR achieves new state-of-the-art (sota) 71.6 mAPH/L2 on the challenging Waymo Open Dataset validation set, outperforming previous sota methods (e.g., PV-RCNN++) by over 2.0 mAPH/L2.