Goto

Collaborating Authors

 Asia


Probabilistic data quality assessment for structural monitoring data via outlier-resistant conditional diffusion model

arXiv.org Machine Learning

Data quality assessment is an essential step that ensures the reliability of the subsequent structural health monitoring (SHM) tasks. This study proposes a prediction deviation-based SHM data quality assessment method using a univariate implicit auto-regressive model, enabling outlier diagnosis and data cleaning. The proposed conditional diffusion model (CDM) augments the standard diffusion model with a conditional embedding module to incorporate temporal context, quartile normalization to mitigate distribution skew, and a Huber loss to enhance robustness against outliers. Within this univariate implicit autoregressive framework, each data point is assigned an outlier probability, quantifying its degree of "outlier-ness", and a global quality evaluation score is computed to characterize the overall dataset quality. Extensive case studies utilizing operational data from real-world structures demonstrate that the proposed framework significantly improves the accuracy of data quality assessment, outperforming other strong baselines representative of clustering, isolation-based, and deep reconstruction methods. The effectiveness and robustness of the proposed framework are further demonstrated by the findings of ablation experiments and hyperparameter analysis.


Unsupervised Polychromatic Neural Representation for CTMetal Artifact Reduction

Neural Information Processing Systems

Emerging neural reconstruction techniques based on tomography (e.g., NeRF, NeAT, and NeRP) have started showing unique capabilities in medical imaging. In this work, we present a novel Polychromatic neural representation (Polyner) to tackle the challenging problem of CT imaging when metallic implants exist within the human body. CT metal artifacts arise from the drastic variation of metal's attenuation coefficients at various energy levels of the X-ray spectrum, leading to a nonlinear metal effect in CT measurements. Recovering CT images from metal-affected measurements hence poses a complicated nonlinear inverse problem where empirical models adopted in previous metal artifact reduction (MAR) approaches lead to signal loss and strongly aliased reconstructions.


DynPoint: Dynamic Neural Point For View Synthesis

Neural Information Processing Systems

The introduction of neural radiance fields has greatly improved the effectiveness of view synthesis for monocular videos. However, existing algorithms face difficulties when dealing with uncontrolled or lengthy scenarios, and require extensive training time specific to each new scenario. To tackle these limitations, we propose DynPoint, an algorithm designed to facilitate the rapid synthesis of novel views for unconstrained monocular videos. Rather than encoding the entirety of the scenario information into a latent representation, DynPoint concentrates on predicting the explicit 3D correspondence between neighboring frames to realize information aggregation. Specifically, this correspondence prediction is achieved through the estimation of consistent depth and scene flow information across frames. Subsequently, the acquired correspondence is utilized to aggregate information from multiple reference frames to a target frame, by constructing hierarchical neural point clouds. The resulting framework enables swift and accurate view synthesis for desired views of target frames. The experimental results obtained demonstrate the considerable acceleration of training time achieved - typically an order of magnitude - by our proposed method while yielding comparable outcomes compared to prior approaches. Furthermore, our method exhibits strong robustness in handling long-duration videos without learning a canonical representation of video content.



Recursion in Recursion: Two-Level Nested Recursion for Length Generalization with Scalability

Neural Information Processing Systems

Binary Balanced Tree Recursive Neural Networks (BBT-RvNNs) enforce sequence composition according to a preset balanced binary tree structure. Thus, their nonlinear recursion depth (which is the tree depth) is just log2 n(nbeing the sequence length). Such logarithmic scaling makes BBT-RvNNs efficient and scalable on long sequence tasks such as Long Range Arena (LRA). However, such computational efficiency comes at a cost because BBT-RvNNs cannot solve simple arithmetic tasks like ListOps. On the flip side, RvNN models (e.g., Beam Tree RvNN) that do succeed on ListOps (and other structure-sensitive tasks like formal logical inference) are generally several times more expensive (in time and space) than even Recurrent Neural Networks.


Double Randomized Underdamped Langevin with Dimension-Independent Convergence Guarantee

Neural Information Processing Systems

This paper focuses on the high-dimensional sampling of log-concave distributions with composite structures: p (dx) exp( g(x) f(x))dx. We develop a double randomization technique, which leads to a fast underdamped Langevin algorithm with a dimension-independent convergence guarantee.



Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation

Neural Information Processing Systems

Vision-Language Pre-training has demonstrated its remarkable zero-shot recognition ability and potential to learn generalizable visual representations from language supervision. Taking a step ahead, language-supervised semantic segmentation enables spatial localization of textual inputs by learning pixel grouping solely from image-text pairs. Nevertheless, the state-of-the-art suffers from clear semantic gaps between visual and textual modality: plenty of visual concepts appeared in images are missing in their paired captions. Such semantic misalignment circulates in pre-training, leading to inferior zero-shot performance in dense predictions due to insufficient visual concepts captured in textual representations. To close such semantic gap, we propose Concept Curation (CoCu), a pipeline that leverages CLIP to compensate for the missing semantics. For each image-text pair, we establish a concept archive that maintains potential visually-matched concepts with our proposed vision-driven expansion and text-to-vision-guided ranking. Relevant concepts can thus be identified via cluster-guided sampling and fed into pre-training, thereby bridging the gap between visual and textual semantics. Extensive experiments over a broad suite of 8 segmentation benchmarks show that CoCu achieves superb zeroshot transfer performance and greatly boosts language-supervised segmentation baseline by a large margin, suggesting the value of bridging semantic gap in pretraining data.



Japanese airline starts testing robot baggage handlers, and the early returns are not impressive

FOX News

Fecal vandal's nearly weeklong crime spree comes to an end when police catch her in the act Catching the horny landlady teaching your boyfriend mouth-to-mouth is not a sign that it's time to move MAGA bikini congresswoman sends a message to big brother, Dale Earnhardt turns 75 & MLB fan gets pulverized! Wait... Who is actually using highway rest stop BBQ grills? Hilary Duff's latest Instagram content has suburban millennial moms gasping, a tennis match turns nasty & MEAT Opening day at Six Flags St. Louis ended in chaos after brawl with as many as 100 people broke out Mountain climber survives terrifying 500-foot fall in California's Sierra Nevada, night stranded on ledge Ella Langley's brand deal with American Eagle shows Bud Light how it could've been in 2023, fan fight & MEAT Shannon Elizabeth, to nobody's surprise, cashes in on OnlyFans with reported 7-figure payday in her first week Airline doesn't buy couple's claim that they were praying, bans them for attempting to join mile high club Bill Maher & David Cross get into heated war of words over'looney left' & trans rights, including 3-year-old'Map wars': Brit Hume says redistricting battle is'as bitter' as he's ever seen it Candidates make their case as California governor's race intensifies Hegseth, Caine defend Pentagon's budget request on Capitol Hill Greg Gutfeld: Walz tries to appear'above it all,' but is'drowning' in corruption Ukraine is'militarily' defeated: Trump Trump posts AI image of himself with a gun, says Iran'better get smart soon' Trump calls Comey a'dirty cop' and a'crooked man' Sen. Rand Paul backs White House ballroom after WHCA shooting Steven Hilton says voter ID push could boost GOP turnout in California governor's race There's no question that robots are going to be coming for some folks' jobs sooner rather than later, and it looks like baggage handlers could be one of the first on the robo-chopping block. Japan Airlines is going to start rolling out its humanoid robots to help with baggage at Tokyo's Haneda Airport. Now, while I'm usually not one to celebrate something like this -- I feel it's just one step closer to all of us having to pay our respects to robot overlords -- I was excited about it.