Goto

Collaborating Authors

 forte


Strong, Accurate, and Low-Cost Robot Manipulator

Chebly, Georges, Little, Spencer, Perera, Nisal, Abedeen, Aliya, Suzuki, Ken, Kim, Donghyun

arXiv.org Artificial Intelligence

--This paper presents Forte, a fully 3D-printable, 6-DoF robotic arm designed to achieve near industrial-grade performance - 0 . As an accessible robot for broad applications across classroom education to AI experiments, Forte pushes forward the performance limitations of existing low-cost educational arms. We introduce a cost-effective mechanical design that combines capstan-based cable drives, timing belts, simple tensioning mechanisms, and lightweight 3D-printed structures, along with topology optimization for structural stiffness. Through careful drivetrain engineering, we minimize backlash and maintain control fidelity without relying on high-power electronics or expensive manufacturing processes. Experimental validation demonstrates that Forte achieves high repeatability and load capacity, offering a compelling robotic platform for both classroom instruction and advanced robotics research. Can we build a 6-degree-of-freedom (DoF) robotic arm with a material cost under $400, while achieving a half-meter workspace, a payload capacity of more than 0.5 kg, and repeatability within 0. 5 mm? We introduce Forte, a fully 3D-printed robotic manipulator, developed to affirmatively answer this question. In light of surging interest in robotics and artificial intelligence, providing accessible, hands-on educational tools has never been more important, as practical experience and experimental validation are essential components of robotics education.


Composable NLP Workflows for BERT-based Ranking and QA System

Kumar, Gaurav, Dandu, Murali Mohana Krishna

arXiv.org Artificial Intelligence

There has been a lot of progress towards building NLP models that scale to multiple tasks. However, real-world systems contain multiple components and it is tedious to handle cross-task interaction with varying levels of text granularity. In this work, we built an end-to-end Ranking and Question-Answering (QA) system using Forte, a toolkit that makes composable NLP pipelines. We utilized state-of-the-art deep learning models such as BERT, RoBERTa in our pipeline, evaluated the performance on MS-MARCO and Covid-19 datasets using metrics such as BLUE, MRR, F1 and compared the results of ranking and QA systems with their corresponding benchmark results. The modular nature of our pipeline and low latency of reranker makes it easy to build complex NLP applications easily.


Forte : Finding Outliers with Representation Typicality Estimation

Ganguly, Debargha, Morningstar, Warren, Yu, Andrew, Chaudhary, Vipin

arXiv.org Artificial Intelligence

Generative models can now produce photorealistic synthetic data which is virtually indistinguishable from the real data used to train it. This is a significant evolution over previous models which could produce reasonable facsimiles of the training data, but ones which could be visually distinguished from the training data by human evaluation. Recent work on OOD detection has raised doubts that generative model likelihoods are optimal OOD detectors due to issues involving likelihood misestimation, entropy in the generative process, and typicality. We speculate that generative OOD detectors also failed because their models focused on the pixels rather than the semantic content of the data, leading to failures in near-OOD cases where the pixels may be similar but the information content is significantly different. We hypothesize that estimating typical sets using self-supervised learners leads to better OOD detectors. We introduce a novel approach that leverages representation learning, and informative summary statistics based on manifold estimation, to address all of the aforementioned issues. Our method outperforms other unsupervised approaches and achieves state-of-the art performance on well-established challenging benchmarks, and new synthetic data detection tasks.


Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation

Li, Cheng-Yi, Chang, Kao-Jung, Yang, Cheng-Fu, Wu, Hsin-Yu, Chen, Wenting, Bansal, Hritik, Chen, Ling, Yang, Yi-Ping, Chen, Yu-Chun, Chen, Shih-Pin, Lirng, Jiing-Feng, Chang, Kai-Wei, Chiou, Shih-Hwa

arXiv.org Artificial Intelligence

Multi-modal large language models (MLLMs) have been given free rein to explore exciting medical applications with a primary focus on radiology report generation. Nevertheless, the preliminary success in 2D radiology captioning is incompetent to reflect the real-world diagnostic challenge in the volumetric 3D anatomy. To mitigate three crucial limitation aspects in the existing literature, including (1) data complexity, (2) model capacity, and (3) evaluation metric fidelity, we collected an 18,885 text-scan pairs 3D-BrainCT dataset and applied clinical visual instruction tuning (CVIT) to train BrainGPT models to generate radiology-adherent 3D brain CT reports. Statistically, our BrainGPT scored BLEU-1 = 44.35, BLEU-4 = 20.38, METEOR = 30.13, ROUGE-L = 47.6, and CIDEr-R = 211.77 during internal testing and demonstrated an accuracy of 0.91 in captioning midline shifts on the external validation CQ500 dataset. By further inspecting the captioned report, we reported that the traditional metrics appeared to measure only the surface text similarity and failed to gauge the information density of the diagnostic purpose. To close this gap, we proposed a novel Feature-Oriented Radiology Task Evaluation (FORTE) to estimate the report's clinical relevance (lesion feature and landmarks). Notably, the BrainGPT model scored an average FORTE F1-score of 0.71 (degree=0.661; landmark=0.706; feature=0.693; impression=0.779). To demonstrate that BrainGPT models possess objective readiness to generate human-like radiology reports, we conducted a Turing test that enrolled 11 physician evaluators, and around 74% of the BrainGPT-generated captions were indistinguishable from those written by humans. Our work embodies a holistic framework that showcased the first-hand experience of curating a 3D brain CT dataset, fine-tuning anatomy-sensible language models, and proposing robust radiology evaluation metrics.


Global Big Data Conference

#artificialintelligence

Flat materials that can morph into three-dimensional shapes have potential applications in architecture, medicine, robotics, space travel, and much more. But programming these shape changes requires complex and time-consuming computations. Now, researchers from the Harvard John A. Paulson School of Engineering and Applied Sciences (SEAS) have developed a platform that uses machine learning to program the transformation of 2D stretchable surfaces into specific 3D shapes. "While machine learning methods have been classically employed for image recognition and language processing, they have also recently emerged as powerful tools to solve mechanics problems," said Katia Bertoldi, the William and Ami Kuan Danoff Professor of Applied Mechanics at SEAS and senior author of the study. "In this work we demonstrate that these tools can be extended to study the mechanics of transformable, inflatable systems."


Catching the Fakes

Communications of the ACM

Counterfeiting is a big business. Nearly $509 billion of fake and pirated products were sold internationally in 2016. In that year, the latest for which data was available, counterfeit goods made up 3.3% of international trade, up from 2.5% three years earlier, according to the Organization for Economic Cooperation and Development. That figure, which does not include domestic trade in fakes, not only means companies are losing revenue and consumers are not getting their money's worth; counterfeiting also helps fund organized crime. Because it skirts safety regulations, makers of counterfeits could use toxic materials or produce unsafe products.


Machine learning in cybersecurity moves needle, doesn't negate threats

#artificialintelligence

Using artificial intelligence and machine learning in cybersecurity is gaining ground. Most IT leaders are looking at intelligent solutions, according to the May 2017 report "Next Generation Cybersecurity Analytics and Operations Survey." The survey was commissioned by DFLabs, a provider of security automation and orchestration technology, and researched by Enterprise Strategy Group (ESG). "Most of the people [in the survey] were definitely saying that machine learning is something they're evaluating from a strong security standpoint," said Dario Forte, CEO of survey sponsor DFLabs. The report, based on a survey of 412 IT and cybersecurity professionals, found that 93% of IT leaders are using or planning to use these types of solutions: 12% of respondents have deployed machine learning technologies designed for security analytics and operations automation and orchestration; another 27% said they're doing so on a limited basis, while 22% said they're adding them.


The grocery shopping app for the 99%

#artificialintelligence

While the convenience of getting groceries delivered to doorsteps is enticing, it tends to be a costly luxury. The vast majority of Americans are still strolling down store aisles themselves to check items off their shopping lists. And it's this demographic that startup Basket is targeting. The company is highlighting price transparency to helping people discover the best priced items at local stores. It does so by leveraging the power of the crowd. Basket has built up a database of grocery store items and their prices by tasking shoppers with capturing that information.