AITopics | Jiang, Junchen

Collaborating Authors

Jiang, Junchen

Automatic and Efficient Customization of Neural Networks for ML Applications

Liu, Yuhan, Wan, Chengcheng, Du, Kuntai, Hoffmann, Henry, Jiang, Junchen, Lu, Shan, Maire, Michael

arXiv.org Artificial Intelligence, Oct-7-2023

ML APIs have greatly relieved application developers of the burden of designing and training their own neural network models -- classifying objects in an image can now be as simple as one line of Python code that calls an API. However, these APIs offer the same pre-trained models regardless of how their output is used by different applications. This can be suboptimal because not all ML inference errors cause application failures, and the distinction between inference errors that can and cannot cause failures varies greatly across applications. To tackle this problem, we first study 77 real-world applications, which collectively use six ML APIs from two providers, to reveal common patterns of how ML API output affects applications' decision processes. Inspired by the findings, we propose ChameleonAPI, an optimization framework for ML APIs, which takes effect without changing the application source code. ChameleonAPI provides application developers with a parser that automatically analyzes the application to produce an abstract of its decision process, which is then used to devise an application-specific loss function that only penalizes API output errors critical to the application. ChameleonAPI uses the loss function to efficiently train a neural network model customized for each application and deploys it to serve API invocations from the respective application via the existing interface. Compared to a baseline that selects the best of all commercial ML APIs, we show that ChameleonAPI reduces incorrect application decisions by 43%.
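
The idea of a loss that only penalizes application-critical errors can be illustrated with a minimal sketch. It is not ChameleonAPI's actual loss function: the label-to-decision table, the `DEFAULT_DECISION` fallback, and both function names are hypothetical, chosen to mimic an application that branches on groups of labels. The sketch sums the predicted probability of every label that leads to the same decision as the true label, so confusing two labels inside one group costs nothing.

```python
import math

# Hypothetical application logic: which decision each API label triggers.
DECISION_OF_LABEL = {
    "bottle": "recycle",
    "can": "recycle",
    "banana": "compost",
    "apple": "compost",
}
DEFAULT_DECISION = "trash"  # assumed fallback for labels the app never checks


def decision(label):
    """Return the application decision that a label leads to."""
    return DECISION_OF_LABEL.get(label, DEFAULT_DECISION)


def label_level_loss(label_probs, true_label):
    """Ordinary cross-entropy on the exact label."""
    return -math.log(max(label_probs.get(true_label, 0.0), 1e-12))


def decision_level_loss(label_probs, true_label):
    """Cross-entropy on the decision: mass on any label with the right decision counts."""
    target = decision(true_label)
    mass = sum(p for label, p in label_probs.items() if decision(label) == target)
    return -math.log(max(mass, 1e-12))


# The model confuses "bottle" with "can", which the application treats identically.
probs = {"bottle": 0.1, "can": 0.8, "banana": 0.05, "shoe": 0.05}
print(label_level_loss(probs, "bottle"))     # large: the exact label is wrong
print(decision_level_loss(probs, "bottle"))  # small: the decision is still right
```

In this toy setting the label-level loss penalizes a confusion that never changes what the application does, while the decision-level loss does not.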

application, artificial intelligence, machine learning, (20 more...)

arXiv: 2310.04685

Country: North America > United States (0.93)

Genre: Research Report (0.64)

Industry: Information Technology > Services (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

OneAdapt: Fast Adaptation for Deep Learning Applications via Backpropagation

Du, Kuntai, Liu, Yuhan, Hao, Yitian, Zhang, Qizheng, Wang, Haodong, Huang, Yuyang, Ananthanarayanan, Ganesh, Jiang, Junchen

arXiv.org Artificial Intelligence, Oct-3-2023

Deep learning inference on streaming media data, such as object detection in video or LiDAR feeds and text extraction from audio waves, is now ubiquitous. To achieve high inference accuracy, these applications typically require significant network bandwidth to gather high-fidelity data and extensive GPU resources to run deep neural networks (DNNs). While the high demand for network bandwidth and GPU resources could be substantially reduced by optimally adapting the configuration knobs, such as video resolution and frame rate, current adaptation techniques fail to meet three requirements simultaneously: adapting configurations (i) with minimal extra GPU or bandwidth overhead, (ii) to reach near-optimal decisions based on how the data affects the final DNN's accuracy, and (iii) across a range of configuration knobs. This paper presents OneAdapt, which meets these requirements by leveraging a gradient-ascent strategy to adapt configuration knobs. The key idea is to embrace DNNs' differentiability to quickly estimate the gradient of accuracy with respect to each configuration knob, called AccGrad. Specifically, OneAdapt estimates AccGrad by multiplying two gradients: InputGrad (i.e., how each configuration knob affects the input to the DNN) and DNNGrad (i.e., how the DNN input affects the DNN inference output). We evaluate OneAdapt across five types of configurations, four analytic tasks, and five types of input data. Compared to state-of-the-art adaptation schemes, OneAdapt cuts bandwidth usage and GPU usage by 15-59% while maintaining comparable accuracy, or improves accuracy by 1-5% while using equal or fewer resources.
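
The product of InputGrad and DNNGrad is an application of the chain rule, which a toy example can make concrete. The sketch below is not OneAdapt's implementation: `encode` and `accuracy_proxy` are hypothetical stand-ins for the codec and the DNN, the single scalar knob is an assumption, and the real system also accounts for resource usage, which is omitted here. InputGrad is estimated by finite differences and DNNGrad is written analytically, where a real DNN would supply it through backpropagation.

```python
def encode(knob):
    """Hypothetical codec: maps one configuration knob to a 3-value DNN input."""
    return [knob * 1.0, knob * 0.5, knob ** 2]


def accuracy_proxy(x):
    """Hypothetical differentiable stand-in for DNN accuracy; peaks at x = [1, 0.5, 1]."""
    return -((x[0] - 1.0) ** 2 + (x[1] - 0.5) ** 2 + (x[2] - 1.0) ** 2)


def dnn_grad(x):
    """DNNGrad: gradient of accuracy_proxy with respect to each input value."""
    return [-2 * (x[0] - 1.0), -2 * (x[1] - 0.5), -2 * (x[2] - 1.0)]


def input_grad(knob, eps=1e-6):
    """InputGrad: central finite-difference estimate of d(input)/d(knob)."""
    hi, lo = encode(knob + eps), encode(knob - eps)
    return [(h - l) / (2 * eps) for h, l in zip(hi, lo)]


def acc_grad(knob):
    """AccGrad via the chain rule: sum over inputs of DNNGrad * InputGrad."""
    x = encode(knob)
    return sum(d * i for d, i in zip(dnn_grad(x), input_grad(knob)))


def adapt(knob, steps=200, lr=0.05):
    """Plain gradient ascent on the knob using AccGrad."""
    for _ in range(steps):
        knob += lr * acc_grad(knob)
    return knob


print(acc_grad(0.5))  # positive: raising the knob improves the proxy
print(adapt(0.5))     # moves toward the knob value that maximizes the proxy
```

In this toy setup the proxy is maximized at a knob value of 1, so the ascent loop settles there; with a real DNN neither function has a closed form.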

artificial intelligence, deep learning application, machine learning, (3 more...)

doi: 10.1145/3620678.3624653

arXiv: 2310.02422

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Sayer: Using Implicit Feedback to Optimize System Policies

Lécuyer, Mathias, Kim, Sang Hoon, Nanavati, Mihir, Jiang, Junchen, Sen, Siddhartha, Sharma, Amit, Slivkins, Aleksandrs

arXiv.org Machine Learning, Oct-28-2021

We observe that many system policies that make threshold decisions involving a resource (e.g., time, memory, cores) naturally reveal additional, or implicit, feedback. For example, if a system waits X min for an event to occur, then it automatically learns what would have happened if it waited

implicit feedback, machine learning, reinforcement learning, (20 more...)

arXiv: 2110.14874

Country: North America > United States (0.30)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.88)
Information Technology > Data Science > Data Mining > Big Data (0.67)

Ekya: Continuous Learning of Video Analytics Models on Edge Compute Servers

Bhardwaj, Romil, Xia, Zhengxu, Ananthanarayanan, Ganesh, Jiang, Junchen, Karianakis, Nikolaos, Shu, Yuanchao, Hsieh, Kevin, Bahl, Victor, Stoica, Ion

arXiv.org Artificial Intelligence, Dec-18-2020

Video analytics applications analyze videos on edge compute servers (for bandwidth and privacy reasons). Compressed models that are deployed on the edge servers for inference suffer from data drift, where the live video data diverges from the training data. Continuous learning handles data drift by periodically retraining the models on new data. Our work addresses the challenge of jointly supporting inference and retraining tasks on edge servers, which requires navigating the fundamental tradeoff between the retrained model's accuracy and the inference accuracy. Our solution, Ekya, balances this tradeoff across multiple models and uses a micro-profiler to identify the models that will benefit the most from retraining. Ekya's accuracy gain compared to a baseline scheduler is 29% higher, and the baseline requires 4x more GPU resources to achieve the same accuracy as Ekya.
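
As a rough illustration of choosing which models to retrain under a GPU budget, the sketch below ranks candidates by estimated accuracy gain per unit of GPU cost and picks them greedily. It is a simplified assumption, not Ekya's scheduler: the `Candidate` fields, the greedy rule, and the example numbers are all hypothetical, and the inference side of the tradeoff is left out.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    est_gain: float  # estimated accuracy gain from retraining (assumed profiler output)
    gpu_cost: float  # GPU-seconds that retraining would take (assumed, must be > 0)


def pick_models_to_retrain(candidates, gpu_budget):
    """Greedily pick models with the best estimated gain per GPU-second."""
    chosen = []
    remaining = gpu_budget
    ranked = sorted(candidates, key=lambda c: c.est_gain / c.gpu_cost, reverse=True)
    for c in ranked:
        # Skip models that would not improve or that no longer fit in the budget.
        if c.est_gain > 0 and c.gpu_cost <= remaining:
            chosen.append(c.name)
            remaining -= c.gpu_cost
    return chosen


# Hypothetical per-camera models with made-up profile estimates.
models = [
    Candidate("cam_a", est_gain=0.10, gpu_cost=20),
    Candidate("cam_b", est_gain=0.02, gpu_cost=10),
    Candidate("cam_c", est_gain=0.12, gpu_cost=40),
]
print(pick_models_to_retrain(models, gpu_budget=60))
```

Whatever GPU time is not spent on retraining remains available for inference, which is the tradeoff the abstract describes.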

accuracy, deep learning, neural network, (20 more...)

arXiv: 2012.10557

Country:

Europe (0.46)
North America > United States > New York (0.14)

Genre: Research Report (1.00)

Industry:

Information Technology (0.93)
Education > Educational Setting > Continuing Education (0.71)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Addressing Training Bias via Automated Image Annotation

Xiao, Zhujun, Zhu, Yanzi, Chen, Yuxin, Zhao, Ben Y., Jiang, Junchen, Zheng, Haitao

arXiv.org Machine Learning, Oct-10-2018

Building accurate DNN models requires training on large, labeled, context-specific datasets, especially those matching the target scenario. We believe advances in wireless localization, working in unison with cameras, can produce automated annotation of targets on images and videos captured in the wild. Using pedestrian and vehicle detection as examples, we demonstrate the feasibility, benefits, and challenges of an automatic image annotation system. Our work calls for new technical development on passive localization, mobile data analytics, and error-resilient ML models, as well as design issues in user privacy policies.

deep learning, localization, neural network, (21 more...)

arXiv: 1809.10242

Country:

Europe (0.46)
North America > United States > California (0.14)

Genre: Research Report > New Finding (0.67)

Industry:

Education (0.98)
Information Technology > Security & Privacy (0.46)
Transportation > Ground > Road (0.31)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Data Science (1.00)
Information Technology > Communications > Networks (1.00)
(2 more...)
