10 interesting Deep learning libraries to checkout
It has 10 tasks like retrieval, captioning, visual question answering, multimodal classification, Natural Language Visual Reasoning, Visual Dialogue, Video/Image-text Retrieval etc. It also contains 20 datasets and 30 pre-trained SOTA models for foundation language-vision models. NeMo: NVIDIA's NeMo is a is a conversational AI toolbox to work on automatic speech recognition (ASR), text-to-speech synthesis (TTS), large language models (LLMs), and natural language processing(NLP). NeMo's main goal is to assist researchers from industry and academia in reusing previous work (code and pretrained models) and to facilitate the development of new conversational AI models. Various model architectures are available for Object Detection, Instance Segmentation, Panoptic Segmentation, Contrastive Learning and Distillation. One can use existing or new datasets/models, also customize them for your problems.
Nov-5-2022, 22:50:18 GMT