AITopics | Large Language Model

Collaborating Authors

Large Language Model

News Overviews Instructional Materials AI-Alerts Classics

EVA-02: A Visual Representation for Neon Genesis

Fang, Yuxin, Sun, Quan, Wang, Xinggang, Huang, Tiejun, Wang, Xinlong, Cao, Yue

arXiv.org Artificial IntelligenceMar-22-2023

We launch EVA-02, a next-generation Transformer-based visual representation pre-trained to reconstruct strong and robust language-aligned vision features via masked image modeling. With an updated plain Transformer architecture as well as extensive pre-training from an open & accessible giant CLIP vision encoder, EVA-02 demonstrates superior performance compared to prior state-of-the-art approaches across various representative vision tasks, while utilizing significantly fewer parameters and compute budgets. Notably, using exclusively publicly accessible training data, EVA-02 with only 304M parameters achieves a phenomenal 90.0 fine-tuning top-1 accuracy on ImageNet-1K val set. Additionally, our EVA-02-CLIP can reach up to 80.4 zero-shot top-1 on ImageNet-1K, outperforming the previous largest & best open-sourced CLIP with only ~1/6 parameters and ~1/6 image-text training data. We offer four EVA-02 variants in various model sizes, ranging from 6M to 304M parameters, all with impressive performance. To facilitate open access and open research, we release the complete suite of EVA-02 to the community at https://github.com/baaivision/EVA/tree/master/EVA-02.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2303.11331

Country: Asia > China > Beijing > Beijing (0.04)

Genre:

Research Report > New Finding (0.67)
Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

SemDeDup: Data-efficient learning at web-scale through semantic deduplication

Abbas, Amro, Tirumala, Kushal, Simig, Dániel, Ganguli, Surya, Morcos, Ari S.

arXiv.org Artificial IntelligenceMar-22-2023

Progress in machine learning has been driven in large part by massive increases in data. However, large web-scale datasets such as LAION are largely uncurated beyond searches for exact duplicates, potentially leaving much redundancy. Here, we introduce SemDeDup, a method which leverages embeddings from pre-trained models to identify and remove semantic duplicates: data pairs which are semantically similar, but not exactly identical. Removing semantic duplicates preserves performance and speeds up learning. Analyzing a subset of LAION, we show that SemDeDup can remove 50% of the data with minimal performance loss, effectively halving training time. Moreover, performance increases out of distribution. Also, analyzing language models trained on C4, a partially curated dataset, we show that SemDeDup improves over prior approaches while providing efficiency gains. SemDeDup provides an example of how simple ways of leveraging quality embeddings can be used to make models learn faster with less data.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2303.0954

Country:

Asia > Middle East > Jordan (0.05)
North America > United States > Massachusetts (0.04)
North America > Dominican Republic (0.04)
(3 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.47)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

China's Answer to ChatGPT Flubs Its First Lines

WIREDMar-21-2023, 22:00:00 GMT

When rumors began swirling last month about the Chinese search giant Baidu working on a chatbot to rival OpenAI's ChatGPT, it seemed like the perfect move. Baidu has invested heavily in artificial intelligence over the past decade and could harness the technology for its leading search engine, as Microsoft has done for Bing and Google says it will do too. Yet when Baidu unveiled Ernie Bot, or 文心一言 "Wenxin Yiyan" in Chinese, in Beijing earlier this month, the news fell flat. Robin Li, Baidu's CEO, admitted halfway through the launch stream that demos of Ernie Bot answering general knowledge questions, summarizing information from the web, and generating images were prerecorded, leading to snarky commentary on Chinese social media. It didn't help that OpenAI had introduced a major upgrade, called GPT-4, to the AI technology that powers ChatGPT only the day before.

baidu, chatgpt, china, (6 more...)

WIRED

AI-Alerts: 2023 > 2023-03 > AAAI AI-Alert for Mar 22, 2023 (1.00)

Country: Asia > China > Beijing > Beijing (0.26)

Industry: Information Technology > Services (0.57)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.53)

Add feedback

Does ChatGPT have a character limit? Here's how to bypass it

#artificialintelligenceMar-21-2023, 20:26:25 GMT

Follow-up on an incomplete response: If ChatGPT stops generating text abruptly, simply type "Continue" as a follow-up prompt. You can also specify the last sentence and ask the chatbot to continue where it left off. Write a more descriptive prompt: If ChatGPT generated too little text and didn't get to reach its character limit, you will need to modify your prompt. Simply specify the number of words you want it to write. An example would be "Write a 500-word essay on climate change".

ask chatgpt, character limit, chatgpt, (15 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Bill Gates: AI is most important tech advance in decades

BBC NewsMar-21-2023, 19:56:43 GMT

In his blog, Mr Gates said he challenged the OpenAI team in 2022 to train an AI that can pass an Advanced Placement (AP) Biology exam - roughly equivalent to an A-level exam - with the strict rule that the AI could not be specifically trained to answer Biology questions.

bill gate, exam

BBC News

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.44)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.44)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.44)

Add feedback

Google Introduces Bard, a Collaborative Generative AI Experiment - Cyber Kendra

#artificialintelligenceMar-21-2023, 19:15:16 GMT

Bard can assist users in achieving their goals by giving them tips on how to read more books, explain quantum physics in simple terms, or create outlines for their blog posts. The AI tool is powered by a research large language model (LLM), which is a lightweight and optimized version of LaMDA. Bard will be updated with newer and more capable models over time. LLMs are prediction engines that generate responses by selecting one word at a time from words that are likely to come next. Picking the most probable choice every time would not lead to creative responses, so there's some flexibility factored in.

bard, collaborative generative ai experiment, google introduce bard, (2 more...)

#artificialintelligence

Country: North America > United States (0.07)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.40)

Add feedback

Bing A.I. and the Dawn of the Post-Search Internet

The New YorkerMar-21-2023, 19:06:54 GMT

Around a year ago, I wrote a column about users' growing frustration with Google Search, as automated summaries, sponsored content, and S.E.O.-tailored spam increasingly crowded out the kinds of useful Web site results that Googling was supposed to produce. Google's search algorithm wasn't directing us to the information that we wanted to find (for instance, in my case at the time, the elusive perfect toaster) so much as bombarding us with the half-baked recommendations of content mills. Yet Google Search has maintained its dominance partly out of habit and partly because no competing service has offered a viable alternative--until now. On February 7th, Microsoft began a beta launch of a version of its search engine, Bing, in the form of an A.I. chatbot, powered by GPT-4, the latest iteration of OpenAI's large-language model ChatGPT. Instead of directing users to external sites, the new Bing can simply generate its own answers to any query.

bing, google search, microsoft, (15 more...)

The New Yorker

Country:

North America > United States > New York (0.05)
Europe > Iceland (0.05)

Industry:

Media (0.72)
Information Technology > Services (0.58)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.90)

Add feedback

Microsoft boosts Word, Excel and PowerPoint with GPT-4 - Plugavel

#artificialintelligenceMar-21-2023, 18:42:53 GMT

Information had leaked at the beginning of the year that MicrosoftMicrosoft intended to integrateartificial intelligenceartificial intelligence in Office software. It is now done, and just like with Bing, the firm has used GPT-4, the new version of the large language model (LLM) behind ChatGPTChatGPT. Called Copilot, this AI is intended to assist in productivity and therefore differs significantly from ChatGPT. Copilot will be accessible from all applications Microsoft 365Microsoft 365. This includes including Word, Excel and PowerPoint, as well as Teams and Outlook.

copilot, excel and powerpoint, microsoft graph, (3 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)

Add feedback

Google Bard AI hands-on: A work in progress with plenty of caveats

EngadgetMar-21-2023, 17:09:56 GMT

Google has made Bard more widely available to users in the US and the UK today, and I have been spending some time with the company's chatbot to see how its generative AI compares to ChatGPT and Bing AI. Like we saw in the screenshots Google provided with today's announcement, the interface here is very similar to Bing AI in that there is a wide text input at the bottom of the screen and a dialogue-based layout. But there are a few key differences between Google's and Microsoft's offerings. With Bing AI, you'll have to either hit Chat or scroll up from search results to get to the conversation page, whereas you don't have to do that for the Bard website. Microsoft has a broom icon to the left of the input bar to clear the slate and start a new topic, while Google has a column on the left with options for "Reset chat," "Bard Activity," "FAQ and "Help & Support." It's also worth noting the language Google painstakingly uses here. Once I navigated to the website, I was greeted with an ...

bard, google, plank, (12 more...)

Engadget

Country: North America > United States (0.25)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.70)

Add feedback

How AI experts are using GPT-4

#artificialintelligenceMar-21-2023, 16:50:19 GMT

Unlike OpenAI's viral hit ChatGPT, which is freely accessible to the general public, GPT-4 is currently accessible only to developers. It's still early days for the tech, and it'll take a while for it to feed through into new products and services. Still, people are already testing its capabilities out in the open. Here are my top picks of the fun ways they're doing that. In an example that went viral on Twitter, Jackson Greathouse Fall, a brand designer, asked GPT-4 to make as much money as possible with an initial budget of $100.

ai expert, gpt-4, présentation, (2 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.59)

Add feedback