 Generative AI


Diffusion Models in Vision: A Survey

arXiv.org Artificial Intelligence

Denoising diffusion models represent a recently emerging topic in computer vision, demonstrating remarkable results in the area of generative modeling. A diffusion model is a deep generative model that is based on two stages, a forward diffusion stage and a reverse diffusion stage. In the forward diffusion stage, the input data is gradually perturbed over several steps by adding Gaussian noise. In the reverse stage, a model is tasked with recovering the original input data by learning to gradually reverse the diffusion process, step by step. Diffusion models are widely appreciated for the quality and diversity of the generated samples, despite their known computational burdens, i.e., low speeds due to the high number of steps involved during sampling. In this survey, we provide a comprehensive review of articles on denoising diffusion models applied in vision, comprising both theoretical and practical contributions in the field. First, we identify and present three generic diffusion modeling frameworks, which are based on denoising diffusion probabilistic models, noise conditioned score networks, and stochastic differential equations. We further discuss the relations between diffusion models and other deep generative models, including variational auto-encoders, generative adversarial networks, energy-based models, autoregressive models and normalizing flows. Then, we introduce a multi-perspective categorization of diffusion models applied in computer vision. Finally, we illustrate the current limitations of diffusion models and envision some interesting directions for future research.
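The forward diffusion stage described in the abstract can be illustrated with a minimal sketch. It assumes the update rule commonly used in denoising diffusion probabilistic models, x_t = sqrt(1 - beta_t) * x_{t-1} + sqrt(beta_t) * eps, and a linear noise schedule; the function names and the default schedule values are hypothetical choices for illustration, not taken from the survey.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Evenly spaced noise variances beta_1..beta_T (an assumed schedule)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def forward_diffusion(x0, betas, rng):
    """Return the trajectory [x_0, x_1, ..., x_T] of the forward process.

    Each step shrinks the previous sample slightly and adds Gaussian noise:
    x_t = sqrt(1 - beta_t) * x_{t-1} + sqrt(beta_t) * eps, eps ~ N(0, 1).
    """
    trajectory = [list(x0)]
    for beta in betas:
        prev = trajectory[-1]
        nxt = [
            math.sqrt(1.0 - beta) * v + math.sqrt(beta) * rng.gauss(0.0, 1.0)
            for v in prev
        ]
        trajectory.append(nxt)
    return trajectory


def signal_fraction(betas):
    """Product of sqrt(1 - beta_t): the share of x_0 left in the mean of x_T."""
    frac = 1.0
    for beta in betas:
        frac *= math.sqrt(1.0 - beta)
    return frac
```

Calling `forward_diffusion([1.0, -1.0, 0.5], linear_beta_schedule(1000), random.Random(0))` yields 1001 samples, and `signal_fraction` shows that almost none of the original input survives after many steps. This is why the reverse stage has to be learned, and why sampling over so many steps is slow.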


ChatGPT gets banned in Italy as the fight against AI begins - AIVAnet

#artificialintelligence

ChatGPT has been temporarily banned in Italy due to privacy concerns and faces a Federal Trade Commission (FTC) complaint in the U.S. that calls for new releases of ChatGPT to be halted. According to the Associated Press, the Italian Data Protection Authority will maintain the ban "until ChatGPT respects privacy." The problem with user data being visible to others during ChatGPT's March 20 outage was mentioned as the reason for this action. No details were shared about how this ban would be enforced or whether it would affect OpenAI partners that use ChatGPT, such as Microsoft's Bing Chat.


Cybersecurity experts argue that pausing GPT-4 development is pointless

#artificialintelligence

Join top executives in San Francisco on July 11-12, to hear how leaders are integrating and optimizing AI investments for success. Earlier this week, a group of more than 1,800 artificial intelligence (AI) leaders and technologists ranging from Elon Musk to Steve Wozniak issued an open letter calling on all AI labs to immediately pause development for six months on AI systems more powerful than GPT-4 due to "profound risks to society and humanity." While a pause could serve to help better understand and regulate the societal risks created by generative AI, some argue that it's also an attempt for lagging competitors to catch up on AI research with leaders in the space like OpenAI. According to Gartner distinguished VP analyst Avivah Litan, who spoke with VentureBeat about the issue, "The six-month pause is a plea to stop the training of models more powerful than GPT-4. GPT 4.5 will soon be followed by GPT-5, which is expected to achieve AGI (artificial general intelligence). Once AGI arrives, it will likely be too late to institute safety controls that effectively guard human use of these systems."


Diffusion Models in AI: Everything You Need to Know

#artificialintelligence

In the AI ecosystem, diffusion models are setting the direction and pace of technological advancement. They are revolutionizing the way we approach complex generative AI tasks. These models are based on the mathematics of Gaussian principles, variance, differential equations, and generative sequences. Modern AI-centric products and solutions developed by Nvidia, Google, Adobe, and OpenAI have put diffusion models in the limelight. DALL·E 2, Stable Diffusion, and Midjourney are prominent examples of diffusion models that have been making the rounds on the internet recently.


Stability AI CEO Hints At IPO Plans, Calls For Transparency In AI Governance - Tcitnews

#artificialintelligence

During the Cerebral Valley AI Conference in San Francisco on Thursday, Emad Mostaque, the CEO and founder of Stability AI, revealed that he plans to take the open-source platform public within the next few years. Stability AI is a leader in generative artificial intelligence and an OpenAI rival. Mostaque dismissed the idea of Stability AI being acquired. He explained that going public requires having amazing revenue, margins, and distribution. The company is only 17 months old, and the business model of Stability AI's open-source platform will be seen more clearly in the next year.


The Digital Insider

#artificialintelligence

In recent months, Mr. Altman has done more than anyone else to usher in this future--and commercialize it. OpenAI, the company he leads, in November released ChatGPT, the chatbot with an uncanny ability to produce humanlike writing that has become one of the most viral products in the history of technology. In the process, OpenAI went from a small nonprofit into a multibillion-dollar company, at near record speed, thanks in part to the launch of a for-profit arm that enabled it to raise $13 billion from Microsoft Corp., according to investor documents. This success has come as part of a delicate balancing act. Mr. Altman said he fears what could happen if AI is rolled out into society recklessly. He co-founded OpenAI eight years ago as a research nonprofit, arguing that it's uniquely dangerous to have profits be the main driver of developing powerful AI models. He is so wary of profit as an incentive in AI development that he has taken no direct financial stake in the business he built, he said--an anomaly in Silicon Valley, where founders of successful startups typically get rich off their equity.


The Contradictions of Sam Altman, AI Crusader

WSJ.com: WSJD - Technology

Sam Altman, the 37-year-old startup-minting guru at the forefront of the artificial intelligence boom, has long dreamed of a future in which computers could converse and learn like humans. One of his clearest childhood memories is sitting up late in his bedroom in suburban St. Louis, playing with the Macintosh LC II he had gotten for his eighth birthday when he had the sudden realization: "Someday, the computer was going to learn to think," he said.


As Google, Microsoft, Facebook, and the rest of big tech compete in generative AI, tying up content publishers will be critical for their LLMs. Here are 21 licensing targets they should lock up - CB Insights Research

#artificialintelligence

Large language models (LLMs) trained on more text will generally be superior to those trained on less. As a result, expect publishers with valuable text content to become a licensing battleground for LLM makers and for language acquisition costs (LAC) to become a real expense. Google is said to pay $15B per year to be the default search engine on Apple devices. These traffic acquisition costs (TAC) are in aggregate over $50B a year for Google -- but the exclusivity gained is critical for Google to cement its search lead. With the battle for LLM supremacy underway, exclusive access to language/text will also become critical.


OpenAI's GPT-4 violates FTC rules, argues AI policy group

#artificialintelligence

The Federal Trade Commission (FTC) received a new complaint today from the Center for AI and Digital Policy (CAIDP), which calls for an investigation of OpenAI and its product GPT-4. The complaint argues that the FTC has declared that the use of AI should be "transparent, explainable, fair, and empirically sound while fostering accountability," but claims that OpenAI's GPT-4 "satisfies none of these requirements" and is "biased, deceptive, and a risk to privacy and public safety." CAIDP is a Washington, D.C.-based independent, nonprofit research organization that "assesses national AI policies and practices, trains AI policy leaders, and promotes democratic values for AI." It is headed by president and founder Marc Rotenberg and senior research director Merve Hickok.


Microsoft Adds GPT-4 to its Defensive Suite in Security Copilot

#artificialintelligence

AI hands are reaching further into the tech industry. Microsoft has added Security Copilot, a natural language chatbot that can write and analyze code, to its suite of products enabled by OpenAI's GPT-4 generative AI model. Security Copilot, which was announced on Wednesday, is now in preview for select customers. Microsoft will release more information through its email updates about when Security Copilot might become generally available. Microsoft Security Copilot is a natural language artificial intelligence data set that will appear as a prompt bar.