Goto

Collaborating Authors

 Generative AI


The End of Foreign-Language Education

The Atlantic - Technology

A few days ago, I watched a video of myself talking in perfect Chinese. I've been studying the language on and off for only a few years, and I'm far from fluent. But there I was, pronouncing each character flawlessly in the correct tone, just as a native speaker would. Gone were my grammar mistakes and awkward pauses, replaced by a smooth and slightly alien-sounding voice. "My favorite food is sushi," I said--wo zui xihuan de shiwu shi shousi--with no hint of excitement or joy.


The Download: Adobe's AI ambitions, and how work is changing

MIT Technology Review

Since the beginning of the generative AI boom, there has been a fight over how large AI models are trained. And in the other camp are artists who argue that AI companies have taken their intellectual property without consent or compensation. It released its image-generating model Firefly, which is integrated into its popular photo editing tool Photoshop, one year ago. In an exclusive interview with MIT Technology Review, Adobe's AI leaders are adamant this is the only way forward. At stake is not just the livelihood of creators, they say, but our whole information ecosystem.


How Adobe's bet on non-exploitative AI is paying off

MIT Technology Review

In an exclusive interview with MIT Technology Review, Adobe's AI leaders are adamant this is the only way forward. At stake is not just the livelihood of creators, they say, but our whole information ecosystem. What they have learned shows that building responsible tech doesn't have to come at the cost of doing business. "We worry that the industry, Silicon Valley in particular, does not pause to ask the'how' or the'why.' Just because you can build something doesn't mean you should build it without consideration of the impact that you're creating," says David Wadhwani, president of Adobe's digital media business.


Google, Qualcomm and Intel launch bid to break Nvidia's grip on AI

The Japan Times

Nvidia earned its 2.2 trillion market cap by producing artificial-intelligence chips that have become the lifeblood powering the new era of generative AI developers from startups to Microsoft, OpenAI and Google parent Alphabet. Almost as important to its hardware is the company's nearly 20 years' worth of computer code, which helps make competition with the company nearly impossible. More than 4 million global developers rely on Nvidia's CUDA software platform to build AI and other apps. Now a coalition of technology companies that includes Qualcomm, Google and Intel plans to loosen Nvidia's chokehold by going after the chip giant's secret weapon: the software that keeps developers tied to Nvidia chips. They are part of an expanding group of financiers and companies hacking away at Nvidia's dominance in AI.


Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization

arXiv.org Artificial Intelligence

Large language models (LLM), such as Google's Minerva and OpenAI's GPT families, are becoming increasingly capable of solving mathematical quantitative reasoning problems. However, they still make unjustified logical and computational errors in their reasoning steps and answers. In this paper, we leverage the fact that if the training corpus of LLMs contained sufficiently many examples of formal mathematics (e.g. in Isabelle, a formal theorem proving environment), they can be prompted to translate i.e. autoformalize informal mathematical statements into formal Isabelle code -- which can be verified automatically for internal consistency. This provides a mechanism to automatically reject solutions whose formalized versions are inconsistent within themselves or with the formalized problem statement. We evaluate our method on GSM8K, MATH and MultiArith datasets and demonstrate that our approach provides a consistently better heuristic than vanilla majority voting -- the previously best method to identify correct answers, by more than 12% on GSM8K. In our experiments it improves results consistently across all datasets and LLM model sizes. The code can be found at https://github.com/jinpz/dtv. Recently, language models (Devlin et al., 2018; Brown et al., 2020; Chowdhery et al., 2022) have advanced significantly in many natural language processing tasks such as machine translation, question answering, summarization, etc. More recent large language models (LLMs) such as Minerva (Lewkowycz et al., 2022), GPT3.5 (OpenAI) and GPT4 (OpenAI, 2023) have also become increasingly capable of solving quantitative reasoning problems, ranging from middle school math word problems (Cobbe et al., 2021) to challenging high school mathematical competition problems (Hendrycks et al., 2021). By training or finetuning the model on high-quality natural language mathematical and scientific text, these LLMs can generate self-contained step-by-step solutions to quantitative reasoning problems without relying on external tools. However, just like human beings, the solutions LLMs generate are prone to simple calculation errors and unjustified logical leaps. Following Wang et al. (2022); Lewkowycz et al. (2022), one can sample many proposed solutions, extract the final answer from each, and select the most common answer. While aggregating answers like this improves performance at the problem level, the most common answer is sometimes wrong. Ideally, we would like a better heuristic to identify the correct answer.


Practical Applications of Advanced Cloud Services and Generative AI Systems in Medical Image Analysis

arXiv.org Artificial Intelligence

The medical field is one of the important fields in the application of artificial intelligence technology. With the explosive growth and diversification of medical data, as well as the continuous improvement of medical needs and challenges, artificial intelligence technology is playing an increasingly important role in the medical field. Artificial intelligence technologies represented by computer vision, natural language processing, and machine learning have been widely penetrated into diverse scenarios such as medical imaging, health management, medical information, and drug research and development, and have become an important driving force for improving the level and quality of medical services.The article explores the transformative potential of generative AI in medical imaging, emphasizing its ability to generate syntheticACM-2 data, enhance images, aid in anomaly detection, and facilitate image-to-image translation. Despite challenges like model complexity, the applications of generative models in healthcare, including Med-PaLM 2 technology, show promising results. By addressing limitations in dataset size and diversity, these models contribute to more accurate diagnoses and improved patient outcomes. However, ethical considerations and collaboration among stakeholders are essential for responsible implementation. Through experiments leveraging GANs to augment brain tumor MRI datasets, the study demonstrates how generative AI can enhance image quality and diversity, ultimately advancing medical diagnostics and patient care.


The artificial intelligence experts who believe the AI boom could fizzle or even be a new dotcom crash: 'We are starting to see signs it might be a dud'

Daily Mail - Science & tech

Generative AI has been predicted to add trillions to the world economy in a productivity boost never before seen in history (if it doesn't wipe out humanity first). A growing number of sceptics, including some leading AI scientists, are wondering whether the tech might not deliver on its promises to boost the world economy. Goldman Sachs famously predicted that generative AI would bring about'sweeping changes' to the world economy, driving a 7 trillion increase in global GDP and lifting productivity growth by 1.5 percent this decade. Professor Gary Marcus of New York University wrote on Substack that'we are starting to see signs' that generative AI might be a'dud'. Among the warning signs was a report in the Wall Street Journal suggesting that customers found the 30 a month price of Microsoft's new AI-boosted Copilot software too expensive.


The Download: defining open source AI, and replacing Siri

MIT Technology Review

Suddenly, "open source" is the latest buzzword in AI circles. Meta has pledged to create open-source artificial general intelligence. And Elon Musk is suing OpenAI over its lack of open-source AI models. Meanwhile, a growing number of tech leaders and companies are setting themselves up as open-source champions. But there's a fundamental problem--no one can agree on what "open-source AI" means.


Learn how to leverage AI in your career with this 30 e-degree

PCWorld

As artificial intelligence continues to advance, there are more and more consumer-friendly tools at your disposal. Still, each has a learning curve, and if you really want to leverage AI to enhance your creativity and increase your productivity, The 2024 Complete ChatGPT & Gemini AI Advanced E-Degree can help. This course is led by Eduonix Learning Solutions (4.0/5-star instructor rating), a leader in online education. Here, you'll get ten hours of hands-on content that investigates a range of generative AI tools. You'll explore creative tools like ChatGPT, Gemini AI, GPT 3.5, GPT 4, Bard, DALL-E 2, and more as you learn how to leverage these tools to boost your productivity in text, image, video, and audio formats.


Iso-Diffusion: Improving Diffusion Probabilistic Models Using the Isotropy of the Additive Gaussian Noise

arXiv.org Artificial Intelligence

Denoising Diffusion Probabilistic Models (DDPMs) have accomplished much in the realm of generative AI. Despite their high performance, there is room for improvement, especially in terms of sample fidelity by utilizing statistical properties that impose structural integrity, such as isotropy. Minimizing the mean squared error between the additive and predicted noise alone does not impose constraints on the predicted noise to be isotropic. Thus, we were motivated to utilize the isotropy of the additive noise as a constraint on the objective function to enhance the fidelity of DDPMs. Our approach is simple and can be applied to any DDPM variant. We validate our approach by presenting experiments conducted on four synthetic 2D datasets as well as on unconditional image generation. As demonstrated by the results, the incorporation of this constraint improves the fidelity metrics, Precision and Density for the 2D datasets as well as for the unconditional image generation.