Large Language Model
These Were Our Favorite Tech Stories ... :: Human Robots#
This time last year we were commemorating the end of a decade and looking ahead to the next one. Enter the year that felt like a decade all by itself: 2020. News written in January, the before-times, feels hopelessly out of touch with all that came after. Stories published in the early days of the pandemic are, for the most part, similarly naive. The year’s news cycle was swift and brutal, ping-ponging from pandemic to extreme social and political tension, whipsawing economies, and natural disasters. Hope. Despair. Loneliness. Grief. Grit. More hope. Another lockdown. It’s been a hell of a year. Though 2020 was dominated by big, hairy societal change, science and technology took significant steps forward. Researchers singularly focused on the pandemic and collaborated on solutions to a degree never before seen. New technologies converged to deliver vaccines in record time. The dark side of tech, from biased algorithms to the threat of omnipresent surveillance and corporate control of artificial intelligence, continued to rear its head. Meanwhile, AI showed uncanny command of language, joined Reddit threads, and made inroads into some of science’s grandest challenges. Mars rockets flew for the first time, and a private company delivered astronauts to the International Space Station. Deprived of night life, concerts, and festivals, millions traveled to virtual worlds instead. Anonymous jet packs flew over LA. Mysterious monoliths appeared and disappeared worldwide. It was all, you know, very 2020. For this year’s (in-no-way-all-encompassing) list of fascinating stories in tech and science, we tried to select those that weren’t totally dated by the news, but rose above it in some way. So, without further ado: This year’s picks. How Science Beat the Virus Ed Yong | The Atlantic “Much like famous initiatives such as the Manhattan Project and the Apollo program, epidemics focus the energies of large groups of scientists. …But ‘nothing in history was even close to the level of pivoting that’s happening right now,’ Madhukar Pai of McGill University told me. … No other disease has been scrutinized so intensely, by so much combined intellect, in so brief a time.” ‘It Will Change Everything’: DeepMind’s AI Makes Gigantic Leap in Solving Protein Structures Ewen Callaway | Nature “In some cases, AlphaFold’s structure predictions were indistinguishable from those determined using ‘gold standard’ experimental methods such as X-ray crystallography and, in recent years, cryo-electron microscopy (cryo-EM). AlphaFold might not obviate the need for these laborious and expensive methods—yet—say scientists, but the AI will make it possible to study living things in new ways.” OpenAI’s Latest Breakthrough Is Astonishingly Powerful, But Still Fighting Its Flaws James Vincent | The Verge “What makes GPT-3 amazing, they say, is not that it can tell you that the capital of Paraguay is Asunción (it is) or that 466 times 23.5 is 10,987 (it’s not), but that it’s capable of answering both questions and many more beside simply because it was trained on more data for longer than other programs. If there’s one thing we know that the world is creating more and more of, it’s data and computing power, which means GPT-3’s descendants are only going to get more clever.” Artificial General Intelligence: Are We Close, and Does It Even Make Sense to Try? Will Douglas Heaven | MIT Technology Review “A machine that could think like a person has been the guiding vision of AI research since the earliest days—and remains its most divisive idea. …So why is AGI controversial? Why does it matter? And is it a reckless, misleading dream—or the ultimate goal?” The Dark Side of Big Tech’s Funding for AI Research Tom Simonite | Wired “Timnit Gebru’s exit from Google is a powerful reminder of how thoroughly companies dominate the field, with the biggest computers and the most resources. …[Meredith] Whittaker of AI Now says properly probing the societal effects of AI is fundamentally incompatible with corporate labs. ‘That kind of research that looks at the power and politics of AI is and must be inherently adversarial to the firms that are profiting from this technology.’i” We’re Not Prepared for the End of Moore’s Law David Rotman | MIT Technology Review “Quantum computing, carbon nanotube transistors, even spintronics, are enticing possibilities—but none are obvious replacements for the promise that Gordon Moore first saw in a simple integrated circuit. We need the research investments now to find out, though. Because one prediction is pretty much certain to come true: we’re always going to want more computing power.” Inside the Race to Build the Best Quantum Computer on Earth Gideon Lichfield | MIT Technology Review “Regardless of whether you agree with Google’s position [on ‘quantum supremacy’] or IBM’s, the next goal is clear, Oliver says: to build a quantum computer that can do something useful. …The trouble is that it’s nearly impossible to predict what the first useful task will be, or how big a computer will be needed to perform it.” The Secretive Company That Might End Privacy as We Know It Kashmir Hill | The New York Times “Searching someone by face could become as easy as Googling a name. Strangers would be able to listen in on sensitive conversations, take photos of the participants and know personal secrets. Someone walking down the street would be immediately identifiable—and his or her home address would be only a few clicks away. It would herald the end of public anonymity.” Wrongfully Accused by an Algorithm Kashmir Hill | The New York Times “Mr. Williams knew that he had not committed the crime in question. What he could not have known, as he sat in the interrogation room, is that his case may be the first known account of an American being wrongfully arrested based on a flawed match from a facial recognition algorithm, according to experts on technology and the law.” Predictive Policing Algorithms Are Racist. They Need to Be Dismantled. Will Douglas Heaven | MIT Technology Review “A number of studies have shown that these tools perpetuate systemic racism, and yet we still know very little about how they work, who is using them, and for what purpose. All of this needs to change before a proper reckoning can take pace. Luckily, the tide may be turning.” The Panopticon Is Already Here Ross Andersen | The Atlantic “Artificial intelligence has applications in nearly every human domain, from the instant translation of spoken language to early viral-outbreak detection. But Xi [Jinping] also wants to use AI’s awesome analytical powers to push China to the cutting edge of surveillance. He wants to build an all-seeing digital system of social control, patrolled by precog algorithms that identify potential dissenters in real time.” The Case For Cities That Aren’t Dystopian Surveillance States Cory Doctorow | The Guardian “Imagine a human-centered smart city that knows everything it can about things. It knows how many seats are free on every bus, it knows how busy every road is, it knows where there are short-hire bikes available and where there are potholes. …What it doesn’t know is anything about individuals in the city.” The Modern World Has Finally Become Too Complex for Any of Us to Understand Tim Maughan | OneZero “One of the dominant themes of the last few years is that nothing makes sense. …I am here to tell you that the reason so much of the world seems incomprehensible is that it is incomprehensible. From social media to the global economy to supply chains, our lives rest precariously on systems that have become so complex, and we have yielded so much of it to technologies and autonomous actors that no one totally comprehends it all.” The Conscience of Silicon Valley Zach Baron | GQ “What I really hoped to do, I said, was to talk about the future and how to live in it. This year feels like a crossroads; I do not need to explain what I mean by this. …I want to destroy my computer, through which I now work and ‘have drinks’ and stare at blurry simulations of my parents sometimes; I want to kneel down and pray to it like a god. I want someone—I want Jaron Lanier—to tell me where we’re going, and whether it’s going to be okay when we get there. Lanier just nodded. All right, then.” Yes to Tech Optimism. And Pessimism. Shira Ovide | The New York Times “Technology is not something that exists in a bubble; it is a phenomenon that changes how we live or how our world works in ways that help and hurt. That calls for more humility and bridges across the optimism-pessimism divide from people who make technology, those of us who write about it, government officials and the public. We need to think on the bright side. And we need to consider the horribles.” How Afrofuturism Can Help the World Mend C. Brandon Ogbunu | Wired “…[W. E. B. DuBois’] ‘The Comet’ helped lay the foundation for a paradigm known as Afrofuturism. A century later, as a comet carrying disease and social unrest has upended the world, Afrofuturism may be more relevant than ever. Its vision can help guide us out of the rubble, and help us to consider universes of better alternatives.” Wikipedia Is the Last Best Place on the Internet Richard Cooke | Wired “More than an encyclopedia, Wikipedia has become a community, a library, a constitution, an experiment, a political manifesto—the closest thing there is to an online public square. It is one of the few remaining places that retains the faintly utopian glow of the early World Wide Web.” Can Genetic Engineering Bring Back the American Chestnut? Gabriel Popkin | The New York Times Magazine “The geneticists’ research forces conservationists to confront, in a new and sometimes discomfiting way, the prospect that repairing the natural world does not necessarily mean returning to an unblemished Eden. It may instead mean embracing a role that we’ve already assumed: engineers of everything, including nature.” At the Limits of Thought David C. Krakauer | Aeon “A schism is emerging in the scientific enterprise. On the one side is the human mind, the source of every story, theory, and explanation that our species holds dear. On the other stand the machines, whose algorithms possess astonishing predictive power but whose inner workings remain radically opaque to human observers.” Is the Internet Conscious? If It Were, How Would We Know? Meghan O’Gieblyn | Wired “Does the internet behave like a creature with an internal life? Does it manifest the fruits of consciousness? There are certainly moments when it seems to. Google can anticipate what you’re going to type before you fully articulate it to yourself. Facebook ads can intuit that a woman is pregnant before she tells her family and friends. It is easy, in such moments, to conclude that you’re in the presence of another mind—though given the human tendency to anthropomorphize, we should be wary of quick conclusions.” The Internet Is an Amnesia Machine Simon Pitt | OneZero “There was a time when I didn’t know what a Baby Yoda was. Then there was a time I couldn’t go online without reading about Baby Yoda. And now, Baby Yoda is a distant, shrugging memory. Soon there will be a generation of people who missed the whole thing and for whom Baby Yoda is as meaningless as it was for me a year ago.” Digital Pregnancy Tests Are Almost as Powerful as the Original IBM PC Tom Warren | The Verge “Each test, which costs less than $5, includes a processor, RAM, a button cell battery, and a tiny LCD screen to display the result. …Foone speculates that this device is ‘probably faster at number crunching and basic I/O than the CPU used in the original IBM PC.’ IBM’s original PC was based on Intel’s 8088 microprocessor, an 8-bit chip that operated at 5Mhz. The difference here is that this is a pregnancy test you pee on and then throw away.” The Party Goes on in Massive Online Worlds Cecilia D’Anastasio | Wired “We’re more stand-outside types than the types to cast a flashy glamour spell and chat up the nearest cat girl. But, hey, it’s Final Fantasy XIV online, and where my body sat in New York, the epicenter of America’s Covid-19 outbreak, there certainly weren’t any parties.” The Facebook Groups Where People Pretend the Pandemic Isn’t Happening Kaitlyn Tiffany | The Atlantic “Losing track of a friend in a packed bar or screaming to be heard over a live band is not something that’s happening much in the real world at the moment, but it happens all the time in the 2,100-person Facebook group ‘a group where we all pretend we’re in the same venue.’ So does losing shoes and Juul pods, and shouting matches over which bands are the saddest, and therefore the greatest.” Did You Fly a Jetpack Over Los Angeles This Weekend? Because the FBI Is Looking for You Tom McKay | Gizmodo “Did you fly a jetpack over Los Angeles at approximately 3,000 feet on Sunday? Some kind of tiny helicopter? Maybe a lawn chair with balloons tied to it? If the answer to any of the above questions is ‘yes,’ you should probably lay low for a while (by which I mean cool it on the single-occupant flying machine). That’s because passing airline pilots spotted you, and now it’s this whole thing with the FBI and the Federal Aviation Administration, both of which are investigating.” Image Credit: Thomas Kinto / Unsplash Continue reading →
CIZSL++: Creativity Inspired Generative Zero-Shot Learning
Elhoseiny, Mohamed, Yi, Kai, Elfeki, Mohamed
Zero-shot learning (ZSL) aims at understanding unseen categories with no training examples from class-level descriptions. To improve the discriminative power of ZSL, we model the visual learning process of unseen categories with inspiration from the psychology of human creativity for producing novel art. First, we propose CIZSL-v1 as a creativity inspired model for generative ZSL. We relate ZSL to human creativity by observing that ZSL is about recognizing the unseen, and creativity is about creating a likable unseen. We introduce a learning signal inspired by creativity literature that explores the unseen space with hallucinated class-descriptions and encourages careful deviation of their visual feature generations from seen classes while allowing knowledge transfer from seen to unseen classes. Second, CIZSL-v2 is proposed as an improved version of CIZSL-v1 for generative zero-shot learning. CIZSL-v2 consists of an investigation of additional inductive losses for unseen classes along with a semantic guided discriminator. Empirically, we show consistently that CIZSL losses can improve generative ZSL models on the challenging task of generalized ZSL from a noisy text on CUB and NABirds datasets. We also show the advantage of our approach to Attribute-based ZSL on AwA2, aPY, and SUN datasets. We also show that CIZSL-v2 has improved performance compared to CIZSL-v1.
Timnit Gebru's Exit From Google Exposes a Crisis in AI
Alex Hanna is a sociologist and senior research scientist on the Ethical AI team at Google. Meredith Whittaker is the Minderoo Research Professor at NYU, the faculty director of the AI Now Institute at NYU, and a long-time tech worker who helped lead labor organizing at Google. This year has held many things, among them bold claims of artificial intelligence breakthroughs. Industry commentators speculated that the language-generation model GPT-3 may have achieved "artificial general intelligence," while others lauded Alphabet subsidiary DeepMind's protein-folding algorithm--Alphafold--and its capacity to "transform biology." While the basis of such claims is thinner than the effusive headlines, this hasn't done much to dampen enthusiasm across the industry, whose profits and prestige are dependent on AI's proliferation.
The Future is Here! Have You Checked OpenAI's GPT-3 Yet?
Ever wonder how close has AI gotten to impersonating human beings? The latest GPT-3 can code computer programs, compose tweets, summarize emails, write news, answer questions, translate languages, and write fiction and poetry too. It can take up almost any virtual English language task. In the latest recent milestone on YouTube, it created an app that functions similar to Instagram. Dubbed as one of the most important advancements in AI in recent years, GPT-3 or Generative Pre-Trained Transformer 3 has raised the AI goal posts many notches toward the stratosphere.
EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets
Chen, Xiaohan, Cheng, Yu, Wang, Shuohang, Gan, Zhe, Wang, Zhangyang, Liu, Jingjing
Deep, heavily overparameterized language models such as BERT, XLNet and T5 have achieved impressive success in many natural language processing (NLP) tasks. However, their high model complexity requires enormous computation resources and extremely long training time for both pre-training and fine-tuning. Many works have studied model compression on large NLP models, but only focusing on reducing inference time while still requiring an expensive training process. Other works use extremely large batch sizes to shorten the pre-training time, at the expense of higher computational resource demands. In this paper, inspired by the Early-Bird Lottery Tickets recently studied for computer vision tasks, we propose EarlyBERT, a general computationally-efficient training algorithm applicable to both pre-training and fine-tuning of large-scale language models. By slimming the self-attention and fully-connected sub-layers inside a transformer, we are the first to identify structured winning tickets in the early stage of BERT training. We apply those tickets towards efficient BERT training, and conduct comprehensive pre-training and fine-tuning experiments on GLUE and SQuAD downstream tasks. Our results show that EarlyBERT achieves comparable performance to standard BERT, with 35 45% less training time. Large-scale pre-trained language models (e.g., BERT (Devlin et al., 2018), XLNet (Yang et al., 2019), T5 (Raffel et al., 2019)) have significantly advanced the state of the art in the NLP field.
Directed Beam Search: Plug-and-Play Lexically Constrained Language Generation
Pascual, Damian, Egressy, Beni, Bolli, Florian, Wattenhofer, Roger
Large pre-trained language models are capable of generating realistic text. However, controlling these models so that the generated text satisfies lexical constraints, i.e., contains specific words, is a challenging problem. Given that state-of-the-art language models are too large to be trained from scratch in a manageable time, it is desirable to control these models without re-training them. Methods capable of doing this are called plug-and-play. Recent plug-and-play methods have been successful in constraining small bidirectional language models as well as forward models in tasks with a restricted search space, e.g., machine translation. However, controlling large transformer-based models to meet lexical constraints without re-training them remains a challenge. In this work, we propose Directed Beam Search (DBS), a plug-and-play method for lexically constrained language generation. Our method can be applied to any language model, is easy to implement and can be used for general language generation. In our experiments we use DBS to control GPT-2. We demonstrate its performance on keyword-to-phrase generation and we obtain comparable results as a state-of-the-art non-plug-and-play model for lexically constrained story generation.
DeepMind's MuZero Masters Games Even Without Learning The Rules
If you want to be great at games, consider paying attention to Albert Einstein’s advice, which is to learn the rules of the game, and then play better than anyone else. Of course, this is easier said than done. But if you’re an AI like DeepMind, then it would be much easier for you — so much easier that you can even skip Einstein’s first advice.DeepMind, a subsidiary of Alphabet, has previously made groundbreaking strides using reinforcement le...
Artificial intelligence in 2020: the AIhub roundup
As 2020 draws to a close we look back on some of the notable research developments, awards, conferences and policy in the world of artificial intelligence. In February it was reported that MIT researchers had used a machine-learning algorithm to identify a powerful new antibiotic compound. In laboratory tests, the drug, called halicin, killed many disease-causing bacteria, including some strains that had been resistant to all existing antibiotics. Progress in the field of AI in healthcare has continued apace during 2020. A lot of this work is about providing clinicians with extra tools in their armoury.
Top Machine Learning Research Papers Released In 2020
It has been only two weeks into the last month of the year and arxiv.org, the popular repository for ML research papers has already witnessed close to 600 uploads. This should give one the idea of the pace at which machine learning research is proceeding; however, keeping track of all these research work is almost impossible. Every year, the research that gets maximum noise is usually from companies like Google and Facebook; from top universities like MIT; from research labs and most importantly from the conferences like NeurIPS or ACL. In this article, we have compiled a list of interesting machine learning research work that has made some noise this year. This is the seminal paper that introduced the most popular ML model of the year -- GPT-3.
10 AI Predictions For 2021
Prediction #6: The U.S. federal government will adopt a more proactive policy approach to AI in 2021 ... [ ] under President Biden. Below are 10 bold predictions about what will unfold in the world of artificial intelligence in 2021, from academic research to startups to capital markets to regulation. To keep ourselves honest, we will revisit these predictions in December 2021 to grade how we did. Autonomous vehicle developers like Waymo and Cruise have massive ongoing cash needs. Public market investors are thirsty for IPOs.