Personal
Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction
Hu, Xuming, Meng, Shiao, Zhang, Chenwei, Yang, Xiangli, Wen, Lijie, King, Irwin, Yu, Philip S.
Abstract--Information Extraction (IE) aims to extract structured information from heterogeneous sources. IE from natural language texts include sub-tasks such as Named Entity Recognition (NER), Relation Extraction (RE), and Event Extraction (EE). Most IE systems require comprehensive understandings of sentence structure, implied semantics, and domain knowledge to perform well; thus, IE tasks always need adequate external resources and annotations. However, it takes time and effort to obtain more human annotations. Low-Resource Information Extraction (LRIE) strives to use unsupervised data, reducing the required resources and human annotation. In practice, existing systems either utilize self-training schemes to generate pseudo labels that will cause the gradual drift problem, or leverage consistency regularization methods which inevitably possess confirmation bias. To alleviate confirmation bias due to the lack of feedback loops in existing LRIE learning paradigms, we develop a Gradient Imitation Reinforcement Learning (GIRL) method to encourage pseudo-labeled data to imitate the gradient descent direction on labeled data, which can force pseudo-labeled data to achieve better optimization capabilities similar to labeled data. Based on how well the pseudo-labeled data imitates the instructive gradient descent direction obtained from labeled data, we design a reward to quantify the imitation process and bootstrap the optimization capability of pseudo-labeled data through trial and error. In addition to learning paradigms, GIRL is not limited to specific sub-tasks, and we leverage GIRL to solve all IE sub-tasks (named entity recognition, relation extraction, and event extraction) in low-resource settings (semi-supervised IE and few-shot IE).
On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
Korbak, Tomasz, Elsahar, Hady, Kruszewski, Germรกn, Dymetman, Marc
The availability of large pre-trained models is changing the landscape of Machine Learning research and practice, moving from a training-from-scratch to a fine-tuning paradigm. While in some applications the goal is to "nudge" the pre-trained distribution towards preferred outputs, in others it is to steer it towards a different distribution over the sample space. Two main paradigms have emerged to tackle this challenge: Reward Maximization (RM) and, more recently, Distribution Matching (DM). RM applies standard Reinforcement Learning (RL) techniques, such as Policy Gradients, to gradually increase the reward signal. DM prescribes to first make explicit the target distribution that the model is fine-tuned to approximate. Here we explore the theoretical connections between the two paradigms, and show that methods such as KL-control developed for RM can also be construed as belonging to DM. We further observe that while DM differs from RM, it can suffer from similar training difficulties, such as high gradient variance. We leverage connections between the two paradigms to import the concept of baseline into DM methods. We empirically validate the benefits of adding a baseline on an array of controllable language generation tasks such as constraining topic, sentiment, and gender distributions in texts sampled from a language model. We observe superior performance in terms of constraint satisfaction, stability and sample efficiency.
AI and data analytics may not be as complicated as it seems
Artificial Intelligence (AI) is built on data. Yet, many organizations are still finding it hard to implement AI properly to make the most out of their data. There are concerns that the AI is not able to comprehend the data the way they want it to, especially with more businesses having their data stored across the multi-cloud and even on-premise. When it comes to data analytics, SAS has been a household vendor in the industry for years. The data analytics leader continues to pioneer new innovations when it comes to providing businesses with the insights they need in the best way possible.
DeviantArt provides a way for artists to opt out of AI art generators
DeviantArt, the Wix-owned artist community, today announced a new protection for creators to disallow art-generating AI systems from being developed using their artwork. An option on the site will allow artists to preclude third parties from scraping their content for AI development purposes, aiming to prevent work from being swept up without artists' knowledge or permission. "AI technology for creation is a powerful force we can't ignore. . . . It would be impossible for DeviantArt to try to block or censor this art technology," CEO Moti Levy told TechCrunch in an email interview. "We see so many instances where AI tools help artists' creativity, allowing them to express themselves in ways they could not in the past. That said, we believe we have a responsibility to all creators. To support AI art, we must also implement fair tools and add protections in this domain."
Do you think AI Projects Fail? Because I do? [REASONING IS HERE]
There is no surprise that AI and ML have become the key ingredients of modern technology and cyberspace. From wearables to robotics, AI is almost everywhere and in every sector. Most companies extend their hands to AI vendors to adopt AI into their workflow. They spent lots of time, money, and effort to ensure a successful project. However, Gartner estimated that more than 85 percent of AI projects fail and render errors. Another report says that around 70 percent of companies say that implementing AI has minimal or zero impact on overall workflow efficiency.
AI and Copyright Law: How Copyright Applies to AI-Generated Content - Trust Insights Marketing Analytics Consulting
Who owns these fabulous works of art generated by systems and models like OpenAI's DALL-E or Stability.ai's What about blog content created by tools like GoCharlie or Copy.ai? To engage Ruth's services as an attorney, visit their website at GeekLawFirm.com. This interview does not constitute legal advice or create a client-attorney relationship with anyone. The information contained in this interview is presented on an "as is" basis with no guarantee of completeness, accuracy, usefulness, timeliness, or of the results obtained from the use of this information and without warranty of any kind, express or implied, including, but not limited to warranties of performance, merchantability, or fitness for a particular purpose. While we have taken every reasonable precaution to insure that the content is accurate, errors can occur. In all cases you should consult with a qualified professional familiar with your particular situation for advice concerning specific matters. What follows is an AI-generated transcript. The transcript may contain errors and is not a substitute for watching the video. Please note the following warning disclosure and disclaimer, this interview does not constitute legal advice or create a client attorney relationship with anyone.
Fuzziness, Indeterminacy and Soft Sets: Frontiers and Perspectives
The present paper comes across the main steps that laid from Zadeh's fuzziness ana Atanassov's intuitionistic fuzzy sets to Smarandache's indeterminacy and to Molodstov's soft sets. Two hybrid methods for assessment and decision making respectively under fuzzy conditions are also presented through suitable examples that use soft sets and real intervals as tools. The decision making method improves an earlier method of Maji et al. Further, it is described how the concept of topological space, the most general category of mathematical spaces, can be extended to fuzzy structures and how to generalize the fundamental mathematical concepts of limit, continuity compactness and Hausdorff space within such kind of structures. In particular, fuzzy and soft topological spaces are defined and examples are given to illustrate these generalizations.
Creative Writing with an AI-Powered Writing Assistant: Perspectives from Professional Writers
Ippolito, Daphne, Yuan, Ann, Coenen, Andy, Burnam, Sehmon
Writing complete stories is considered a hallmark display of human intelligence, and thus researchers in artificial intelligence (AI) and natural language generation (NLG) have long used it as a pinnacle task for their research (Klein et al., 1973; Meehan, 1977; Turner, 1993; Dehn, 1981; Liu and Singh, 2002; McIntyre and Lapata, 2009). Creative writing and storytelling present unique challenges for automatic language generation: story arcs extend over thousands of words, stories typically contain multiple characters with their own distinctive personas and voices, and well-written stories have an authorial voice that is consistent and identifiable. At the same time, lies and fabrications-common generation flaws which are a liability in tasks like machine translation and automatic summarization-can be an asset in the creative domain. In recent years, the field of NLG has progressed by leaps and bounds due to the development of neural language models capable of learning the structure of language by ingesting billions of written words (Chowdhery et al., 2022; Zhang et al., 2022; Brown et al., 2020). There has been considerable work in applying these advancements toward the development of AI-powered tools for creative writing, but nearly all previous research in this space has evaluated their methods either with amateur writers or with crowd workers paid to assess performance on narrowly defined tasks (Clark et al., 2018; Roemmele and Gordon, 2015; Nichols et al., 2020). While these sorts of evaluations are valuable as preliminary assessments, we believe it is also crucial to solicit feedback from actual domain experts in creative writing: professional writers, educators, and language experts. Skilled writers comprise a unique user group with a different set of needs and expectations than amateurs.
Explainable Neural Networks: Revolutionizing AI - A Spotlight from Eric Lanoix
"As far as AI in banking is concerned, explainability and fairness will be must-haves in 2-3 years because bill C-27 and OSFI expectations are going to require them." Ahead of the REโขWORK - Toronto AI Summit, we asked Eric Lanoix, Vice President, Quantitative Risk at Coast Capital Savings his thoughts on the topic. Here's what he had to say: What do you think is the most important advancement for AI in Finance? What are some recent wins from an AI project you are working on? What challenges did you face during it?
How Do You Define Unfair Bias in AI?
Art is subjective and everyone has their own opinion about it. When I saw the expressionist painting Blue Poles, by Jackson Pollock, I was reminded of the famous quote by Rudyard Kipling, "It's clever, but is it Art?" Pollock's piece looks like paint messily spilled onto a drop sheet protecting the floor. The debate of what constitutes art has a long history that will probably never be settled, there is no definitive definition of art. Similarly, there is no broadly accepted objective definition for the quality of a piece of art, with the closest definition being from Orson Welles, "I don't know anything about art but I know what I like." Similarly, people recognize unfair bias when they see it, but it is quite difficult to create a single objective definition.