-
Notifications
You must be signed in to change notification settings - Fork 19
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
8c565b0
commit f2b5c2b
Showing
1 changed file
with
119 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,119 @@ | ||
--- | ||
layout: redirect | ||
title: "Last Week in AI #255" | ||
excerpt: "Web swamped with AI-translated gibberish 🗑️, Self-Rewarding Language Models revolutionize LLMs 🔄, FTC probes AI partnerships of tech giants 🕵️♂️, and more!" | ||
image: | ||
feature: assets/img/digests/255/ | ||
credit: <a href="<Image Source Link>"> <Author> / <Source Name> </a> | ||
categories: [digests] | ||
permalink: /digests/the-two-hundred-and-fifty-fifth | ||
sidebartoc: true | ||
redirect: https://lastweekin.ai/p/255 | ||
--- | ||
|
||
### Top News | ||
|
||
#### [A ‘Shocking’ Amount of the Web Is Already AI-Translated Trash, Scientists Determine](https://www.vice.com/en/article/y3w4gw/a-shocking-amount-of-the-web-is-already-ai-translated-trash-scientists-determine) | ||
![](https://video-images.vice.com/articles/65a810b91a77481371ba2552/lede/1705514234136-gettyimages-1434945213.jpeg?image-resize-opts=Y3JvcD0xeHc6MC44NDIzeGg7MHh3LDAuMDMwNHhoJnJlc2l6ZT0xMjAwOiomcmVzaXplPTEyMDA6Kg) | ||
|
||
A recent study by researchers at the Amazon Web Services AI lab reveals that over half of the sentences on the internet have been translated into two or more languages, often with deteriorating quality due to poor machine translation (MT). The study, which analyzed a corpus of 6.38 billion sentences, found that 57.1% of the sentences were translated into at least three languages. The quality of translations varies significantly, with "low-resource" languages, particularly those spoken in Africa and the Global South, suffering from insufficient training data, resulting in inaccurate text. The study also found a selection bias towards shorter, "more predictable" sentences from low-quality articles, suggesting that a large portion of the internet in lower-resource languages is poorly machine-translated, raising concerns for the development of large language models in these languages. | ||
|
||
#### [Self-Rewarding Language Models](https://arxiv.org/abs/2401.10020v1) | ||
The article discusses the concept of Self-Rewarding Language Models (SRLMs), a new approach to improving the performance of Large Language Models (LLMs) by incorporating a self-improving reward model. Unlike traditional methods such as Reinforcement Learning from Human Feedback (RLHF) and Direct Preference Optimization (DPO), which rely on human preference data and often face limitations due to the quality and size of this data, SRLMs continually update the reward model during LLM alignment, eliminating these bottlenecks. The SRLMs act as instruction following models, generating responses for prompts and evaluating new instruction following examples to add to their training set. The article also presents an experiment where a Llama 2 70B model was fine-tuned on Open Assistant, resulting in improved instruction following performance and reward modeling ability. This suggests the potential for developing superior LLMs that can provide higher quality preference datasets to themselves in each iteration of training. | ||
|
||
#### [Alphabet, Amazon, Microsoft Face FTC Inquiry on AI Partners](https://www.bloomberg.com/news/articles/2024-01-25/alphabet-amazon-anthropic-microsoft-openai-get-ftc-inquiry-lrthp0es) | ||
![](https://assets.bwbx.io/images/users/iqjWHBFdfxIU/iW9cKrBVlxAU/v1/1200x800.jpg) | ||
The article discusses an inquiry by the Federal Trade Commission (FTC) into tech giants Alphabet, Amazon, and Microsoft regarding their partnerships in the field of artificial intelligence (AI). The FTC is investigating these companies' AI collaborations to ensure they are not violating any antitrust laws or engaging in anti-competitive practices. The inquiry is part of a broader scrutiny of big tech companies' dominance and influence in various sectors. The outcome of this investigation could have significant implications for the future of AI development and the tech industry as a whole. | ||
|
||
### Other News | ||
#### Tools | ||
![](https://i0.wp.com/electrek.co/wp-content/uploads/sites/3/2021/08/Tesla-Full-Self-Driving-Beta-Hero.jpg?resize=1200%2C628&quality=82&strip=all&ssl=1) | ||
|
||
[Tesla finally releases FSD v12, its last hope for self-driving](https://electrek.co/2024/01/22/tesla-releases-fsd-v12-last-hope-self-driving/) - Tesla releases FSD v12, its last hope for self-driving, introducing end-to-end neural nets to power vehicle controls, with the update being rolled out to customers after being used in the internal test fleet. | ||
|
||
[Google is using AI to organize and customize your Chrome browser](https://www.theverge.com/2024/1/23/24047843/google-chrome-browser-ai-organize-tabs-themes) - Google is using AI to enhance Chrome browser with features like tab organization, automatic theme generation, and a "Help me write" tool, aiming to integrate AI into web interaction and creation. | ||
|
||
[Opera to launch new AI-powered browser for iOS in Europe following Apple’s DMA changes](https://techcrunch.com/2024/01/26/opera-to-launch-new-ai-powered-browser-for-ios-in-europe-following-apples-dma-changes/) - Opera is launching a new AI-powered browser for iOS in Europe following Apple's DMA changes, allowing developers to offer non-WebKit-based browsers and providing iPhone users with an alternative to Safari. | ||
|
||
[Introducing Stable LM 2 1.6B](https://stability.ai/news/introducing-stable-lm-2) - Introducing Stable LM 2 1.6B, a state-of-the-art 1.6 billion parameter small language model trained on multilingual data, with a compact size and speed to lower hardware barriers for developers, and the release of the last pre-training checkpoint and optimizer states for fine-tuning. | ||
|
||
[OpenAI drops prices and fixes ‘lazy’ GPT-4 that refused to work](https://techcrunch.com/2024/01/25/openai-drops-prices-and-fixes-lazy-gpt-4-that-refused-to-work/) - OpenAI drops prices for API access and introduces new models, including a fix for the "lazy" GPT-4, while also releasing new text embedding models and a free moderation API. | ||
|
||
#### Business | ||
![](https://s.yimg.com/ny/api/res/1.2/hGc60HMgqrX0h_DEsI2u9Q--/YXBwaWQ9aGlnaGxhbmRlcjt3PTEyMDA7aD03OTA-/https://s.yimg.com/os/creatr-uploaded-images/2024-01/c368cba0-bac6-11ee-a7ef-975c812b949e) | ||
|
||
[Nvidia, Microsoft, Google, and others partner with US government on AI research program](https://finance.yahoo.com/news/nvidia-microsoft-google-and-others-partner-with-us-government-on-ai-research-program-160111850.html) - US government partners with tech giants to launch National Artificial Intelligence Research Resource (NAIRR) pilot program, aiming to provide researchers and educators across the country with access to high-powered AI technologies. | ||
|
||
[Voice cloning startup ElevenLabs lands $80M, achieves unicorn status](https://techcrunch.com/2024/01/22/voice-cloning-startup-elevenlabs-lands-80m-achieves-unicorn-status/) - ElevenLabs, a voice cloning startup, has raised $80 million in funding, achieved unicorn status, and faced criticism for misuse of its AI-powered tools, while also attempting to address concerns from voice actors and compete with other synthetic voice startups and Big Tech companies. | ||
|
||
[Waymo looks to launch full fleet of robotaxis in LA](https://electrek.co/2024/01/22/waymo-looks-to-launch-full-fleet-of-robotaxis-in-la/) - Waymo plans to expand its driverless robotaxi service in Los Angeles, facing potential challenges due to the fallout from Cruise and concerns from regulators, despite its success in San Francisco and claims of safety. | ||
|
||
[Baidu's Ernie AI chatbot to power Samsung's new Galaxy S24 smartphones](https://www.cnbc.com/2024/01/26/baidus-ernie-ai-chatbot-to-power-samsungs-new-galaxy-s24-smartphones.html) - Baidu's Ernie AI chatbot will be integrated into Samsung's Galaxy S24 smartphones, enabling real-time call translation and other advanced features. | ||
|
||
[Ola Founder’s Krutrim Becomes First $1 Billion Indian AI Startup](https://www.bloomberg.com/news/articles/2024-01-26/ola-founder-s-krutrim-becomes-first-1-billion-indian-ai-startup) - Ola Founder's Krutrim achieves the milestone of becoming the first $1 billion Indian AI startup. | ||
|
||
[Alphabet Shares Flirt With Record High on AI Hype](https://www.bloomberg.com/news/articles/2024-01-24/alphabet-flirts-with-record-high-as-ai-hype-propels-stock-higher) - Alphabet's shares are approaching a record high due to the excitement surrounding AI. | ||
|
||
#### Research | ||
![](https://news.utexas.edu/wp-content/uploads/2024/01/klivans-dimakis-C-2400x1600-1.jpg) | ||
|
||
[New Texas Center Will Create Generative AI Computing Cluster Among Largest of Its Kind](https://news.utexas.edu/2024/01/25/new-texas-center-will-create-generative-ai-computing-cluster-among-largest-of-its-kind/) - University of Texas at Austin is creating a powerful artificial intelligence hub with a new GPU computing cluster to lead in research and offer world-class AI infrastructure to a wide range of partners, focusing on biosciences, health care, computer vision, and natural language processing. | ||
|
||
[ChatQA: Building GPT-4 Level Conversational QA Models](https://arxiv.org/abs/2401.10225v1) - Building on the success of ChatGPT, this article introduces ChatQA-70B, a white-box conversational QA model with GPT-4 level accuracy, achieved through a two-stage instruction tuning recipe, an enhanced retriever for retrieval-augmented generation, and careful data curation. | ||
|
||
[DiffusionGPT: LLM-Driven Text-to-Image Generation System](https://arxiv.org/abs/2401.10061v1) - DiffusionGPT is an all-in-one text-to-image generation system that leverages a Large Language Model (LLM) to seamlessly integrate various generative models, addressing challenges faced by existing stable diffusion models and offering a training-free, efficient, and pioneering solution. | ||
|
||
[LEGO:Language Enhanced Multi-modal Grounding Model](https://arxiv.org/abs/2401.06071v1) - Advancements in large language models have led to the development of LEGO, a multi-modal grounding model that comprehends inputs across various modalities and addresses the issue of limited data through a diverse and high-quality multi-modal training dataset. | ||
|
||
[Deep Learning Tackles Deep Uncertainty ](https://eos.org/editor-highlights/deep-learning-tackles-deep-uncertainty) - Deep learning using neural networks is being used to emulate melt rates at the base of Antarctic ice shelves, offering a faster and potentially more accurate method for modeling future sea level rise. | ||
|
||
[Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data](https://arxiv.org/abs/2401.10891v1) - Unleashing the power of large-scale unlabeled data for monocular depth estimation, this article discusses the benefits and challenges of using massive, diverse, and cheap unlabeled images, as well as the approach of jointly training large-scale labeled and unlabeled images to enhance the model's performance. | ||
|
||
[VMamba: Visual State Space Model](https://arxiv.org/abs/2401.10166v1) - VMamba is a novel visual state space model with global receptive fields and dynamic weights, addressing the computational complexity issue of attention mechanism in visual tasks and achieving promising results across various visual tasks. | ||
|
||
[WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models](https://arxiv.org/abs/2401.13919) - WebVoyager outperforms GPT-4 and text-only setups with a 55.7% task success rate, showcasing the capabilities of large multimodal models in building an end-to-end web agent. | ||
|
||
[Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models](https://arxiv.org/abs/2401.06102v2) - A unifying framework called Patchscopes allows for inspecting hidden representations of language models, aligning with values of openness, community, excellence, and user data privacy. | ||
|
||
[New Theory Suggests Chatbots Can Understand Text](https://www.quantamagazine.org/new-theory-suggests-chatbots-can-understand-text-20240122/) - AI chatbots like Bard and ChatGPT may have the ability to understand and generate humanlike text, as new research suggests that the largest language models can develop new skills and combine them in a way that hints at understanding, challenging the notion that they are just "stochastic parrots." | ||
|
||
[Using artificial intelligence and satellites, U of M helps farmers detect aphid infestations](https://www.cbsnews.com/minnesota/news/u-of-m-utilizes-artificial-intelligence-and-satellites-to-help-farmers-detect-aphid-infestations/) - University of Minnesota is using artificial intelligence and satellites to help farmers detect aphid infestations, aiming to create a website or app for farmers to use. | ||
|
||
[Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text](https://arxiv.org/abs/2401.12070v1) - Detecting machine-generated text using zero-shot detection methods, such as spotting LLMs with binoculars, is a key focus for individuals and organizations working with arXivLabs. | ||
|
||
[Humans Still Cheaper Than AI in Vast Majority of Jobs, MIT Finds](https://www.bloomberg.com/news/articles/2024-01-22/humans-still-cheaper-than-ai-in-vast-majority-of-jobs-mit-finds) - MIT study finds that humans are still more cost-effective than AI in the majority of jobs, countering fears of widespread job displacement. | ||
|
||
#### Concerns | ||
![](https://www.japantimes.co.jp/japantimes/uploads/images/2024/01/19/275593.jpg) | ||
|
||
[Akutagawa Prize draws controversy after win for work that used ChatGPT](https://www.japantimes.co.jp/culture/2024/01/19/books/akutagawa-prize-book-chatgpt/) - AI-generated novel wins controversial Akutagawa Prize, sparking debate over the use of ChatGPT in literature. | ||
|
||
[Generative AI’s end-run around copyright won’t be resolved by the courts](https://www.aisnakeoil.com/p/generative-ais-end-run-around-copyright) - Generative AI companies face copyright lawsuits, with the recent complaint by the New York Times highlighting near-verbatim text copies by ChatGPT, leading to a strong position for the Times, but the legal argument focuses on output similarity rather than the ethical and economic harm of training data appropriation, potentially resulting in a pyrrhic victory for creators and publishers. | ||
|
||
[Taylor Swift Is Living Every Woman’s AI Porn Nightmare](https://www.vice.com/en/article/qjvajd/taylor-swift-is-living-every-womans-ai-porn-nightmare) - AI-generated nudes of Taylor Swift are spreading across social media platforms, and tech companies are struggling to crack down on the abuse, highlighting the consequences of the rise of AI-generated content and the challenges in mitigating the production of harmful content. | ||
|
||
[George Carlin Estate Sues Creators of AI-Generated Comedy Special: ‘Computer-Generated Click-Bait’](https://www.rollingstone.com/culture/culture-news/george-carlin-estate-ai-comedy-special-lawsuit-1234954818/) - George Carlin's estate sues creators of AI-generated comedy special for unauthorized use of the comedian's copyrighted works, denouncing the special as "computer-generated click-bait" that detracts from Carlin's comedic works and harms his reputation. | ||
|
||
[Man sues Macy’s, saying false facial recognition match led to jail assault](https://www.washingtonpost.com/technology/2024/01/22/facial-recognition-wrongful-identification-assault/) - Faulty facial recognition match leads to wrongful arrest and jail assault, highlighting the dangers of technology's use by law enforcement. | ||
|
||
[San Francisco takes legal action over ‘unsafe,’ ‘disruptive’ self-driving cars](https://www.washingtonpost.com/technology/2024/01/23/san-francisco-lawsuit-robotaxi-waymo-cruise/) - San Francisco is suing the state over the expansion of autonomous car companies in the city, citing serious safety incidents and public nuisance caused by the vehicles. | ||
|
||
[DOJ and SEC investigate GM-owned self-driving car company Cruise](https://www.washingtonpost.com/technology/2024/01/25/cruise-investigation-doj-sec/) - DOJ and SEC investigate GM-owned self-driving car company Cruise following an October incident where one of its cars hit a pedestrian and dragged her 20 feet, leading to a federal probe and criticism of the company's response and transparency. | ||
|
||
[Cruise wasn’t hiding the pedestrian-dragging video from regulators — it just had bad internet](https://www.theverge.com/2024/1/25/24050791/cruise-pedestrian-dragging-video-driverless-report) - Cruise's attempt to send a video of a pedestrian-dragging incident to regulators was hindered by internet connectivity issues, leading to accusations of misleading behavior and a subsequent investigation by the Department of Justice and the Securities and Exchange Commission. | ||
|
||
[New Hampshire Officials to Investigate A.I. Robocalls Mimicking Biden](https://www.nytimes.com/2024/01/22/business/media/biden-robocall-ai-new-hampshire.html) - AI-generated robocalls impersonating President Biden urged New Hampshire voters not to participate in the primary election, prompting an investigation by state officials. | ||
|
||
#### Policy | ||
![](https://www.404media.co/content/images/size/w1200/2024/01/Screenshot-2024-01-24-at-11.12.05-AM.png) | ||
|
||
[Iceland Has Its Own AI George Carlin Moment, Considers Law Against Deepfaking the Dead](https://www.404media.co/iceland-considers-laws-against-deepfaking-the-dead-after-ai-resurrects-beloved-comedian/) - Iceland considers restrictions on using AI to reanimate dead people after national broadcaster reanimates beloved comedian for New Year's Eve celebration. | ||
|
||
#### Fun | ||
![](https://www.nme.com/wp-content/uploads/2024/01/[email protected]) | ||
|
||
[Guns N’ Roses share AI-generated video for ‘The General’](https://www.nme.com/news/music/guns-n-roses-unveil-ai-generated-music-video-the-general-3576628) - Guns N’ Roses released an AI-generated video for their track ‘The General’, combining live footage with animated sequences to create a trippy and bold visual experience. | ||
|
||
<hr> | ||
|
||
Copyright © 2024 Skynet Today, All rights reserved. |