Skip to content

Commit

Permalink
digest 294
Browse files Browse the repository at this point in the history
  • Loading branch information
andreykurenkov committed Nov 5, 2024
1 parent aa9fbb6 commit 2e74858
Show file tree
Hide file tree
Showing 2 changed files with 128 additions and 0 deletions.
128 changes: 128 additions & 0 deletions _posts/digests/2024-11-04-294.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,128 @@
---
layout: redirect
title: "Last Week in AI #294"
excerpt: "AI Minecraft simulation 🎮, OpenAI's ChatGPT gets search engine upgrade 🔍, and the future of AI robots at home 🤖, and more!"
image:
feature: assets/img/digests/294/Screenshot-2024-10-31-at-2.28.25PM.png?resize=1200,586
credit: <a href="<Image Source Link>"> <Author> / <Source Name> </a>
categories: [digests]
permalink: /digests/the-two-hundred-and-ninety-fourth
sidebartoc: true
redirect: https://lastweekin.ai/p/294
---

### Top News

#### [Decart’s AI simulates a real-time, playable version of Minecraft](https://techcrunch.com/2024/10/31/decarts-ai-simulates-a-real-time-playable-version-of-minecraft/)
![](https://techcrunch.com/wp-content/uploads/2024/10/Screenshot-2024-10-31-at-2.28.25PM.png?resize=1200,586)

Decart, an Israeli AI firm, has launched Oasis, an "open-world" AI model that simulates a real-time, playable version of Minecraft. The model, which was trained on Minecraft gameplay videos, generates frames in real time based on keyboard and mouse movements, simulating the game's physics, rules, and graphics. Despite its current low resolution and tendency to "forget" level layouts, Decart is working on improvements, including the ability to create a custom "world" from an uploaded image. Future versions of Oasis, optimized for Etched's upcoming AI accelerator chips, could potentially generate up to 4K gameplay. However, questions about copyright implications arise as Decart did not mention obtaining Microsoft's permission to train the model on Minecraft footage.


#### [OpenAI’s search engine is now live in ChatGPT](https://www.theverge.com/2024/10/31/24283906/openai-chatgpt-live-web-search-searchgpt)
![](https://cdn.vox-cdn.com/thumbor/2e0-IzeyJ7KzcdVPwSc9wHJt9zc=/0x0:2040x1360/1200x628/filters:focal(1020x680:1021x681)/cdn.vox-cdn.com/uploads/chorus_asset/file/25462005/STK155_OPEN_AI_CVirginia_B.jpg)

OpenAI has integrated a web search feature into its AI-powered chatbot, ChatGPT, closing a competitive gap with rivals like Microsoft Copilot and Google Gemini. The feature, which can be manually triggered or activated based on queries, allows users to access real-time information during conversations. The search functionality, built with a mix of technologies including Microsoft's Bing, will be available across all ChatGPT platforms and is based on a fine-tuned version of GPT-4o. Despite the new feature, OpenAI will continue to update its training data to ensure users have access to the latest advancements. The launch comes amid a surge in AI-powered search across tech giants, with Meta developing its own solution and Google expanding its AI overview feature.


#### [This Is a Glimpse of the Future of AI Robots](https://www.wired.com/story/physical-intelligence-home-robot/)
![](https://media.wired.com/photos/6723cd4c5b922e5f7c40ff3a/191:100/w_1280,c_limit/Screenshot%202024-10-31%20at%2011.32.25%20AM.png)

San Francisco-based startup, Physical Intelligence, is developing an artificial intelligence (AI) model capable of performing a variety of household chores, moving the concept from science fiction to reality. The AI model is trained on an extensive amount of data, similar to large language models (LLMs) used in chatbots, but with a focus on physical tasks. The company's approach involves using data from various types of robots, creating a general-purpose learning algorithm for the physical world. This development, as stated by the company's CEO, Karol Hausman, is akin to training language models, and it brings the prospect of integrating AI models like ChatGPT into the physical world.



### Other News
#### Tools
![](https://www.marktechpost.com/wp-content/uploads/2024/10/Screenshot-2024-10-30-at-7.08.17 PM.png)

[OpenAI Releases SimpleQA: A New AI Benchmark that Measures the Factuality of Language Models](https://www.marktechpost.com/2024/10/30/openai-releases-simpleqa-a-new-ai-benchmark-that-measures-the-factuality-of-language-models/) - OpenAI has released SimpleQA, a new benchmark that measures the factuality of responses generated by language models, focusing on short, fact-seeking questions with a single, indisputable answer, and designed to remain challenging for the latest AI models.

[Meta unveils AI tools to give robots a human touch in physical world](https://venturebeat.com/ai/meta-unveils-ai-tools-to-give-robots-a-human-touch-in-physical-world/) - Meta unveils AI tools for robots to interact with the physical world, including touch perception models, tactile sensors, and a benchmark for evaluating human-robot collaboration.

[This AI-generated Minecraft may represent the future of real-time video generation](https://www.technologyreview.com/2024/10/31/1106461/this-ai-generated-minecraft-may-represent-the-future-of-real-time-video-generation/) - AI companies Decart and Etched are developing AI-generated Minecraft with the potential for real-time interactive video, aiming to improve performance and energy efficiency with a new specialized chip.

[Google preps ‘Jarvis’ AI agent that works in Chrome](https://9to5google.com/2024/10/26/google-jarvis-agent-chrome/) - Google is developing an AI agent called Project Jarvis, which will operate in Chrome and automate everyday web-based tasks, powered by Gemini 2.0 and expected to be previewed in December.

[Runway Adds Precise Camera Controls to its AI Video Editor](https://petapixel.com/2024/11/04/runway-adds-precise-camera-controls-to-its-ai-video-editor/) - Runway's new Advanced Camera Control feature allows precise panning, tracking, and zooming around AI subjects, catering to filmmakers and Hollywood studios.

[Anthropic’s Claude AI chatbot now has a desktop app](https://www.theverge.com/2024/10/31/24284742/claude-ai-macos-windows-desktop-app) - Anthropic's AI chatbot Claude now has a desktop app and a new "computer use" feature that allows it to control a computer by looking at a screen, moving the cursor, clicking buttons, and entering text.

[xAI adds image understanding capabilities to Grok](https://techcrunch.com/2024/10/28/xai-adds-image-understanding-capabilities-to-grok/) - xAI, owned by Elon Musk, has integrated image-understanding capabilities into its Grok AI model, allowing paid users on X social platform to upload images and ask the AI chatbot questions about them, with plans to further enhance its functionality.

[Watch out, Midjourney — Recraft just announced new AI image generator model](https://www.tomsguide.com/ai/ai-image-video/watch-out-midjourney-recraft-just-announced-new-ai-image-generator-model) - Recraft has unveiled its latest AI image generation model, Recraft V3, which offers designer-centric features and sets a new benchmark for quality among AI image generators, surpassing competitors like Midjourney and OpenAI.

[Claude AI can now analyze PDFs - here's how to try it](https://www.zdnet.com/article/claude-ai-can-now-analyze-pdfs-heres-how-to-try-it-and-why-youll-want-to/) - Anthropic's Claude 3.5 Sonnet AI model now has the ability to analyze PDF files, including text, images, charts, and graphs, but this feature is only available through a paid professional subscription or the API.

#### Business
![](https://static01.nyt.com/images/2024/11/04/multimedia/04db-bezos1-hlpt/04db-bezos1-hlpt-facebookJumbo.jpg)

[Physical Intelligence, a Robot A.I. Specialist, Raises Millions From Bezos](https://www.nytimes.com/2024/11/04/business/dealbook/physical-intelligence-robot-ai.html) - Physical Intelligence, a start-up specializing in artificial intelligence for robots, raises $400 million from major investors including Jeff Bezos, aiming to create foundational software for any robot.

[Alphabet's Waymo Serving Over 150,000 Paid Robotaxi Rides Every Week Now, Surging 50% In 2 Months](https://www.benzinga.com/news/24/10/41618591/alphabets-waymo-serving-over-150-000-paid-robotaxi-rides-every-week-now-surging-50-in-2-months) - Waymo, the robotaxi operator, has seen a 50% surge in paid rides in just two months, now providing over 150,000 trips per week and planning to expand its operations.

[Waymo is now valued at a staggering $45 billion](https://electrek.co/2024/11/01/waymo-is-now-valued-at-a-staggering-45-billion/) - Waymo, Alphabet's autonomous driving unit, has received a significant amount of fresh capital, leading to a valuation of over $45 billion, and plans to expand its robotaxi service in various cities.

[Zoox custom robotaxis are finally coming to San Francisco and Las Vegas](https://techcrunch.com/2024/10/30/zoox-custom-robotaxis-are-finally-coming-to-san-francisco-and-las-vegas/) - Zoox, an Amazon-owned AV company, is set to launch its purpose-built autonomous vehicles in San Francisco and Las Vegas, starting with an "explorer" program for early riders and a gradual expansion of its robotaxi service.

[What if A.I. Is Actually Good for Hollywood?](https://www.nytimes.com/2024/11/01/magazine/ai-hollywood-movies-cgi.html) - A Hollywood visual-effects start-up is using artificial intelligence to create seamless digital renderings of human faces, revolutionizing the industry and hinting at the potential for A.I. to accomplish high-quality visual effects at a fraction of the production cost.

[Universal Music partners with AI company building an ‘ethical’ music generator](https://www.theverge.com/2024/10/28/24282030/universal-music-group-partners-with-klay-ai-company-ethical-music-generator-foundational-model) - Universal Music partners with AI company Klay Vision to create an "ethical" foundational model for AI music generation, aiming to collaborate with the music industry and creators while respecting copyright and likeness rights.

[Meta says it’s making its Llama models available for US national security applications](https://techcrunch.com/2024/11/04/meta-says-its-making-its-llama-models-available-for-us-national-security-applications/) - Meta is making its Llama series of AI models available to U.S. government agencies and contractors working on national security applications, in an effort to combat the perception that its "open" AI is aiding foreign adversaries.

[SAG-AFTRA Inks Deal With AI Company Ethovox To Build Foundational Voice Model For Digital Replicas](https://deadline.com/2024/10/sag-aftra-ai-deal-ethovox-voice-replicas-1236160257/) - SAG-AFTRA has partnered with Ethovox to create a foundational voice model for digital replicas, ensuring fair compensation and consent for voice actors involved, while also advocating for more contractual protection in the age of AI.

[Microsoft just delayed Recall again](https://www.theverge.com/2024/10/31/24284572/microsoft-recall-delay-december-windows-insider-testing) - Microsoft is delaying the rollout of its Recall feature for Copilot Plus PCs once again, this time to refine the experience and ensure a secure and trusted user experience.

[Meta strikes multi-year AI deal with Reuters](https://www.axios.com/2024/10/25/meta-reuters-ai-news-facebook-instagram) - nan

[Perplexity CEO offers AI company’s services to replace striking NYT staff](https://techcrunch.com/2024/11/04/perplexity-ceo-offers-ai-companys-services-to-replace-striking-nyt-staff/) - Perplexity CEO offers AI company’s services to replace striking NYT staff, sparking controversy and criticism.

[Anthropic hikes the price of its Haiku model](https://techcrunch.com/2024/11/04/anthropic-hikes-the-price-of-its-haiku-model/) - Anthropic's new AI model, Claude 3.5 Haiku, is pricier than its predecessor and lacks image analysis capabilities, despite outperforming the previous flagship model on certain benchmarks.

#### Research
![](https://oasis-model.github.io/favicon.ico)

[Oasis: A Universe in a Transformer](https://oasis-model.github.io/) - Oasis is the first playable, real-time, open-world AI model, a video game entirely generated by AI, which takes in user keyboard input and generates real-time gameplay, including physics, game rules, and graphics, and is the first step in research towards more complex interactive worlds.

[Built in four days, this $120 robot arm cleans a spill with help from GPT-4o](https://techcrunch.com/2024/11/04/built-in-four-days-this-120-robot-arm-cleans-a-spill-with-help-from-gpt-4o/) - Pair of roboticists at UC Berkeley and ETH Zurich leverage generative AI to program a $120 robot arm to clean spills in just four days.

[Can Language Models Replace Programmers? REPOCOD Says 'Not Yet'](https://arxiv.org/abs/2410.21647v1) - Language models have shown impressive code generation abilities, but the REPOCOD benchmark reveals that they are not yet capable of replacing human programmers in real-world software development.

[Distinguishing Ignorance from Error in LLM Hallucinations](https://arxiv.org/abs/2410.22071v1) - Distinguishing between two types of hallucinations in large language models is crucial for detecting and mitigating errors, and a new approach called Wrong Answer despite having Correct Knowledge (WACK) is introduced to address this issue.

[Are LLMs Better than Reported? Detecting Label Errors and Mitigating Their Effect on Model Performance](https://arxiv.org/abs/2410.18889v1) - Large language models (LLMs) can detect label errors in datasets, revealing that reported model performance may be higher than previously thought, and propose methods to mitigate the impact of mislabeled data on model training.

[MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark](https://arxiv.org/abs/2410.19168v1) - MMAU is a benchmark designed to evaluate multimodal audio understanding models on tasks requiring expert-level knowledge and complex reasoning, challenging models to tackle tasks akin to those faced by experts.

[Mind Your Step (by Step): Chain-of-Thought can Reduce Performance on Tasks where Thinking Makes Humans Worse](https://arxiv.org/abs/2410.21333v1) - Chain-of-thought (CoT) prompting can reduce performance on tasks where thinking makes humans worse, as shown by experiments across various settings and models.

[OS-ATLAS: A Foundation Action Model for Generalist GUI Agents](https://arxiv.org/abs/2410.23218v1) - OS-ATLAS is a foundational GUI action model that excels at GUI grounding and OOD agentic tasks through innovations in both data and modeling, providing significant performance improvements over previous state-of-the-art models.

[Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms](https://arxiv.org/abs/2410.18967v1) - Ferret-UI 2 is a multimodal large language model designed to understand user interfaces across various platforms, offering support for multiple platform types, high-resolution perception, and advanced task training data generation.

[ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting](https://arxiv.org/abs/2410.17856v1) - Master open-world interaction with visual-temporal context prompting enables agents to accomplish complex tasks in Minecraft, showcasing the effectiveness of this approach in embodied decision-making.

[Brain-like Functional Organization within Large Language Models](https://arxiv.org/abs/2410.19542v1) - Large language models exhibit brain-like functional organization.

[Unbounded: A Generative Infinite Game of Character Life Simulation](https://arxiv.org/abs/2410.18975v1) - A generative infinite game called Unbounded uses advanced AI to create a virtual world where players can interact with autonomous virtual characters through open-ended mechanics generated by a large language model.

[Bayesian scaling laws for in-context learning](https://arxiv.org/abs/2410.16531v2) - Bayesian scaling laws for in-context learning are discussed in the paper, which is currently unavailable on Hugging Face.

#### Concerns
![](https://www.zdnet.com/a/img/resize/c2d63c5be4b9367a073d561782321f602f93ef63/2024/11/01/faca24b0-25e0-45cd-95d0-2bbfce183742/threataisssgettyimages-1138666425.jpg?auto=webp&fit=crop&height=675&width=1200)

[Anthropic warns of AI catastrophe if governments don't regulate in 18 months](https://www.zdnet.com/article/anthropic-warns-of-ai-catastrophe-if-governments-dont-regulate-in-18-months/) - AI company Anthropic warns of catastrophic AI risks and advocates for targeted regulation to mitigate these risks, emphasizing the importance of transparency, incentivizing security, and simplicity in government guidelines.

[Google, Microsoft, and Perplexity Are Promoting Scientific Racism in Search Results](https://www.wired.com/story/google-microsoft-perplexity-scientific-racism-search-results-ai/) - AI-infused search engines from Google, Microsoft, and Perplexity are surfacing debunked research promoting race science and the idea of white genetic superiority, raising concerns about potential radicalization.

[Open Source Bites Back as China’s Military Makes Full Use of Meta AI](https://gizmodo.com/open-source-bites-back-as-chinas-military-makes-full-use-of-meta-ai-2000519373) - Chinese research institutions with connections to the military have developed AI systems using Meta’s open-source Llama model, training them for military applications such as intelligence analysis, strategic planning, and command decision-making, despite Meta's prohibitions.

[Tesla self-driving test driver: ‘you’re running on adrenaline the entire eight-hour shift’](https://electrek.co/2024/11/01/tesla-self-driving-test-driver-youre-running-on-adrenaline-the-entire-eight-hour-shift/) - Tesla's internal self-driving team pushes the limits of autonomous driving technology, with test drivers describing dangerous scenarios and risky behaviors in the pursuit of data collection.

[OpenAI Research Finds That Even Its Best Models Give Wrong Answers a Wild Proportion of the Time](https://futurism.com/the-byte/openai-research-best-models-wrong-answers) - OpenAI's latest AI models, including its cutting edge o1-preview model, are shockingly bad at providing correct answers, with even the best models scoring abysmally on the new SimpleQA benchmark, raising concerns about the pervasiveness of AI in everyday life.

<hr>

Copyright © 2024 Skynet Today, All rights reserved.
Binary file not shown.

0 comments on commit 2e74858

Please sign in to comment.