Email Details

OpenAI o1 🤖, Gemini Live on Android📱, Image-to-3D Generation🖼️

OpenAI has released a model that spends time considering before it answers, sometimes leading to super human performance 

TLDR

Together With

TLDR AI 2024-09-13

Extracting meaningful data from emails was always a heavy lift. ExtractAI by Nylas makes it easy (Sponsor)

There's valuable data in emails, but it's often buried under layers of text, images, and HTML. Nylas ExtractAI is a new way to get that data into the hands of developers, faster and with zero effort.

👉 Try ExtractAI for free

Retrieve first-party data from your users' inboxes with advanced ML, NLP, and LLMs .

Extract features and classify emails: online orders, travel reservations, invoices, and more.

Sync directly with your user's inbox in real time. No forwarding required!

Create high-quality labeled datasets to train your own extractors.

👷‍♀️ Get your free API key and start building

🚀

Headlines & Launches

OpenAI's newest model (8 minute read)

OpenAI has released its next model, which was trained to think before it answers. The new model was trained with reasoning traces and spends time considering before it answers. In some domains, this has led to super human performance. The model will be rate limited to 30 or so queries per user per week, but OpenAI hopes to lift that restriction soon.
Google is now rolling out Gemini Live to free users on Android (2 minute read)

Google is rolling out Gemini Live, its conversational AI feature, to free Android users after a month of advanced user access. Users can interrupt responses with new information and receive text transcripts of their interactions. Although Gemini Live doesn't support extensions like Gmail yet, it offers ten new voice options, and more features are promised soon.
🧠

Research & Innovation

HTML to Markdown (12 minute read)

Jina has released two new state-of-the-art models that take noisy HTML and parse it into clean and usable Markdown for training and reasoning.
Code Generation with Policy Filtration (16 minute read)

Policy Filtration for Proximal Policy Optimization (PF-PPO) is a method designed to improve the accuracy of reinforcement learning from human feedback (RLHF) in code generation tasks.
Saliency Prediction with Targeted Data Augmentation (27 minute read)

Researchers have developed a new data augmentation method for improving saliency prediction models, which traditionally suffer from limited labeled data.
🧑‍💻

Engineering & Resources

[Ebook] How to be a strategic partner at your business (Sponsor)

Executives want to see numbers—are you delivering? Harnessing the power of people analytics helps HR get a seat at the table. Our guide breaks down:

- The definition and uses of data and reporting

- Finding the right metrics to track for your business

- How to put data into action

Download Now

Teaching AI to Learn and Reuse Task Workflows (GitHub Repo)

Agent Workflow Memory (AWM) is a method that helps language model-based agents learn reusable task workflows from past experiences to handle complex, long-horizon tasks.
Image-to-3D Generation (GitHub Repo)

Hi3D is a new model that improves the generation of multi-view consistent, high-resolution 3D images from a single input. It uses a video diffusion approach to tackle the lack of 3D awareness in traditional 2D methods, leveraging temporal consistency from video models to enhance geometry across views.
🎁

Miscellaneous

1-Click fine-tuning of Llama 405B (11 minute read)

Axolotal AI has partnered with Lambda Labs to show how to use its one click cluster to fine-tune the Llama 3.1 405B model. This requires 64 GPUs, but can be done with minimal infra setup due to new tools.
Can LLMs Reproduce Research Results Autonomously? (GitHub Repo)

SUPER is a new benchmark designed to assess how well LLMs can reproduce tasks from research repositories.
Will the "AI Scientist" Bring Anything to Science? (9 minute read)

Researchers have developed an AI tool that automates scientific processes, performing tasks from hypothesis generation to experiment execution and paper writing. Its accuracy and coherence still need improvement. Critics highlight that, while AI can handle simulations in fields like quantum computing and materials science, it risks narrowing research questions and producing less meaningful knowledge. Proponents believe this AI could optimize early research stages, potentially aiding scientists in conceptualizing and scoping research projects.

Quick Links

Using GPT-4o for web scraping (5 minute read)

An AI-assisted web scraper that uses OpenAI's GPT-4o to extract structured data from HTML tables, with varying results on complex and merged row tables.
Sergey Brin says he's working on AI at Google 'pretty much every day' (1 minute read)

Google co-founder Sergey Brin has returned to Google to focus on AI because of its rapid advancements.
Amazon starts testing ads in its Rufus chatbot (1 minute read)

Amazon's shopping-focused chatbot, Rufus, will soon display sponsored ads based on search and conversational context.

Love TLDR? Tell your friends and get rewards!

Share your referral link below with friends to get free TLDR swag!
Track your referrals here.

Want to advertise in TLDR? 📰

If your company is interested in reaching an audience of AI professionals and decision makers, you may want to advertise with us.

If you have any comments or feedback, just respond to this email!

Thanks for reading,
Andrew Tan & Andrew Carr


If you don't want to receive future editions of TLDR AI, please unsubscribe from TLDR AI or manage all of your TLDR newsletter subscriptions.

© 2024 Email Dashboard. All rights reserved.