TLDR AI 2024-09-18

Join Mira Murati, Marc Andreesen, and Jerry Liu at Ray Summit 2024 in San Francisco (Sponsor)

Don't wait for the AI future—come build it.

This September, join the world's largest gathering of open source AI infrastructure leaders and an all-star lineup of speakers you won't see anywhere else.

Attend Ray Summit 2024 in person to:

→ Go behind the scenes of some of the most ambitious and large-scale AI powered consumer applications.

→ Mingle with the world's top minds in distributed computing, machine learning, and AI.

→ Explore the Ray and Anyscale roadmaps and meet the teams that lead them.

🌉 Register to attend @ The San Francisco Marriott Marquis

🚀

Headlines & Launches

Mistral Free API and Price Update (3 minute read)

Mistral has released a free API tier, dramatically reduced its costs, improved the performance of its small model, and put its vision model in Le Chat.

Challengers Are Coming for Nvidia's Crown (14 minute read)

Nvidia's dominance in AI chips has propelled it to immense market value, largely thanks to its GPU capabilities and CUDA software ecosystem. However, competitors like AMD, Intel, Cerebras, and SambaNova are developing innovative solutions to challenge Nvidia's supremacy in AI hardware. While Nvidia's lead remains secure for now, the landscape is dynamic, with multiple players striving to carve out their own niches in the AI market.

🧠

Research & Innovation

Jina Embeddings v3 (Hugging Face Hub)

The Jina series of embeddings are a high quality and powerful suite of models that can be used for embedding and retrieval. Its development team has released the next version of their model with improved performance and training.

Trustworthiness of RAG Systems (30 minute read)

This work introduces a framework to evaluate the trustworthiness of Retrieval-Augmented Generation (RAG) systems across six key areas: factuality, robustness, fairness, transparency, accountability, and privacy.

Enhancing Recommender Systems with beeFormer (8 minute read)

The new beeFormer framework improves sentence Transformers by incorporating interaction data, making them more effective for recommender systems.

🧑‍💻

Engineering & Resources

Discover the future of AI-powered commerce at The Edge Summit (Sponsor)

Join Bloomreach, Google, and NVIDIA for this exclusive one-day event! Hear from leaders across industries about how they're bringing AI to life, where they see opportunity, and their predictions for what's next. Save your virtual front row seat to tap into the future of AI in ecommerce - told by the people building it day by day.

AI Comic Understanding (GitHub Repo)

The last frontier of Visual Language Models is the ability to understand and reason about comics. This project is a survey and a call for research.

Word Llama (GitHub Repo)

A lightweight toolkit for fuzzy deduplication, reranking, and other NLP-based tasks. Optimized to run on the CPU.

Syllable Segmentation in Speech Models (GitHub Repo)

This project enhances speech representation learning by separating syllabic structures from speaker information in self-supervised models. By fine-tuning the HuBERT model with speaker perturbation techniques, researchers improved syllable segmentation, leading to better syllabic unit organization.

🎁

Miscellaneous

Data Pipelines are the new AI secret sauce (16 minute read)

With models being somewhat commoditized, much of the advantage in AI comes from the data. It also, by extension, comes from the pipeline that ingests and creates the data. This post discusses the challenges and opportunities associated with data pipelines in the modern age.

Why Copilot is Making Programmers Worse at Programming (5 minute read)

AI tools like GitHub Copilot enhance programming productivity but risk eroding essential coding skills. Over-reliance on AI-generated code can lead to quality, security, and maintainability issues and reduce learning opportunities. These tools may also limit creative problem-solving and foster a false sense of expertise among developers.

⚡

Quick Links

Surveillance video summarization (GitHub Repo)

A custom-trained Florence 2-based model and system that can summarize CCTV and Surveillance videos and give accurate updates about what is occurring at any time.

TikTok's owner wants to design its own AI chips (2 minute read)

ByteDance is developing its own AI chips with TSMC to reduce reliance on Nvidia GPUs amid U.S. export controls.

Salesforce unleashes its first AI agents (2 minute read)

Salesforce has debuted Agentforce, its effort to create generative AI bots capable of taking action on their own - within established limits.

Love TLDR? Tell your friends and get rewards!

Share your referral link below with friends to get free TLDR swag!

https://refer.tldr.tech/6d412934/2

Track your referrals here.

Want to advertise in TLDR? 📰

If your company is interested in reaching an audience of AI professionals and decision makers, you may want to advertise with us.

If you have any comments or feedback, just respond to this email!

Thanks for reading,
Andrew Tan & Andrew Carr

If you don't want to receive future editions of TLDR AI, please unsubscribe from TLDR AI or manage all of your TLDR newsletter subscriptions.

Email Details

Mistral API 💻, Nvidia challengers👊, Trustworthiness of RAG Systems📚

TLDR AI 2024-09-18

Headlines & Launches

Research & Innovation

Engineering & Resources

Miscellaneous

Quick Links