🧀 BigCheese.ai

Social

AI News & Launches

Stay ahead of the curve with BigCheese Weekly, your source for the Latest AI news and product debuts.

Subscribe
2 hours ago
Show HN: Open-source, native audio turn detection model

Smart Turn is an open-source, community-driven project for native audio turn detection for conversational voice AI, utilizing a Wav2Vec2-BERT model base. It is designed to improve on limitations of VAD-based turn detection by including linguistic and acoustic cues and supports only English currently. The project invites contributions and is licensed under the BSD-2-Clause license.

5 hours ago
Reducing latency with esoteric Linux process flags

James Matsuzaki of Recall.ai shares the experience of reducing latency from 12 seconds to 2-3 seconds for their Output Media feature by removing an esoteric Linux process flag, '--die-with-parent'. The post details challenges encountered with Bubblewrap sandboxing and Tokio's threading model in their endeavor.

5 hours ago
Using GRPO to Beat o1, o3-mini and R1 at "Temporal Clue"

OpenPipe has used GRPO to achieve performance surpassing R1, o1, o3-mini models on 'Temporal Clue' reasoning game. They discuss task design, hyperparameters, and share a training recipe, dataset, and model weights under the MIT license.

7 hours ago
Mistral OCR

Mistral AI introduces Mistral OCR, a new Optical Character Recognition API offering state-of-the-art document understanding. It processes text, tables, equations, and images with unprecedented accuracy. Ideal for use with RAG systems, it's multilingual and available for self-hosting. Try it on le Chat or la Plateforme.

8 hours ago
Mistral OCR

Mistral OCR introduces the world's best document understanding API, revolutionizing information abstraction and retrieval with Optical Character Recognition technology. The API comprehends media, text, tables, and equations with high accuracy, fostering innovation in document processing.

13 hours ago
Israel creating GPT-like tool using collection of Palestinian surveillance data

Israel's military intelligence used intercepted Palestinian communications to create a ChatGPT-like AI tool, raising concerns over bias and inaccuracies in surveillance practices. This advanced model can analyze Arabic dialects intensely, providing insights into vast data collections and potentially impacting Palestinians' daily lives.

17 hours ago
Fynix

Fynix.ai offers a suite of AI-powered tools designed to increase coding efficiency and code quality. Products such as AI Code Assistant and Code Quality Agent integrate into popular IDEs to provide real-time coding aid, automated code reviews, and repository-wide context insight.

17 hours ago
Slack Huddle API by Recall.ai

Recall.ai provides an integration for Slack huddle data capture, offering bots to record conversations, transcribe them, and manage events. It streamlines meeting management with automated workflows and powers AI-driven applications directly within Slack.

17 hours ago
Aider

Aider is an AI-powered pair programming tool for developers, enabling them to program alongside large language models (LLMs) directly from their terminal. It integrates with local git repositories to enhance coding productivity by providing features like automatic git commits, compatibility with most programming languages and the ability to work on complex multi-file projects.

17 hours ago
Perigon

Perigon is a data management platform that combines multi-source ingestion, data preparation, and analytics into one seamless workflow. Designed to handle complex data, it offers data integration, real-time data streams, and interactive visualizations to unlock insights.

18 hours ago
Arva AI (YC S24) Is Hiring an AI Product Engineer

Arva AI is hiring an AI Product Engineer in London, offering a salary of £65K - £90K GBP with 0.50% - 1.00% equity. The role involves building full-stack features on their AI-powered compliance platform and improving it through feedback loops and AI advancements. The company culture emphasizes fast delivery, customer-first priorities, and transparent communication.

a day ago
Volkswagen seeks to counter rivals with budget EV model

Volkswagen AG introduces the ID. EVERY1, the company's most affordable electric vehicle (EV) at €20,000, approximately $21,000. It will be the first model to use the electric architecture developed with Rivian. The ID. EVERY1 is set to offer over 155 miles of range and begin production in two years in Europe.

a day ago
DeepSeek-R1-671B-Q4_K_M with 1 or 2 Arc A770 on Xeon

This quickstart guide provides instructions on how to use llama.cpp portable zip to run llama.cpp on Intel GPU with ipex-llm. It includes prerequisites, step-by-step runtime configuration, and model execution for both Windows and Linux platforms. The guide is intended for running models like DeepSeek-R1-671B-Q4_K_M on Intel hardware, with different prerequisites and steps for Windows and Linux. DeepSeek V3/R1 models can also be run using FlashMoE on Linux platforms, provided the system meets the specified hardware requirements.

a day ago
Wake up Neo: An ancient painting competition, and The Matrix

An exploration of the ancient Greek yardstick of imitating reality through an art duel between Zeuxis and Parrhasios, and how the Hardham muralist engages with this theme to challenge the viewer's perception.

a day ago
Datafold (YC S20) is hiring Engineer to build AI-powered data migration tools

Datafold is seeking a Senior Software Engineer for their AI-powered data migration tool, the Datafold Migration Agent, offering a salary range of $175K-$245K. Applicants must have 3+ years of experience, proficiency in Python, and a startup mindset. The role entails driving DMA development, designing and scaling backend systems, and collaborating with customers. Datafold, a Series A startup backed by YC and others, prioritizes data quality and provides tools integrated into data engineers' workflows.

a day ago
Sam Altman Returns to Political Fund-Raising

Sam Altman, CEO of OpenAI, has reignited his political fundraising activities by hosting an event for Democratic Senator Mark Warner of Virginia. After a period of reduced involvement and a brief dismissal from OpenAI in 2023, Altman is participating in the political scene with a fund-raising lunch on March 20, whose tickets range from $1,000 to $22,000. The event marks his reentry into political support since helping Democrat Dean Phillips in 2022.

a day ago
Expanding AI Overviews and Introducing AI Mode

Google introduces AI Mode in Search, an experimental feature that enables advanced AI Overviews. It enhances the search experience with more reasoning, thinking, and multimodal capabilities, allowing users to get in-depth assistance on complex queries. Starting with coding and math questions, the feature will expand to more areas. It is currently available for testing in Labs to a limited audience, including Google One AI Premium subscribers.

2 days ago
They wanted to save us from a dark AI future. Then six people were killed

A group of outliers from the Silicon Valley community, influenced by AI-risk theories and calling themselves 'Zizians', become involved in a series of violent acts, resulting in six deaths. Their descent from rationalist thinkers to extreme behavior illustrates the potential dangers of ideology taken to the extreme.

2 days ago
Lifestack (Web + AI Scheduler)

Lifestack is a web and mobile application designed to optimize your day by combining schedule management with wearable data to plan tasks according to your energy levels, aiming to enhance productivity while maintaining health.

2 days ago
Latitude Agents

Latitude provides a platform to build self-improving AI agents capable of designing, evaluating, and deploying with your tools and data. Agents operate autonomously, unlike predefined workflows, exhibiting adaptiveness and integrability. Latitude's mission supports AI builders in creating scalable AI agents.

2 days ago
GaliChat AI

GaliChat is an AI Chatbot that provides 24/7 customer support, lead generation, and support cost reduction by training on specific industry data, and can be set up without coding.

2 days ago
Writing an LLM from scratch, part 8 – trainable self-attention

Giles Thomas' latest blog post details his journey through the book 'Build a Large Language Model (from Scratch)' by Sebastian Raschka. He delves into section 3.4 covering 'Implementing self-attention with trainable weights'. The post offers his perspective on understanding the concept of self-attention in LLMs, where inputs are transformed into different spaces to calculate attention scores, leading to the production of context vectors representing the meaning of tokens.

2 days ago
AI: Where in the Loop Should Humans Go?

This article by Fred Hebert on Honeycomb's blog, dated March 3rd, 2025, provides insight into the incorporation of AI in technology, focusing on its current limitations and the crucial role humans need to play in the AI loop. Hebert emphasizes that despite the promises of AI, practical deployment necessitates current functionality rather than future potential. The article raises several thoughtful questions regarding the integration of AI tools into systems, the risks of dependency, and enhancing human capabilities instead of replacing them.

2 days ago
Show HN: Fork of Claude-code working with local and other LLM providers

Project anon-kode is a terminal-based AI coding assistant that can utilize any model supporting the OpenAI-style API. It is designed to improve and explain code, run tests, and perform other tasks, depending on the chosen model.

3 days ago
Microsoft's new Dragon Copilot is an AI assistant for healthcare

Microsoft introduces Dragon Copilot, an AI healthcare assistant offering dictation and ambient listening in clinical settings. The system is developed from Nuance's technology, aiming to reduce administrative burden on clinicians and improve patient care. Microsoft's studies show reduced clinician burnout and enhanced patient experiences.

3 days ago
Trellis (YC W24) Is Hiring Eng to Build the Best AI Agents for PDF

Trellis provides a powerful AI engine to automate PDF workflows, transforming complex documents into ready-to-use data for operations and engineering teams within seconds, not weeks.

3 days ago
AI can decode digital data stored in DNA in minutes instead of days

Researchers have developed a new AI method called DNAformer that decodes digital data from DNA strands about 90 times faster than previous techniques. This advancement brings DNA data storage closer to practical use in computing, with potential for long-term preservation, as it's naturally replicable and can remain intact for millennia.

3 days ago
Kive Product Shots

Kive leverages AI to transform the creative process, allowing users to generate on-brand product visuals without needing a studio or photoshoot. The platform is designed to make tasks like social media ideas, ad campaigns, and asset management effortless for brands, agencies, and freelancers.

3 days ago
Opera Operator

Opera introduces Browser Operator, an AI agent for browsers, optimizing tasks for efficiency and privacy. It uses local resources for web tasks, and its AI Composer Engine handles instructions in natural language. The user remains in control, providing inputs for sensitive actions, as browsing data is kept private and secure on the user's device.

3 days ago
NeoBase

NeoBase is a cloud-based platform offering secure and private file sharing and storage services. It emphasizes on data privacy and aims to provide an intuitive user experience for managing files in the cloud.

3 days ago
Cardamon

Cardamon.ai is a cutting-edge platform blending artificial intelligence with data analysis to pave the way for innovative tech solutions, specializing in machine learning and complex algorithmic applications.

3 days ago
Microsoft Dragon Copilot

Dragon Copilot by Microsoft is an AI-powered clinical documentation tool designed to streamline documentation, surface relevant information, automate tasks, and improve clinical workflows across different care settings.

3 days ago
Nomadic infrastructure design for AI workloads

In a blog post on nomadic compute designs for AI workloads, the author discusses building sustainable systems using simple components for flexibility and cost-efficiency. Objects are stored without egress fees, reducing reliance on provider-specific features that impose vendor lock-in. The post further explores production readiness, cloud dependency strategies, and concepts for orchestrating easily replaceable compute resources.

3 days ago
Show HN: Open-Source Windows AI assistant that uses Word, Excel through COM

SmartestKid is a Windows desktop AI assistant written in Python, offering voice and text input modes and interfaces with Microsoft Office applications, Images, and the file system. It is open-source with an inviting community for contributions to expand its functionality.

3 days ago
Show HN: Agents.json – OpenAPI Specification for LLMs

agents-json is a GitHub repository created by Wildcard AI, aiming to describe contracts for API and agent interactions built on top of the OpenAPI standard. It provides a specification to enable AI agents such as language model tools to interact with APIs by using structured contracts. The current version of the agents.json specification is 0.1.0, and the repository contains examples, Python package, and demos for integrating with agents.json.

3 days ago
Go-attention: A full attention mechanism and transformer in pure Go

go-attention is the first pure Go implementation of attention mechanisms and transformer layers, designed for high performance and ease of use. It supports dot-product attention, multi-head attention, and full transformer layers with optimization for the CPU and minimal memory allocations.

3 days ago
SOTA Code Retrieval with Efficient Code Embedding Models

Qodo unveils Qodo-Embed-1, a new code embedding model family offering state-of-the-art performance with a smaller size than existing models. Qodo-Embed-1-1.5B achieves a 68.53 score on the CoIR benchmark, and its larger counterpart, Qodo-Embed-1-7B scores 71.5, showing superior efficiency across models.

4 days ago
Robust Wavefront OBJ model parsing in C

Article presents a robust Wavefront OBJ parser in C, detailing challenges and techniques in parsing 3D model data, showcasing a technique to handle undefined behavior, and demonstrating the parsing process with sample code.

4 days ago
AI-Dev

AI Dev Info provides insights into artificial intelligence and software development, featuring articles, tutorials, and news on latest technologies and trends in the AI programming domain.

Want stories like these in your inbox?

BigCheese Weekly newsletter will be sent to you each week.