AI Daily Briefs

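A timeline of AI products, models, open-source tools, and official announcements. Past entries are kept, and the feed can be filtered further by category, date, and tag.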

1335 archived briefs
80 open-source tools
80 current results
June 25 Today's Briefs
MarkTechPost Official News

MarkTechPost: Baidu Releases Unlimited OCR, a 3B Model That Keeps the KV Cache Flat for Long-Document Pars…

Summary: Baidu open-sourced Unlimited OCR, a 3B-parameter MoE model that parses dozens of document pages in a single forward pass. Its Reference Sliding Window Attention (R-SWA) holds the K… Source: MarkTechPost. Consider reading the original article and checking in particular which tool entry points, costs, risks, and real-world use cases it affects.

June 24 Yesterday's Briefs
MarkTechPost Official News

MarkTechPost: Gradium Launches stt-translate and s2s-translate, Real-Time Speech Translation Models Beatin…

Summary: Gradium released two real-time speech translation models, stt-translate and s2s-translate, covering English, French, German, Spanish, and Portuguese across 20 language pairs. The m… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: How to Design an OpenHarness Style Agent Runtime with Tools, Memory, Permissions, Skills, an…

Summary: In this tutorial, we build an OpenHarness style agent harness from scratch to see how a practical agent system works. We recreate the core building blocks: tool use, typed tool sch… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Using Graphify and NetworkX to Map Python Codebase Structure with God Nodes, Communities, an…

Summary: In this tutorial, we build a fully offline Graphify pipeline that turns a multi-module Python application into a knowledge graph. We install Graphify, generate a connected sample a… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Nous Research Adds /learn to Hermes Agent’s Skills System, Capturing Workflows as Slash Comm…

Summary: Nous Research has added /learn to the Hermes Agent Skills System. The command authors a standards-compliant SKILL.md from a local directory, a doc URL, a past conversation, or past… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: DFlash Speculative Decoding Drafts Whole Token Blocks in Parallel for Up to 15x Higher Throu…

Summary: UC San Diego's DFlash replaces autoregressive drafting with a lightweight block diffusion model for speculative decoding. It drafts whole token blocks in a single forward pass and… Source: MarkTechPost.

June 23 2026-06-23 Briefs
MarkTechPost Official News

MarkTechPost: Datalab Releases lift: A 9B Open-Weights Vision Model That Extracts Structured JSON From PDF…

Summary: Datalab released lift, a 9B open-weights vision model that turns PDFs and images into schema-matching JSON. It uses schema-constrained decoding for valid structure and trained abst… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: How to Use NVIDIA Canary-1B-v2 for ASR, Translation, and Automatic SRT Subtitle Export in Py…

Summary: In this tutorial, we build a multilingual ASR and speech translation pipeline with NVIDIA Canary-1B-v2. We load the model on a GPU-enabled runtime, prepare audio into 16 kHz mono,… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: GLM-5.2 OpenAI-Compatible API: A Hands-On Guide to Reasoning Effort, Function Calling, and L…

Summary: We build a practical GLM-5.2 workflow using its hosted, OpenAI-compatible API instead of running the model locally. We set up multiple providers, load the API key securely, and cre… Source: MarkTechPost.

June 22 2026-06-22 Briefs
MarkTechPost Official News

MarkTechPost: xAI Launches /goal in Grok Build, Adding Long-Running Autonomous Execution With Built-In Ver…

Summary: xAI introduced /goal in Grok Build, a mode for long-running, autonomous task execution. You hand off one objective, and the agent plans an approach, executes a progress checklist,… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Sakana AI Launches Sakana Fugu: An Orchestration Model That Routes Tasks Across a Swappable …

Summary: Fugu and Fugu Ultra route tasks across a swappable model pool, leading most coding, reasoning, and agentic benchmarks. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: MoonMath AI Open-Sources a HIP Attention Kernel for AMD MI300X That Beats AITER v3 on Every …

Summary: The HIP kernel uses one-instruction asm wrappers and an eight-wave pipeline to outperform AMD's AITER v3 on MI300X. Source: MarkTechPost.

June 21 2026-06-21 Briefs
MarkTechPost Official News

MarkTechPost: Crawlee for Python: Build a Web Crawling Pipeline with Robots Handling, Link Graphs, and RAG…

Summary: In this tutorial, we build a complete Crawlee for Python workflow from setup to AI-ready output. We generate a local demo website, then crawl it with BeautifulSoupCrawler, ParselCr… Source: MarkTechPost.

June 20 2026-06-20 Briefs
MarkTechPost Official News

MarkTechPost: Cisco AI Introduces FAPO: Pipeline-Aware Prompt Optimization With Step-Level Failure Attribu…

Summary: Cisco Foundation AI has open-sourced FAPO (Fully Automated Prompt Optimization), a Claude Code-driven system that autonomously optimizes multi-step LLM pipelines from baseline prom… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Nous Research Updates Hermes Agent With a Blank Slate Mode That Pins Toolsets via platform_t…

Summary: Nous Research has added a Blank Slate setup mode to its open-source Hermes Agent. It starts an agent with everything off except provider, model, File Operations, and Terminal. You… Source: MarkTechPost.

June 19 2026-06-19 Briefs
MarkTechPost Official News

MarkTechPost: NVIDIA AI Introduce SpatialClaw: A Training-Free Agent That Treats Code as the Action Interf…

Summary: SpatialClaw is a training-free agent that writes Python in a persistent kernel, composing perception tools for 3D spatial reasoning. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: VibeThinker-3B: A 3B Dense Reasoning Model Built on Qwen2.5-Coder-3B With the Spectrum-to-Si…

Summary: VibeThinker-3B, a 3B MIT-licensed reasoning model matching DeepSeek V3.2 and Kimi K2.5 on verifiable benchmarks. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Liquid AI Introduces LFM2.5-Embedding-350M and LFM2.5-ColBERT-350M: Dense Bi-Encoder and Lat…

Summary: Liquid AI's LFM2.5 Retrievers combine a dense bi-encoder and ColBERT late-interaction model for multilingual search on edge devices. Source: MarkTechPost.

June 18 2026-06-18 Briefs
MarkTechPost Official News

MarkTechPost: Perplexity Launches Brain, a Self-Improving Memory System That Builds a Context Graph of an …

Summary: Perplexity has launched Brain, a self-improving memory system for its Computer agent. Instead of remembering the user, Brain remembers the agent's work: what worked, what failed,… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: OpenAI Releases LifeSciBench, a 750-Task Benchmark Grading AI Models on Real Life-Science Re…

Summary: OpenAI's LifeSciBench evaluates whether frontier AI can handle real life-science research across 750 expert-authored tasks, seven workflows, and seven biological domains. Built by… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: NVIDIA SkillSpector Guide: Scanning AI Skills for Security Risks with Static Analysis and SA…

Summary: In this tutorial, we use NVIDIA SkillSpector to evaluate AI skills for security risks before deployment. We build a corpus of benign and deliberately vulnerable skills, then scan t… Source: MarkTechPost.

June 17 2026-06-17 Briefs
MarkTechPost Official News

MarkTechPost: Vercel Releases Eve: An Open-Source AI Agent Framework Where Each Agent is a Directory of Fi…

Summary: Vercel has open-sourced eve, an Apache-2.0 agent framework now in public preview. An agent is a directory of files, with durable execution, sandboxes, approvals, connections, chann… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: MiniMax Sparse Attention (MSA): a Two-Branch Block-Sparse Attention Trained on a 109B-Parame…

Summary: MiniMax released MSA, a sparse attention built on Grouped Query Attention. A lightweight Index Branch selects Top-k key-value blocks per query and GQA group; the Main Branch attend… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: OpenAI’s Deployment Simulation Extends Pre-Deployment Risk Assessment to Agentic Coding Thro…

Summary: OpenAI introduced Deployment Simulation on June 16, 2026. The method replays past conversations through a new candidate model before release. It then grades the completions to esti… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: How to Build Memory-Efficient Transformers with xFormers Using Packed Sequences, GQA, ALiBi,…

Summary: We implement xFormers, a practical toolkit for fast, memory-efficient Transformer models on GPUs. We validate memory-efficient attention against a standard implementation, then com… Source: MarkTechPost.

June 16 2026-06-16 Briefs
MarkTechPost Official News

MarkTechPost: Hermes Agent Adds Asynchronous Subagents, So Delegated Work No Longer Blocks the Parent Chat

Summary: We look at Hermes Agent's new asynchronous subagents from Nous Research. The delegate tool can now spawn background agents that no longer block the parent chat. We walk through the… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Meet Atoms: A Vibe Coding Tool That Uses AI Agents to Build, Deploy, and Market Your App (No…

Summary: The concept of vibe coding is interesting; you don’t need to be a developer or software engineer to build your own applications. You can describe your idea to an AI in plain langua… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Google Cloud Introduces Open Knowledge Format (OKF): A Vendor-Neutral Markdown Spec for Givi…

Summary: We break down Google Cloud's new Open Knowledge Format (OKF), an open spec that formalizes the LLM-wiki pattern. We explain how a bundle works: a directory of markdown files with Y… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: How to Build a Parsing Pipeline with Docling Parse for Layout-Aware Document Intelligence

Summary: In this tutorial, we build a workflow that uses Docling Parse to analyze PDF documents at a detailed structural level. We prepare a stable Python environment, handle common Colab d… Source: MarkTechPost.

June 15 2026-06-15 Briefs
MarkTechPost Official News

MarkTechPost: Sakana AI Commercializes AB-MCTS in Sakana Marlin, an Enterprise Agent Generating Up to 100-…

Summary: Sakana AI's first commercial product runs autonomously for up to eight hours per task. It returns multi-page reports and slides, built on AB-MCTS and AI Scientist workflows. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Meet Flash-KMeans: An IO-Aware, Exact K-Means That Runs Over 200× Faster Than FAISS on GPUs

Summary: Flash-KMeans is an open-source, IO-aware implementation of standard Lloyd's k-means in Triton GPU kernels. It does not change the math or approximate. FlashAssign removes distance-… Source: MarkTechPost.

June 14 2026-06-14 Briefs
MarkTechPost Official News

MarkTechPost: Databricks Open-Sources Omnigent: A Meta-Harness That Composes, Governs, and Shares AI Agent…

Summary: Databricks has open-sourced Omnigent, a meta-harness that sits above coding agents like Claude Code, Codex, and Pi. It adds composition, contextual policies, and live session shari… Source: MarkTechPost.

June 13 2026-06-13 Briefs
MarkTechPost Official News

MarkTechPost: A Coding Implementation on Spatial Graph Neural Networks for Urban Function Inference Using …

Summary: We build an end-to-end spatial graph learning pipeline using city2graph. We collect urban POI and street network data from OpenStreetMap, with a synthetic fallback for reliability. Source: MarkTechPost.

June 12 2026-06-12 Briefs
MarkTechPost Official News

MarkTechPost: Google Releases Gemini-SQL2: Gemini 3.1 Pro Text-to-SQL Scores 80.04% on BIRD Single-Model L…

Summary: We look at Gemini-SQL2, the text-to-SQL capability Google Research announced on June 12, 2026. Powered by Gemini 3.1 Pro, it posted 80.04% execution accuracy on the BIRD single-mod… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Moonshot AI Launches Kimi Work, a Local Desktop Agent Reportedly Running on Kimi K2.6 With a…

Summary: Moonshot AI's Kimi Work is a local desktop agent for macOS and Windows. It runs a 300-sub-agent swarm, drives your logged-in browser via WebBridge, and schedules background jobs. T… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Zyphra Release Zamba2-VL: Hybrid Mamba2–Transformer Vision-Language Models That Cut Time-to-…

Summary: Zyphra has released Zamba2-VL, a family of open vision-language models at 1.2B, 2.7B, and 7B parameters. The models use a hybrid Mamba2 state-space and Transformer backbone, shippi… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: A Coding Implementation on MONAI for End-to-End 3D Spleen Segmentation Using UNet on Medical…

Summary: In this tutorial, we build an end-to-end 3D medical image segmentation pipeline using MONAI to segment the spleen on the Medical Segmentation Decathlon Task09 dataset. We work with… Source: MarkTechPost.

June 11 2026-06-11 Briefs
MarkTechPost Official News

MarkTechPost: Perplexity Moves Deep Research Into Computer, Routing Research Subtasks Across 20+ Frontier …

Summary: Deep Research now lives inside Perplexity Computer, breaking hard questions into subtasks and routing across 20+ frontier models. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: xAI Ships Grok Build Plugin Marketplace With MongoDB, Vercel, Sentry, Chrome DevTools, Cloud…

Summary: Grok Build's in-terminal marketplace bundles skills, agents, hooks, and MCP servers, with commit-SHA verification on every remote plugin. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Nous Research Ships Hermes Agent Profile Builder: Identity, Model, Skills, and MCP Servers i…

Summary: The Hermes Agent dashboard now builds complete agent profiles in one flow, replacing multi-step CLI setup for users. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Meet ‘North Mini Code’: Cohere’s 30B Open-Weight Mixture-of-Experts Model With 3B Active Par…

Summary: Cohere's first developer coding model is a 30B mixture-of-experts running on a single H100 with 256K context length. Source: MarkTechPost.

June 10 2026-06-10 Briefs
MarkTechPost Official News

MarkTechPost: A Coding Implementation on Microsoft SkillOpt for Instrumented Prompt Optimization, Skill Ev…

Summary: We implement an instrumented workflow for Microsoft SkillOpt end to end. We set up the repository, connect OpenAI-compatible model access, and configure the optimizer and target mo… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Google AI Releases DiffusionGemma, a 26B MoE Open Model Using Text Diffusion for Up to 4x Fa…

Summary: DiffusionGemma is Google DeepMind's experimental 26B open model using text diffusion for up to 4x faster generation on GPUs. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Top AI Coding Agents and Development Platforms in 2026: Atoms, Devin, Windsurf, Cursor, Warp…

Summary: Software development has changed. Engineers no longer type most code by hand. They describe intent, and AI agents do the work. Modern tools plan tasks, edit across files, run tests… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Anthropic Releases Claude Fable 5 and Claude Mythos 5: Same Underlying Model, Different Safe…

Summary: Claude Fable 5 ships generally available with classifiers; Mythos 5 stays limited, cyber safeguards lifted, through Project Glasswing. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Building a Code Dataset Pipeline from NVIDIA Nemotron-Pretraining-Code-v3 Metadata with Stre…

Summary: In this tutorial, we work with NVIDIA's Nemotron-Pretraining-Code-v3 dataset as a large-scale metadata index for code pretraining research. We stream the dataset instead of downloa… Source: MarkTechPost.

June 9 2026-06-09 Briefs
MarkTechPost Official News

MarkTechPost: Google Releases Gemini 3.5 Live Translate, a Streaming Speech-to-Speech Audio Model Covering…

Summary: Gemini 3.5 Live Translate streams speech-to-speech translation across 70+ languages. It generates audio continuously, staying a few seconds behind the speaker. The model reaches de… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: NVIDIA cuTile Python Tutorial: Building Tiled GPU Kernels for Vector Addition, Matrix Additi…

Summary: In this tutorial, we implement a hands-on workflow for NVIDIA cuTile Python, a tile-based GPU programming interface for CUDA-style kernels in Python. We prepare a Colab-friendly en… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: A New Study from Harvard and Perplexity Finds AI Agents Perform 26 Minutes of Autonomous Wor…

Summary: A new Harvard and Perplexity paper uses matched-pair sessions to compare an autonomous agent with a search assistant. It finds large gains in autonomy, time, and cost, plus broader… Source: MarkTechPost.

June 8 2026-06-08 Briefs
MarkTechPost Official News

MarkTechPost: ClawHub Security Signals: A Coding Guide to End-to-End Security Signal Analysis and Verdict …

Summary: In this tutorial, we explore the ClawHub Security Signals dataset to see how scanners assess AI skills. We load the data from the Hugging Face Parquet conversion and inspect verdic… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Xiaomi MiMo and TileRT Push a 1-Trillion-Parameter Model Past 1000 Tokens Per Second on Comm…

Summary: Xiaomi's MiMo team, with TileRT, released MiMo-V2.5-Pro-UltraSpeed, a serving mode for the MiMo-V2.5-Pro model. It decodes over 1000 tokens per second on a 1-trillion-parameter mod… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Microsoft AI Introduces MAI-Transcribe-1.5: 2.4% WER on Artificial Analysis, Best-in-Class F…

Summary: Microsoft AI has released MAI-Transcribe-1.5, the second iteration of its in-house speech-to-text family. The model covers 43 languages, adds keyword (entity) biasing for domain-sp… Source: MarkTechPost.

June 7 2026-06-07 Briefs
MarkTechPost Official News

MarkTechPost: Building Reflective Prompt Optimization with GEPA: Multi-Component Prompts, Structured Feedb…

Summary: In this tutorial, we use GEPA as a reflective prompt-evolution framework to improve how a small language model solves multi-step arithmetic word problems. We start from a weak seed… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Meet Harness-1: A 20B Retrieval Subagent Trained With Reinforcement Learning Inside a Statef…

Summary: UIUC and Chroma's Harness-1 is a 20B retrieval subagent trained with reinforcement learning inside a stateful search harness. The harness maintains the bookkeeping: candidate pool… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: NVIDIA garak Tutorial: Build a Complete Defensive LLM Red-Teaming Workflow with Custom Probe…

Summary: This tutorial walks through NVIDIA garak as an end-to-end framework for defensive LLM red-teaming. It covers setup, plugin discovery, dry runs, real-model scans on a Hugging Face g… Source: MarkTechPost.

June 6 2026-06-06 Briefs
MarkTechPost Official News

MarkTechPost: Google’s New Colab CLI Lets Developers and AI Agents Run Python on Remote Colab GPUs and TPU…

Summary: Google released the Colab CLI, letting developers and AI agents run local code on remote Colab GPU and TPU runtimes. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Moonshot AI Releases Kimi Code CLI: A Terminal AI Coding Agent Built in TypeScript for Next-…

Summary: Kimi Code CLI is Moonshot AI's open-source terminal coding agent, written in TypeScript with subagents and MCP configuration. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: NVIDIA Releases Nemotron 3.5 ASR: A 600M-Parameter Cache-Aware Streaming Model Transcribing …

Summary: NVIDIA released Nemotron 3.5 ASR, a cache-aware 600M streaming model transcribing 40 language-locales in real time from one checkpoint. Source: MarkTechPost.

June 5 2026-06-05 Briefs
MarkTechPost Official News

MarkTechPost: A Hands-On Coding Tutorial on Qualcomm AI Hub Models for Classification, Object Detection, a…

Summary: Set up Qualcomm AI Hub Models to run MobileNet-V2 inference, YOLOv7 detection, and compile models on real devices. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Microsoft Fara Tutorial: Run a Browser-Use Agent in Google Colab with a Mock OpenAI-Compatib…

Summary: A hands-on guide to running Microsoft Fara in Colab, testing the browser agent loop with a mock endpoint. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Perplexity AI Introduces Hybrid Local-Server Inference Orchestrator for Personal Computer: A…

Summary: Perplexity AI announces a hybrid local-server inference orchestrator for Personal Computer, automatically routing AI tasks between on-device and cloud models. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: NVIDIA AI Releases Dynamo Snapshot: A CRIU-Based Fast Startup System for AI Inference on Kub…

Summary: NVIDIA Dynamo Snapshot checkpoints and restores vLLM inference workers on Kubernetes using CRIU and cuda-checkpoint tools. Source: MarkTechPost.

June 4 2026-06-04 Briefs
MarkTechPost Official News

MarkTechPost: Building a Semantic Search Engine and Open-Status Classifier over the ResearchMath-14k Datas…

Summary: This tutorial walks through a complete NLP pipeline for research-level mathematics. Using the ResearchMath-14k dataset, we extract field-specific keywords with TF-IDF, generate sen… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: NVIDIA AI Releases Nemotron 3 Ultra: An Open 550B Mixture-of-Experts Hybrid Mamba-Transforme…

Summary: NVIDIA has released Nemotron 3 Ultra, a 550B total (55B active) open Mixture-of-Experts hybrid Mamba-Transformer for long-running agents. It pairs a 1M-token context with up to ~6x… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Miso Labs Releases MisoTTS: An 8B Emotive Text-to-Speech Model with Open Weights

Summary: Miso Labs has released MisoTTS, an open-weights 8B text-to-speech model. It uses residual vector quantization (RVQ) to scale its sonic range without scaling parameters, and conditi… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Meet OpenJarvis: A Local-First Framework for On-Device Personal AI Agents with Tools, Memory…

Summary: Stanford researchers released OpenJarvis, an open-source framework that runs inference, agents, memory, and learning entirely on-device. It decomposes a personal AI system into fiv… Source: MarkTechPost.

June 3 2026-06-03 Briefs
MarkTechPost Official News

MarkTechPost: Google DeepMind Releases Gemma 4 12B: An Encoder-Free Multimodal Model with Native audio tha…

Summary: Gemma 4 12B feeds vision and audio straight into the LLM backbone, running locally under an Apache 2.0 license. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0…

Summary: Hermes Desktop is a no-terminal GUI sharing one agent core, skills, and memory with the Hermes Agent CLI. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: NVIDIA Releases Cosmos 3: A Two-Tower Mixture-of-Transformers Foundation Model Unifying Phys…

Summary: NVIDIA released Cosmos 3, open omnimodal world models pairing an autoregressive VLM reasoner with a diffusion generator for physical AI. Source: MarkTechPost.

June 2 2026-06-02 Briefs
MarkTechPost Official News

MarkTechPost: TinyFish Launches BigSet: An Open-Source Multi-Agent System That Builds Structured Live Data…

Summary: Describe a dataset in one sentence; Bigset's orchestrator and parallel sub-agents research the live web and return structured tables. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pi…

Summary: JetBrains releases Mellum2 under Apache 2.0, a 12B MoE model trained on 10.6 trillion tokens for AI workflows. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: How to Speed Up Transformer Training Using NVIDIA Apex (FusedAdam, FusedLayerNorm) and Nativ…

Summary: We build NVIDIA Apex from source, detect fused kernels, and benchmark FusedAdam, FusedLayerNorm, and torch.amp in Transformer training. Source: MarkTechPost.

June 1 2026-06-01 Briefs
MarkTechPost Official News

MarkTechPost: MiniMax Releases MiniMax M3 with MSA Architecture Supporting 1M-Token Context, Native Multim…

Summary: MiniMax M3 introduces MiniMax Sparse Attention, a 1M-token context window, and native image, video, and computer use support. Source: MarkTechPost.