AI Daily Briefs

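A timeline of AI products, models, open-source tools, and official announcements. Past entries are retained and can be filtered further by category, date, and tag.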

1335 archived briefs
80 open-source tools
80 current results
June 24: yesterday's briefs
MarkTechPost Official News

MarkTechPost: How to Design an OpenHarness Style Agent Runtime with Tools, Memory, Permissions, Skills, an…

Original summary: In this tutorial, we build an OpenHarness style agent harness from scratch to see how a practical agent system works. We recreate the core building blocks: tool use, typed tool sch… Source: MarkTechPost. Consider reading the original article and checking in particular the tool entry points, costs, risks, and real-world use cases it affects.

MarkTechPost Official News

MarkTechPost: Nous Research Adds /learn to Hermes Agent’s Skills System, Capturing Workflows as Slash Comm…

Original summary: Nous Research has added /learn to the Hermes Agent Skills System. The command authors a standards-compliant SKILL.md from a local directory, a doc URL, a past conversation, or past… Source: MarkTechPost.

June 23: 2026-06-23 briefs
MarkTechPost Official News

MarkTechPost: GLM-5.2 OpenAI-Compatible API: A Hands-On Guide to Reasoning Effort, Function Calling, and L…

Original summary: We build a practical GLM-5.2 workflow using its hosted, OpenAI-compatible API instead of running the model locally. We set up multiple providers, load the API key securely, and cre… Source: MarkTechPost.

June 22: 2026-06-22 briefs
MarkTechPost Official News

MarkTechPost: xAI Launches /goal in Grok Build, Adding Long-Running Autonomous Execution With Built-In Ver…

Original summary: xAI introduced /goal in Grok Build, a mode for long-running, autonomous task execution. You hand off one objective, and the agent plans an approach, executes a progress checklist, … Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Sakana AI Launches Sakana Fugu: An Orchestration Model That Routes Tasks Across a Swappable …

Original summary: Fugu and Fugu Ultra route tasks across a swappable model pool, leading most coding, reasoning, and agentic benchmarks. Source: MarkTechPost.

June 21: 2026-06-21 briefs
MarkTechPost Official News

MarkTechPost: Crawlee for Python: Build a Web Crawling Pipeline with Robots Handling, Link Graphs, and RAG…

Original summary: In this tutorial, we build a complete Crawlee for Python workflow from setup to AI-ready output. We generate a local demo website, then crawl it with BeautifulSoupCrawler, ParselCr… Source: MarkTechPost.

June 20: 2026-06-20 briefs
MarkTechPost Official News

MarkTechPost: Nous Research Updates Hermes Agent With a Blank Slate Mode That Pins Toolsets via platform_t…

Original summary: Nous Research has added a Blank Slate setup mode to its open-source Hermes Agent. It starts an agent with everything off except provider, model, File Operations, and Terminal. You… Source: MarkTechPost.

June 19: 2026-06-19 briefs
MarkTechPost Official News

MarkTechPost: NVIDIA AI Introduce SpatialClaw: A Training-Free Agent That Treats Code as the Action Interf…

Original summary: SpatialClaw is a training-free agent that writes Python in a persistent kernel, composing perception tools for 3D spatial reasoning. Source: MarkTechPost.

June 18: 2026-06-18 briefs
MarkTechPost Official News

MarkTechPost: Perplexity Launches Brain, a Self-Improving Memory System That Builds a Context Graph of an …

Original summary: Perplexity has launched Brain, a self-improving memory system for its Computer agent. Instead of remembering the user, Brain remembers the agent's work: what worked, what failed, … Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: OpenAI Releases LifeSciBench, a 750-Task Benchmark Grading AI Models on Real Life-Science Re…

Original summary: OpenAI's LifeSciBench evaluates whether frontier AI can handle real life-science research across 750 expert-authored tasks, seven workflows, and seven biological domains. Built by… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: NVIDIA SkillSpector Guide: Scanning AI Skills for Security Risks with Static Analysis and SA…

Original summary: In this tutorial, we use NVIDIA SkillSpector to evaluate AI skills for security risks before deployment. We build a corpus of benign and deliberately vulnerable skills, then scan t… Source: MarkTechPost.

June 17: 2026-06-17 briefs
MarkTechPost Official News

MarkTechPost: Vercel Releases Eve: An Open-Source AI Agent Framework Where Each Agent is a Directory of Fi…

Original summary: Vercel has open-sourced eve, an Apache-2.0 agent framework now in public preview. An agent is a directory of files, with durable execution, sandboxes, approvals, connections, chann… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: OpenAI’s Deployment Simulation Extends Pre-Deployment Risk Assessment to Agentic Coding Thro…

Original summary: OpenAI introduced Deployment Simulation on June 16, 2026. The method replays past conversations through a new candidate model before release. It then grades the completions to esti… Source: MarkTechPost.

June 16: 2026-06-16 briefs
MarkTechPost Official News

MarkTechPost: Hermes Agent Adds Asynchronous Subagents, So Delegated Work No Longer Blocks the Parent Chat

Original summary: We look at Hermes Agent's new asynchronous subagents from Nous Research. The delegate tool can now spawn background agents that no longer block the parent chat. We walk through the… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Meet Atoms: A Vibe Coding Tool That Uses AI Agents to Build, Deploy, and Market Your App (No…

Original summary: The concept of vibe coding is interesting; you don’t need to be a developer or software engineer to build your own applications. You can describe your idea to an AI in plain langua… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Google Cloud Introduces Open Knowledge Format (OKF): A Vendor-Neutral Markdown Spec for Givi…

Original summary: We break down Google Cloud's new Open Knowledge Format (OKF), an open spec that formalizes the LLM-wiki pattern. We explain how a bundle works: a directory of markdown files with Y… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: How to Build a Parsing Pipeline with Docling Parse for Layout-Aware Document Intelligence

Original summary: In this tutorial, we build a workflow that uses Docling Parse to analyze PDF documents at a detailed structural level. We prepare a stable Python environment, handle common Colab d… Source: MarkTechPost.

June 15: 2026-06-15 briefs
MarkTechPost Official News

MarkTechPost: Sakana AI Commercializes AB-MCTS in Sakana Marlin, an Enterprise Agent Generating Up to 100-…

Original summary: Sakana AI's first commercial product runs autonomously for up to eight hours per task. It returns multi-page reports and slides, built on AB-MCTS and AI Scientist workflows. Source: MarkTechPost.

June 14: 2026-06-14 briefs
MarkTechPost Official News

MarkTechPost: Databricks Open-Sources Omnigent: A Meta-Harness That Composes, Governs, and Shares AI Agent…

Original summary: Databricks has open-sourced Omnigent, a meta-harness that sits above coding agents like Claude Code, Codex, and Pi. It adds composition, contextual policies, and live session shari… Source: MarkTechPost.

June 12: 2026-06-12 briefs
MarkTechPost Official News

MarkTechPost: Moonshot AI Launches Kimi Work, a Local Desktop Agent Reportedly Running on Kimi K2.6 With a…

Original summary: Moonshot AI's Kimi Work is a local desktop agent for macOS and Windows. It runs a 300-sub-agent swarm, drives your logged-in browser via WebBridge, and schedules background jobs. T… Source: MarkTechPost.

June 11: 2026-06-11 briefs
MarkTechPost Official News

MarkTechPost: xAI Ships Grok Build Plugin Marketplace With MongoDB, Vercel, Sentry, Chrome DevTools, Cloud…

Original summary: Grok Build's in-terminal marketplace bundles skills, agents, hooks, and MCP servers, with commit-SHA verification on every remote plugin. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Nous Research Ships Hermes Agent Profile Builder: Identity, Model, Skills, and MCP Servers i…

Original summary: The Hermes Agent dashboard now builds complete agent profiles in one flow, replacing multi-step CLI setup for users. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Meet ‘North Mini Code’: Cohere’s 30B Open-Weight Mixture-of-Experts Model With 3B Active Par…

Original summary: Cohere's first developer coding model is a 30B mixture-of-experts running on a single H100 with 256K context length. Source: MarkTechPost.

June 10: 2026-06-10 briefs
MarkTechPost Official News

MarkTechPost: A Coding Implementation on Microsoft SkillOpt for Instrumented Prompt Optimization, Skill Ev…

Original summary: We implement an instrumented workflow for Microsoft SkillOpt end to end. We set up the repository, connect OpenAI-compatible model access, and configure the optimizer and target mo… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Top AI Coding Agents and Development Platforms in 2026: Atoms, Devin, Windsurf, Cursor, Warp…

Original summary: Software development has changed. Engineers no longer type most code by hand. They describe intent, and AI agents do the work. Modern tools plan tasks, edit across files, run tests… Source: MarkTechPost.

June 9: 2026-06-09 briefs
MarkTechPost Official News

MarkTechPost: NVIDIA cuTile Python Tutorial: Building Tiled GPU Kernels for Vector Addition, Matrix Additi…

Original summary: In this tutorial, we implement a hands-on workflow for NVIDIA cuTile Python, a tile-based GPU programming interface for CUDA-style kernels in Python. We prepare a Colab-friendly en… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: A New Study from Harvard and Perplexity Finds AI Agents Perform 26 Minutes of Autonomous Wor…

Original summary: A new Harvard and Perplexity paper uses matched-pair sessions to compare an autonomous agent with a search assistant. It finds large gains in autonomy, time, and cost, plus broader… Source: MarkTechPost.

June 7: 2026-06-07 briefs
MarkTechPost Official News

MarkTechPost: Meet Harness-1: A 20B Retrieval Subagent Trained With Reinforcement Learning Inside a Statef…

Original summary: UIUC and Chroma's Harness-1 is a 20B retrieval subagent trained with reinforcement learning inside a stateful search harness. The harness maintains the bookkeeping: candidate pool… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: NVIDIA garak Tutorial: Build a Complete Defensive LLM Red-Teaming Workflow with Custom Probe…

Original summary: This tutorial walks through NVIDIA garak as an end-to-end framework for defensive LLM red-teaming. It covers setup, plugin discovery, dry runs, real-model scans on a Hugging Face g… Source: MarkTechPost.

June 6: 2026-06-06 briefs
MarkTechPost Official News

MarkTechPost: Google’s New Colab CLI Lets Developers and AI Agents Run Python on Remote Colab GPUs and TPU…

Original summary: Google released the Colab CLI, letting developers and AI agents run local code on remote Colab GPU and TPU runtimes. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Moonshot AI Releases Kimi Code CLI: A Terminal AI Coding Agent Built in TypeScript for Next-…

Original summary: Kimi Code CLI is Moonshot AI's open-source terminal coding agent, written in TypeScript with subagents and MCP configuration. Source: MarkTechPost.

June 5: 2026-06-05 briefs
MarkTechPost Official News

MarkTechPost: Microsoft Fara Tutorial: Run a Browser-Use Agent in Google Colab with a Mock OpenAI-Compatib…

Original summary: A hands-on guide to running Microsoft Fara in Colab, testing the browser agent loop with a mock endpoint. Source: MarkTechPost.

June 4: 2026-06-04 briefs
MarkTechPost Official News

MarkTechPost: NVIDIA AI Releases Nemotron 3 Ultra: An Open 550B Mixture-of-Experts Hybrid Mamba-Transforme…

Original summary: NVIDIA has released Nemotron 3 Ultra, a 550B total (55B active) open Mixture-of-Experts hybrid Mamba-Transformer for long-running agents. It pairs a 1M-token context with up to ~6x… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Meet OpenJarvis: A Local-First Framework for On-Device Personal AI Agents with Tools, Memory…

Original summary: Stanford researchers released OpenJarvis, an open-source framework that runs inference, agents, memory, and learning entirely on-device. It decomposes a personal AI system into fiv… Source: MarkTechPost.

June 3: 2026-06-03 briefs
MarkTechPost Official News

MarkTechPost: Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0…

Original summary: Hermes Desktop is a no-terminal GUI sharing one agent core, skills, and memory with the Hermes Agent CLI. Source: MarkTechPost.

June 2: 2026-06-02 briefs
MarkTechPost Official News

MarkTechPost: TinyFish Launches BigSet: An Open-Source Multi-Agent System That Builds Structured Live Data…

Original summary: Describe a dataset in one sentence; BigSet's orchestrator and parallel sub-agents research the live web and return structured tables. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: JetBrains Releases Mellum2: A 12B MoE Model for Fast, Specialized Tasks in Multi-Model AI Pi…

Original summary: JetBrains releases Mellum2 under Apache 2.0, a 12B MoE model trained on 10.6 trillion tokens for AI workflows. Source: MarkTechPost.

June 1: 2026-06-01 briefs
MarkTechPost Official News

MarkTechPost: MiniMax Releases MiniMax M3 with MSA Architecture Supporting 1M-Token Context, Native Multim…

Original summary: MiniMax M3 introduces MiniMax Sparse Attention, a 1M-token context window, and native image, video, and computer use support. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Meet Memory OS: A 6-Layer Open-Source Memory Stack Built on Top of Hermes Agent

Original summary: The open-source project adds local persistent memory to Hermes Agent through six layers, gated retrieval, and a wiki. Source: MarkTechPost.

May 31: 2026-05-31 briefs
MarkTechPost Official News

MarkTechPost: An Implementation of the Microsoft Agent Governance Toolkit for Safe AI Agent Tool Use with …

Original summary: In this tutorial, we build a governed AI-agent workflow using Microsoft’s Agent Governance Toolkit as the reference point. We create a Colab-ready implementation where agents do no… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Build Skill-Augmented AI Agents with SkillNet for Search, Evaluation, Graph Analysis, and Ta…

Original summary: In this tutorial, we implement a SkillNet use case as a practical framework for discovering, installing, inspecting, evaluating, and organizing reusable AI skills. Source: MarkTechPost.

May 30: 2026-05-30 briefs
MarkTechPost Official News

MarkTechPost: Hermes Agent Ships Tool Search for MCP: Anthropic Evals Show 49% to 74% Accuracy Gain on Opu…

Original summary: Nous Research's Hermes Agent adds Tool Search to fix MCP context bloat using BM25 progressive schema disclosure. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: How to Use AgentTrove: Streaming 1.7M Agentic Traces and Building a Clean ShareGPT SFT Datas…

Original summary: AgentTrove is the largest open-source collection of agentic interaction traces, with 1.7M rows in a ShareGPT-style layout. This hands-on Python tutorial shows how to stream the dat… Source: MarkTechPost.

May 29: 2026-05-29 briefs
MarkTechPost Official News

MarkTechPost: StepFun Releases Step 3.7 Flash: A 198B MoE Vision-Language Model for Coding Agents and Sear…

Original summary: StepFun releases Step 3.7 Flash, a 198B MoE model with native vision, 256k context, and Advisor Mode. Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Hexo Labs Open-Sources SIA: A Self-Improving Agent That Updates Both the Harness and the Mod…

Original summary: Hexo Labs released SIA, an open-source self-improving loop, under an MIT license. A Feedback-Agent reads each run's trajectory, then either rewrites the scaffold or triggers a LoRA… Source: MarkTechPost.

May 28: 2026-05-28 briefs
MarkTechPost Official News

MarkTechPost: Anthropic Ships Claude Opus 4.8 Alongside Dynamic Workflows and Cheaper Fast Mode, With Work…

Original summary: Anthropic's Claude Opus 4.8 brings dynamic workflows and cheaper fast mode to Claude Code, now in research preview. Source: MarkTechPost.

May 27: 2026-05-27 briefs
MarkTechPost Official News

MarkTechPost: NVIDIA Releases Polar, a Token-Faithful Rollout Framework for GRPO Training Across Codex, Cl…

Original summary: NVIDIA researchers have introduced Polar, a rollout framework that trains language agents using reinforcement learning without modifying their agent harnesses. Polar places a model… Source: MarkTechPost.

May 25: 2026-05-25 briefs
MarkTechPost Official News

MarkTechPost: WorkOS Releases auth.md: An Open Agent Registration Protocol Built on OAuth Standards

Original summary: Most web applications still have no structured way for an AI agent to register. auth.md proposes a fix: a Markdown file apps publish at their domain that tells agents which registr… Source: MarkTechPost.

May 24: 2026-05-24 briefs
MarkTechPost Official News

MarkTechPost: Build a Complete Langfuse Observability and Evaluation Pipeline for Tracing, Prompt Manageme…

Original summary: In this tutorial, we implement the Langfuse (an open-source LLM engineering platform) pipeline for tracing, prompt management, scoring, datasets, and experiments. We build a comple… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Microsoft Research Releases Webwright: A Terminal-Native Web Agent Framework That Scores 60.…

Original summary: Microsoft Research introduces Webwright, a terminal-native browser agent framework that replaces click-trace web automation with reusable Playwright scripts. Using a single agent l… Source: MarkTechPost.

May 23: 2026-05-23 briefs
MarkTechPost Official News

MarkTechPost: Build a SuperClaude Framework Workflow with Commands, Agents, Modes, and Session Memory

Original summary: In this tutorial, we build an advanced workflow using the SuperClaude Framework as a structured layer on top of the Anthropic API. Source: MarkTechPost.

May 22: 2026-05-22 briefs
MarkTechPost Official News

MarkTechPost: Microsoft Releases Fara1.5: A Family of Browser Computer-Use Agents (4B/9B/27B) That Outperf…

Original summary: Microsoft Research released Fara1.5, a family of browser computer-use agents in 4B, 9B, and 27B sizes. Fara1.5-27B scores 72% on Online-Mind2Web, outperforming OpenAI Operator, Gem… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Build Recurrent-Depth Transformers with OpenMythos for MLA, GQA, Sparse MoE, and Loop-Scaled…

Original summary: In this tutorial, we explore OpenMythos by building an advanced recurrent-depth transformer workflow that runs end-to-end in Google Colab. We create both MLA and GQA model variants… Source: MarkTechPost.

May 21: 2026-05-21 briefs
MarkTechPost Official News

MarkTechPost: Qwen Introduces Qwen3.7-Max: A Reasoning Agent Model With a 1M-Token Context Window

Original summary: Alibaba's Qwen team introduced Qwen3.7-Max at the 2026 Alibaba Cloud Summit, describing it as its most advanced and comprehensive agent model to date. The model features a 1M-token… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Cohere Releases Command A+: A 218B Sparse MoE Model for Agentic Workflows That Runs on as Fe…

Original summary: Cohere releases Command A+, an open-source 218B Sparse Mixture-of-Experts model consolidating four prior Command A variants into one. It runs on as few as two H100 GPUs at W4A4 qua… Source: MarkTechPost.

May 20: 2026-05-20 briefs
MarkTechPost Official News

MarkTechPost: Google Introduces Gemini 3.5 Flash at I/O 2026: A Faster and Cheaper Model for AI Agents and…

Original summary: Google's Gemini 3.5 Flash beats its own flagship on coding and agentic benchmarks while running four times faster and at half the cost. Source: MarkTechPost.

May 19: 2026-05-19 briefs
MarkTechPost Official News

MarkTechPost: Google Launches Antigravity 2.0 at I/O 2026: A Standalone Agent-First Platform with CLI, SDK…

Original summary: Google used its I/O 2026 developer keynote to ship a meaningful architectural shift in how it packages AI-assisted development. The company announced Google Antigravity 2.0, a sta… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: How to Build an Advanced Agentic AI System with Planning, Tool Calling, Memory, and Self-Cri…

Original summary: In this tutorial, we build an advanced agentic AI system using the OpenAI API and a hidden terminal prompt for the API key. We design the agent as a small pipeline of specialized r… Source: MarkTechPost.

May 18: 2026-05-18 briefs
MarkTechPost Official News

MarkTechPost: Meet MemPrivacy: An Edge-Cloud Framework that Uses Local Reversible Pseudonymization to Prot…

Original summary: As LLM-powered agents move from research to production, one design tension is becoming harder to ignore: the more useful cloud-hosted memory becomes, the more private user data it… Source: MarkTechPost.

May 17: 2026-05-17 briefs
May 16: 2026-05-16 briefs
May 15: 2026-05-15 briefs
MarkTechPost Official News

MarkTechPost: How to Build an MCP Style Routed AI Agent System with Dynamic Tool Exposure Planning, Execut…

Original summary: In this tutorial, we build a fully functional MCP-style routed agent system from scratch, combining tool discovery, intelligent routing, structured planning, and execution into a s… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: Best AI Agents for Software Development Ranked: A Benchmark-Driven Look at the Current Field

Original summary: The AI coding agent field in 2026 is more capable, more fragmented, and harder to evaluate than it looks. Claude Code leads on code quality at 87.6% SWE-bench Verified. GPT-5.5 to… Source: MarkTechPost.

May 14: 2026-05-14 briefs
MarkTechPost Official News

MarkTechPost: Cline Releases Cline SDK: An Open-Source Agent Runtime Now Powering Its CLI and Kanban, With…

Original summary: Cline has extracted its internal agent harness into an open-source TypeScript SDK called @cline/sdk, the same runtime now powering its CLI and Kanban, with VS Code and JetBrains ex… Source: MarkTechPost.

May 12: 2026-05-12 briefs
MarkTechPost Official News

MarkTechPost: Build a Hybrid-Memory Autonomous Agent with Modular Architecture and Tool Dispatch Using Ope…

Original summary: In this tutorial, we begin by exploring the architecture behind a hybrid-memory autonomous agent. This system combines semantic vector search, keyword-based retrieval, and a modula… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: A Coding Implementation to Portfolio Optimization with skfolio for Building Testing, Tuning,…

Original summary: In this tutorial, we explore skfolio, a scikit-learn compatible portfolio optimization library that helps us build, compare, and evaluate different investment strategies in a struc… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: OpenAI Introduces Daybreak: A Cybersecurity Initiative That Puts Codex Security at the Cente…

Original summary: OpenAI just launched Daybreak, a cybersecurity initiative that combines the company’s frontier AI models with Codex Security, its coding-focused agentic system, and a broad netw… Source: MarkTechPost.

May 11: 2026-05-11 briefs
MarkTechPost Official News

MarkTechPost: How to Build Technical Analysis and Backtesting Workflows with pandas-ta-classic, Strategy Si…

Original summary: In this tutorial, we show how to use pandas-ta-classic to build a complete technical analysis and trading strategy workflow. We start by installing the required libraries, dow… Source: MarkTechPost.

MarkTechPost Official News

MarkTechPost: A Coding Implementation to Build Agent-Native Memory Infrastructure with Memori for Persiste…

Original summary: In this tutorial, we show how Memori serves as an agent-native memory infrastructure layer for building more persistent, context-aware LLM applications. We start by setting up… Source: MarkTechPost.

May 10: 2026-05-10 briefs
MarkTechPost Official News

MarkTechPost: Best Vector Databases in 2026: Pricing, Scale Limits, and Architecture Tradeoffs Across Nine…

Original summary: Vector databases are now core retrieval infrastructure for RAG and agentic AI. This guide compares nine production options on architecture, pricing, and scale. Source: MarkTechPost.