Full-Code · Model-Vendor Agentsactive

Zhipu GLM Agent

Type: full-code  ·  Vendor: Zhipu AI (Z.ai)  ·  Language: API  ·  License: proprietary (hosted API) / MIT (GLM open weights since July 2025)  ·  Status: active  ·  Status in practice: emerging

Links: homepage docs

Tsinghua-spinoff Zhipu (rebranded internationally as Z.ai in July 2025) ships the GLM model line through the BigModel platform, with function calling, web search, knowledge-base retrieval and JSON output all available as tool/response options on the same chat-completion API; AutoGLM is the consumer agent built on top.

Description. Zhipu (Z.ai) is one of China's three biggest LLM providers and IPO'd on the Hong Kong Stock Exchange on 8 January 2026. Its GLM model line has been MIT-open-source since July 2025 (GLM-4.5 onward), with the latest GLM-5/5.1 and GLM-5V-Turbo positioned for agentic engineering, coding and long-chain execution. The BigModel chat-completion API supports tools: function calling, knowledge-base retrieval and web search are listed together as tool options, and structured output is selectable via response_format json_object. AutoGLM is the consumer agent application that uses voice commands to complete tasks on a phone and can self-execute operations exceeding 50 steps across applications.

Agent loop shape. Two surfaces. (1) BigModel chat-completion API: standard tools-loop where tools[] can include functions, knowledge-base retrieval, or web search; the model emits tool_calls, the caller executes, results return as tool messages; response_format json_object pins shape. (2) AutoGLM: hosted consumer agent that takes voice commands, plans multi-step actions, and self-executes >50-step workflows across smartphone apps.

Primary use cases

  • function-calling agents over GLM hosted API
  • RAG via knowledge-base retrieval and web search as built-in tools
  • JSON-structured extraction via response_format
  • consumer voice agent (AutoGLM) on smartphones

Key concepts

  • GLM-4.6 / GLM-5 / GLM-5.1 (docs)Flagship Zhipu models; GLM-4.6 raises context to 200K, GLM-5/5.1 positioned as base models for agentic engineering with open-source SOTA-level coding.
  • Tools parameter (function/retrieval/web search) tool-use (docs)Single tools parameter accepts function calls, knowledge-base retrieval, and web search; model emits a JSON-typed call the caller handles.
  • response_format json_object structured-output (docs)Forces the model to return structured JSON; documented as suitable for extraction and structured API responses.
  • AutoGLM mobile-ui-agent (docs)Consumer agent application that uses voice commands to complete tasks; can self-execute operations exceeding 50 steps across smartphone apps.
  • GLM-5-Turbo for agent harnesses (docs)Trained at the training level for core agent capabilities — tool calling, instruction following, long-chain execution — targeted at agent runtimes ('Claw').

Patterns this full-code implements