PaddleOCR_MCP_en

更新时间：2025-12-19

PaddleOCR Official MCP Server Documentation

PaddleOCR provides a lightweight MCP server, which can be used to quickly integrate PaddleOCR’s text recognition, document parsing, and other capabilities into various large model applications. The PaddleOCR MCP server currently supports the following tools:

Currently Supported Tools
- OCR: Perform text detection and recognition on images and PDF files.
- PP-StructureV3: Identify and extract text blocks, titles, paragraphs, images, tables, and other layout elements from images or PDF files, converting the input into a Markdown document.
- PaddleOCR-VL：A high-accuracy multimodal document parsing model that analyzes text, tables, formulas, and charts in images or PDF files, converting the input into a Markdown document.

The PaddleOCR MCP server can be integrated into various applications to enable text recognition and document parsing capabilities. Here, we use Claude for Desktop as an example to demonstrate the steps for quickly connecting to the PaddleOCR MCP server:

Install uv.

Locate the Claude for Desktop configuration file in one of the following locations:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
Linux: ~/.config/Claude/claude_desktop_config.json

Open the claude_desktop_config.json file, refer to the following example to adjust the configuration, and paste it into your claude_desktop_config.json.

For the PP-OCRv5 service:

{
  "mcpServers": {
    "PP-OCRv5": {
      "command": "uvx",
      "args": [
        "--from",
        "paddleocr-mcp@https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/mcp/paddleocr_mcp/releases/v0.2.0/paddleocr_mcp-0.2.0-py3-none-any.whl",
        "paddleocr_mcp"
      ],
      "env": {
        "PADDLEOCR_MCP_PIPELINE": "OCR",
        "PADDLEOCR_MCP_PPOCR_SOURCE": "aistudio",
        "PADDLEOCR_MCP_SERVER_URL": "<your-server-url>",
        "PADDLEOCR_MCP_AISTUDIO_ACCESS_TOKEN": "<your-access-token>"
      }
    }
  }
}

For the PP-StructureV3 service:

{
  "mcpServers": {
    "PP-StructureV3": {
      "command": "uvx",
      "args": [
        "--from",
        "paddleocr-mcp@https://paddle-model-ecology.bj.bcebos.com/paddlex/PaddleX3.0/mcp/paddleocr_mcp/releases/v0.2.0/paddleocr_mcp-0.2.0-py3-none-any.whl",
        "paddleocr_mcp"
      ],
      "env": {
        "PADDLEOCR_MCP_PIPELINE": "PP-StructureV3",
        "PADDLEOCR_MCP_PPOCR_SOURCE": "aistudio",
        "PADDLEOCR_MCP_SERVER_URL": "<your-server-url>",
        "PADDLEOCR_MCP_AISTUDIO_ACCESS_TOKEN": "<your-access-token>"
      }
    }
  }
}

For the PaddleOCR-VL service：

{
  "mcpServers": {
    "PaddleOCR-VL": {
      "command": "uvx",
      "args": [
        "--from",
        "paddleocr-mcp",
        "paddleocr_mcp"
      ],
      "env": {
        "PADDLEOCR_MCP_PIPELINE": "PaddleOCR-VL",
        "PADDLEOCR_MCP_PPOCR_SOURCE": "aistudio",
        "PADDLEOCR_MCP_SERVER_URL": "<your url>",
        "PADDLEOCR_MCP_AISTUDIO_ACCESS_TOKEN": "<your-access-token>"
      }
    }
  }
}

PADDLEOCR_MCP_SERVER_URL value is the base URL of the corresponding model service (e.g., https://xxxxxx.aistudio-app.com)， obtain your access token from this page.

Restart Claude for Desktop. The new paddleocr-ocr tool should now be available in the application.

Besides the usage described above, the PaddleOCR MCP server also supports more features, such as integrating local Python libraries and using Streamable HTTP transmission. For more capabilities of the PaddleOCR MCP server, please refer to the official PaddleOCR documentation.

PaddleOCR_MCP

PaddleOCR API