更新readme

2025-09-24 11:39:30 +08:00
parent c05f2dac58
commit cad7c47458
3 changed files with 298 additions and 344 deletions
--- a/README.md
+++ b/README.md
@@ -17,50 +17,39 @@
 </p>

 <p align="center">
-  A lightweight local file translation tool based on a large language model.
+  A lightweight local file translation tool based on Large Language Models
 </p>

- ✅ **Supports Multiple Formats**: Can translate various files such as `pdf`, `docx`, `xlsx`, `md`, `txt`, `json`,
-  `epub`, `srt`, and more.
- ✅ **Automatic Glossary Generation**: Supports automatic generation of glossaries to ensure term alignment.
- ✅ **PDF Table, Formula, and Code Recognition**: With the `docling` and `mineru` PDF parsing engines, it can recognize
-  and translate tables, formulas, and code frequently found in academic papers.
- ✅ **JSON Translation**: Supports specifying the values to be translated in JSON via JSON paths (using `jsonpath-ng`
-  syntax).
- ✅ **Word/Excel Format-Preserving Translation**: Supports translating `docx` and `xlsx` files (currently not `doc` or
-  `xls` files) while preserving the original formatting.
- ✅ **Multi-AI Platform Support**: Supports most AI platforms, enabling high-performance, concurrent AI translation with
-  custom prompts.
- ✅ **Asynchronous Support**: Designed for high-performance scenarios, it offers complete asynchronous support,
-  providing service interfaces for parallel multitasking.
- ✅ **LAN and Multi-user Support**: Supports simultaneous use by multiple users on a local area network.
+- ✅ **Multiple Format Support**: Translates various files including `pdf`, `docx`, `xlsx`, `md`, `txt`, `json`, `epub`, `srt`, `ass`, and more.
+- ✅ **Automatic Glossary Generation**: Supports automatic generation of glossaries for term alignment.
+- ✅ **PDF Table, Formula, and Code Recognition**: Recognizes and translates tables, formulas, and code often found in academic papers, powered by `docling` and `mineru` PDF parsing engines.
+- ✅ **JSON Translation**: Supports specifying values to be translated in JSON using JSON paths (following `jsonpath-ng` syntax).
+- ✅ **Word/Excel Format Preservation**: Translates `docx` and `xlsx` files while preserving their original formatting (does not yet support `doc` or `xls` files).
+- ✅ **Multi-AI Platform Support**: Compatible with most AI platforms, enabling high-performance, concurrent AI translation with custom prompts.
+- ✅ **Asynchronous Support**: Designed for high-performance scenarios with full asynchronous support, offering service interfaces for parallel tasks.
+- ✅ **LAN and Multi-user Support**: Can be used by multiple people simultaneously on a local area network.
 - ✅ **Interactive Web Interface**: Provides an out-of-the-box Web UI and RESTful API for easy integration and use.
- ✅ **Small-Footprint, Multi-Platform "Lazy" Packages**: Windows and Mac "lazy" packages under 40MB (for versions not
-  using `docling` for local PDF parsing).
+- ✅ **Small, Multi-platform Standalone Packages**: Windows and Mac standalone packages under 40MB (for versions not using the `docling` local PDF parser).

-> When translating `pdf` files, they are first converted to Markdown, which will **lose** the original layout. Users
-> with layout requirements should take note.
+> When translating `pdf` files, they are first converted to Markdown, which will **cause the original layout to be lost**. Users with strict layout requirements should take note.

 > QQ Discussion Group: 1047781902

 **UI Interface**:
 ![Translation Effect](/images/UI界面.png)

-**Thesis Translation**:
+**Academic Paper Translation**:
 ![Translation Effect](/images/论文翻译.png)

-**Novel Translation**:![Translation Effect](/images/小说翻译.png)
+**Novel Translation**:
+![Translation Effect](/images/小说翻译.png)

 ## All-in-One Packages

-For users who want to get started quickly, we provide all-in-one packages
-on [GitHub Releases](https://github.com/xunbu/docutranslate/releases). Simply download, unzip, and enter your AI
-platform API-Key to start using.
+For users who want to get started quickly, we provide all-in-one packages on [GitHub Releases](https://github.com/xunbu/docutranslate/releases). Simply download, unzip, and enter your AI platform API Key to begin.

- **DocuTranslate**: Standard version, uses the online `minerU` engine to parse PDF documents. Choose this version if
-  you don't need local PDF parsing (recommended).
- **DocuTranslate_full**: Full version, includes the built-in `docling` local PDF parsing engine. Choose this version if
-  you need local PDF parsing.
+- **DocuTranslate**: Standard version, uses the online `minerU` engine to parse PDF documents. Choose this version if you don't need local PDF parsing (recommended).
+- **DocuTranslate_full**: Full version, includes the built-in `docling` local PDF parsing engine. Choose this version if you need local PDF parsing.

 ## Installation

@@ -96,48 +85,44 @@ git clone https://github.com/xunbu/docutranslate.git
 cd docutranslate

 uv sync
+
 ```

 ## Core Concept: Workflow

-The core of the new DocuTranslate is the **Workflow**. Each workflow is a complete, end-to-end translation pipeline
-designed specifically for a particular file type. You no longer interact with a monolithic class; instead, you select
-and configure a suitable workflow based on your file type.
+The core of the new DocuTranslate is the **Workflow**. Each workflow is a complete, end-to-end translation pipeline designed for a specific file type. Instead of interacting with a single large class, you select and configure a workflow based on your file type.

-**The basic usage process is as follows:**
+**The basic usage flow is as follows:**

-1. **Select a Workflow**: Choose a workflow based on your input file type (e.g., PDF/Word or TXT), such as
-   `MarkdownBasedWorkflow` or `TXTWorkflow`.
-2. **Build the Configuration**: Create the corresponding configuration object for the selected workflow (e.g.,
-   `MarkdownBasedWorkflowConfig`). This configuration object contains all the necessary sub-configurations, such as:
-    * **Converter Config**: Defines how to convert the original file (e.g., PDF) to Markdown.
-    * **Translator Config**: Defines which LLM, API-Key, target language, etc., to use.
-    * **Exporter Config**: Defines specific options for the output format (e.g., HTML).
-3. **Instantiate the Workflow**: Create a workflow instance using the configuration object.
-4. **Execute the Translation**: Call the workflow's `.read_*()` and `.translate()` / `.translate_async()` methods.
-5. **Export/Save the Result**: Call the `.export_to_*()` or `.save_as_*()` methods to get or save the translation
-   result.
+1.  **Select a Workflow**: Choose a workflow based on your input file type (e.g., PDF/Word or TXT), such as `MarkdownBasedWorkflow` or `TXTWorkflow`.
+2.  **Build Configuration**: Create the corresponding configuration object for the selected workflow (e.g., `MarkdownBasedWorkflowConfig`). This object contains all necessary sub-configurations, such as:
+    *   **Converter Config**: Defines how to convert the original file (like a PDF) to Markdown.
+    *   **Translator Config**: Defines which LLM, API-Key, target language, etc., to use.
+    *   **Exporter Config**: Defines specific options for the output format (like HTML).
+3.  **Instantiate the Workflow**: Create an instance of the workflow using the configuration object.
+4.  **Execute Translation**: Call the workflow's `.read_*()` and `.translate()` / `.translate_async()` methods.
+5.  **Export/Save Results**: Call the `.export_to_*()` or `.save_as_*()` methods to get or save the translation results.

 ## Available Workflows

-| Workflow                    | Use Case                                                                                                              | Input Formats                                | Output Formats         | Core Config Class             |
-|:----------------------------|:----------------------------------------------------------------------------------------------------------------------|:---------------------------------------------|:-----------------------|:------------------------------|
-| **`MarkdownBasedWorkflow`** | Processes rich text documents like PDF, Word, images, etc. The process is: `File -> Markdown -> Translate -> Export`. | `.pdf`, `.docx`, `.md`, `.png`, `.jpg`, etc. | `.md`, `.zip`, `.html` | `MarkdownBasedWorkflowConfig` |
-| **`TXTWorkflow`**           | Processes plain text documents. The process is: `txt -> Translate -> Export`.                                         | `.txt` and other plain text formats          | `.txt`, `.html`        | `TXTWorkflowConfig`           |
-| **`JsonWorkflow`**          | Processes JSON files. The process is: `json -> Translate -> Export`.                                                  | `.json`                                      | `.json`, `.html`       | `JsonWorkflowConfig`          |
-| **`DocxWorkflow`**          | Processes docx files. The process is: `docx -> Translate -> Export`.                                                  | `.docx`                                      | `.docx`, `.html`       | `docxWorkflowConfig`          |
-| **`XlsxWorkflow`**          | Processes xlsx files. The process is: `xlsx -> Translate -> Export`.                                                  | `.xlsx`, `.csv`                              | `.xlsx`, `.html`       | `XlsxWorkflowConfig`          |
-| **`SrtWorkflow`**           | Processes srt files. The process is: `srt -> Translate -> Export`.                                                    | `.srt`                                       | `.srt`, `.html`        | `SrtWorkflowConfig`           |
-| **`EpubWorkflow`**          | Processes epub files. The process is: `epub -> Translate -> Export`.                                                  | `.epub`                                      | `.epub`, `.html`       | `EpubWorkflowConfig`          |
-| **`HtmlWorkflow`**          | Processes html files. The process is: `html -> Translate -> Export`.                                                  | `.html`, `.htm`                              | `.html`                | `HtmlWorkflowConfig`          |
+| Workflow                    | Use Case                                                        | Input Formats                                | Output Formats             | Core Config Class             |
+|:----------------------------|:----------------------------------------------------------------|:---------------------------------------------|:---------------------------|:------------------------------|
+| **`MarkdownBasedWorkflow`** | Processes rich text documents like PDF, Word, images. Flow: `File -> Markdown -> Translate -> Export`. | `.pdf`, `.docx`, `.md`, `.png`, `.jpg`, etc. | `.md`, `.zip`, `.html`     | `MarkdownBasedWorkflowConfig` |
+| **`TXTWorkflow`**           | Processes plain text documents. Flow: `txt -> Translate -> Export`. | `.txt` and other plain text formats          | `.txt`, `.html`            | `TXTWorkflowConfig`           |
+| **`JsonWorkflow`**          | Processes JSON files. Flow: `json -> Translate -> Export`.          | `.json`                                      | `.json`, `.html`           | `JsonWorkflowConfig`          |
+| **`DocxWorkflow`**          | Processes docx files. Flow: `docx -> Translate -> Export`.          | `.docx`                                      | `.docx`, `.html`           | `DocxWorkflowConfig`          |
+| **`XlsxWorkflow`**          | Processes xlsx files. Flow: `xlsx -> Translate -> Export`.          | `.xlsx`, `.csv`                              | `.xlsx`, `.html`           | `XlsxWorkflowConfig`          |
+| **`SrtWorkflow`**           | Processes srt files. Flow: `srt -> Translate -> Export`.            | `.srt`                                       | `.srt`, `.html`            | `SrtWorkflowConfig`           |
+| **`EpubWorkflow`**          | Processes epub files. Flow: `epub -> Translate -> Export`.          | `.epub`                                      | `.epub`, `.html`           | `EpubWorkflowConfig`          |
+| **`HtmlWorkflow`**          | Processes html files. Flow: `html -> Translate -> Export`.          | `.html`, `.htm`                              | `.html`                    | `HtmlWorkflowConfig`          |

-> In the interactive interface, you can export to PDF format.
+> You can export to PDF format in the interactive interface.

-## Starting the Web UI and API Service
+## Launch Web UI and API Service

 For ease of use, DocuTranslate provides a full-featured Web interface and RESTful API.

-**Starting the Service:**
+**Start the service:**

 ```bash
 # Start the service, listening on port 8010 by default
@@ -151,16 +136,14 @@ export DOCUTRANSLATE_PORT=8011
 docutranslate -i
 ```

- **Interactive Interface**: After starting the service, please visit `http://127.0.0.1:8010` (or your specified port)
-  in your browser.
- **API Documentation**: The complete API documentation (Swagger UI) is available at `http://127.0.0.1:8010/docs`.
+-   **Interactive Interface**: After starting the service, visit `http://127.0.0.1:8010` (or your specified port) in your browser.
+-   **API Documentation**: The complete API documentation (Swagger UI) is available at `http://127.0.0.1:8010/docs`.

 ## Usage

-### Example 1: Translating a PDF File (using `MarkdownBasedWorkflow`)
+### Example 1: Translate a PDF file (using `MarkdownBasedWorkflow`)

-This is the most common use case. We will use the `minerU` engine to convert the PDF to Markdown, and then use an LLM
-for translation. Here is an example using the asynchronous approach.
+This is the most common use case. We will use the `minerU` engine to convert the PDF to Markdown and then use an LLM for translation. This example uses the asynchronous method.

 ```python
 import asyncio
@@ -171,37 +154,38 @@ from docutranslate.exporter.md.md2html_exporter import MD2HTMLExporterConfig


 async def main():
-    # 1. Build the translator configuration
+    # 1. Build translator configuration
    translator_config = MDTranslatorConfig(
-        base_url="https://open.bigmodel.cn/api/paas/v4",  # AI platform Base URL
-        api_key="YOUR_ZHIPU_API_KEY",  # AI platform API Key
+        base_url="https://open.bigmodel.cn/api/paas/v4",  # AI Platform Base URL
+        api_key="YOUR_ZHIPU_API_KEY",  # AI Platform API Key
        model_id="glm-4-air",  # Model ID
        to_lang="English",  # Target language
        chunk_size=3000,  # Text chunk size
-        concurrent=10,  # Concurrency
+        concurrent=10,  # Concurrency level
        # glossary_generate_enable=True, # Enable automatic glossary generation
-        # glossary_dict={"Jobs":"乔布斯"} # Pass in a glossary
+        # glossary_dict={"Jobs":"乔布斯"}, # Pass in a glossary
+        # system_proxy_enable=True, # Enable system proxy
    )

-    # 2. Build the converter configuration (using minerU)
+    # 2. Build converter configuration (using minerU)
    converter_config = ConverterMineruConfig(
        mineru_token="YOUR_MINERU_TOKEN",  # Your minerU Token
        formula_ocr=True  # Enable formula recognition
    )

-    # 3. Build the main workflow configuration
+    # 3. Build main workflow configuration
    workflow_config = MarkdownBasedWorkflowConfig(
        convert_engine="mineru",  # Specify the parsing engine
-        converter_config=converter_config,  # Pass in the converter configuration
-        translator_config=translator_config,  # Pass in the translator configuration
+        converter_config=converter_config,  # Pass the converter config
+        translator_config=translator_config,  # Pass the translator config
        html_exporter_config=MD2HTMLExporterConfig(cdn=True)  # HTML export configuration
    )

    # 4. Instantiate the workflow
    workflow = MarkdownBasedWorkflow(config=workflow_config)

-    # 5. Read the file and execute the translation
-    print("Starting to read and translate the file...")
+    # 5. Read the file and execute translation
+    print("Reading and translating the file...")
    workflow.read_path("path/to/your/document.pdf")
    await workflow.translate_async()
    # Or use the synchronous method
@@ -212,11 +196,11 @@ async def main():
    workflow.save_as_html(name="translated_document.html")
    workflow.save_as_markdown_zip(name="translated_document.zip")
    workflow.save_as_markdown(name="translated_document.md")  # Markdown with embedded images
-    print("Files have been saved to the ./output folder.")
+    print("Files saved to the ./output folder.")

    # Or get the content strings directly
    html_content = workflow.export_to_html()
-    markdown_content = workflow.export_to_markdown()
+    html_content = workflow.export_to_markdown()
    # print(html_content)


@@ -224,10 +208,9 @@ if __name__ == "__main__":
    asyncio.run(main())
 ```

-### Example 2: Translating a TXT File (using `TXTWorkflow`)
+### Example 2: Translate a TXT file (using `TXTWorkflow`)

-For plain text files, the process is simpler as it doesn't require a document parsing (conversion) step. Here is an
-example using the asynchronous approach.
+For plain text files, the process is simpler as it doesn't require a document parsing (conversion) step. This example uses the asynchronous method.

 ```python
 import asyncio
@@ -237,7 +220,7 @@ from docutranslate.exporter.txt.txt2html_exporter import TXT2HTMLExporterConfig


 async def main():
-    # 1. Build the translator configuration
+    # 1. Build translator configuration
    translator_config = TXTTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
@@ -245,7 +228,7 @@ async def main():
        to_lang="Chinese",
    )

-    # 2. Build the main workflow configuration
+    # 2. Build main workflow configuration
    workflow_config = TXTWorkflowConfig(
        translator_config=translator_config,
        html_exporter_config=TXT2HTMLExporterConfig(cdn=True)
@@ -254,7 +237,7 @@ async def main():
    # 3. Instantiate the workflow
    workflow = TXTWorkflow(config=workflow_config)

-    # 4. Read the file and execute the translation
+    # 4. Read the file and execute translation
    workflow.read_path("path/to/your/notes.txt")
    await workflow.translate_async()
    # Or use the synchronous method
@@ -262,7 +245,7 @@ async def main():

    # 5. Save the result
    workflow.save_as_txt(name="translated_notes.txt")
-    print("TXT file has been saved.")
+    print("TXT file saved.")

    # You can also export the translated plain text
    text = workflow.export_to_txt()
@@ -272,11 +255,9 @@ if __name__ == "__main__":
    asyncio.run(main())
 ```

-### Example 3: Translating a JSON File (using `JsonWorkflow`)
+### Example 3: Translate a JSON file (using `JsonWorkflow`)

-Here is an example using the asynchronous approach. The `json_paths` item in `JsonTranslatorConfig` needs to specify the
-JSON paths to be translated (satisfying the `jsonpath-ng` syntax). Only values matching the JSON paths will be
-translated.
+This example uses the asynchronous method. The `json_paths` item in `JsonTranslatorConfig` needs to specify the JSON paths to be translated (conforming to the `jsonpath-ng` syntax). Only values matching these paths will be translated.

 ```python
 import asyncio
@@ -287,16 +268,16 @@ from docutranslate.workflow.json_workflow import JsonWorkflowConfig, JsonWorkflo


 async def main():
-    # 1. Build the translator configuration
+    # 1. Build translator configuration
    translator_config = JsonTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
        model_id="gpt-4o",
        to_lang="Chinese",
-        json_paths=["$.*", "$.name"]  # Satisfies jsonpath-ng syntax, values at matching paths will be translated
+        json_paths=["$.*", "$.name"]  # Conforms to jsonpath-ng syntax, values at matching paths will be translated
    )

-    # 2. Build the main workflow configuration
+    # 2. Build main workflow configuration
    workflow_config = JsonWorkflowConfig(
        translator_config=translator_config,
        html_exporter_config=Json2HTMLExporterConfig(cdn=True)
@@ -305,7 +286,7 @@ async def main():
    # 3. Instantiate the workflow
    workflow = JsonWorkflow(config=workflow_config)

-    # 4. Read the file and execute the translation
+    # 4. Read the file and execute translation
    workflow.read_path("path/to/your/notes.json")
    await workflow.translate_async()
    # Or use the synchronous method
@@ -313,7 +294,7 @@ async def main():

    # 5. Save the result
    workflow.save_as_json(name="translated_notes.json")
-    print("JSON file has been saved.")
+    print("JSON file saved.")

    # You can also export the translated JSON text
    text = workflow.export_to_json()
@@ -323,9 +304,9 @@ if __name__ == "__main__":
    asyncio.run(main())
 ```

-### Example 4: Translating a docx File (using `DocxWorkflow`)
+### Example 4: Translate a DOCX file (using `DocxWorkflow`)

-Here is an example using the asynchronous approach.
+This example uses the asynchronous method.

 ```python
 import asyncio
@@ -336,7 +317,7 @@ from docutranslate.workflow.docx_workflow import DocxWorkflowConfig, DocxWorkflo


 async def main():
-    # 1. Build the translator configuration
+    # 1. Build translator configuration
    translator_config = DocxTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
@@ -346,7 +327,7 @@ async def main():
        separator="\n",  # Separator used in "append" and "prepend" modes
    )

-    # 2. Build the main workflow configuration
+    # 2. Build main workflow configuration
    workflow_config = DocxWorkflowConfig(
        translator_config=translator_config,
        html_exporter_config=Docx2HTMLExporterConfig(cdn=True)
@@ -355,7 +336,7 @@ async def main():
    # 3. Instantiate the workflow
    workflow = DocxWorkflow(config=workflow_config)

-    # 4. Read the file and execute the translation
+    # 4. Read the file and execute translation
    workflow.read_path("path/to/your/notes.docx")
    await workflow.translate_async()
    # Or use the synchronous method
@@ -363,9 +344,9 @@ async def main():

    # 5. Save the result
    workflow.save_as_docx(name="translated_notes.docx")
-    print("docx file has been saved.")
+    print("DOCX file saved.")

-    # You can also export the translated docx as binary
+    # You can also export the translated DOCX as bytes
    text_bytes = workflow.export_to_docx()


@@ -373,9 +354,9 @@ if __name__ == "__main__":
    asyncio.run(main())
 ```

-### Example 5: Translating a xlsx File (using `XlsxWorkflow`)
+### Example 5: Translate an XLSX file (using `XlsxWorkflow`)

-Here is an example using the asynchronous approach.
+This example uses the asynchronous method.

 ```python
 import asyncio
@@ -386,7 +367,7 @@ from docutranslate.workflow.xlsx_workflow import XlsxWorkflowConfig, XlsxWorkflo


 async def main():
-    # 1. Build the translator configuration
+    # 1. Build translator configuration
    translator_config = XlsxTranslatorConfig(
        base_url="https://api.openai.com/v1/",
        api_key="YOUR_OPENAI_API_KEY",
@@ -396,7 +377,7 @@ async def main():
        separator="\n",  # Separator used in "append" and "prepend" modes
    )

-    # 2. Build the main workflow configuration
+    # 2. Build main workflow configuration
    workflow_config = XlsxWorkflowConfig(
        translator_config=translator_config,
        html_exporter_config=Xlsx2HTMLExporterConfig(cdn=True)
@@ -405,7 +386,7 @@ async def main():
    # 3. Instantiate the workflow
    workflow = XlsxWorkflow(config=workflow_config)

-    # 4. Read the file and execute the translation
+    # 4. Read the file and execute translation
    workflow.read_path("path/to/your/notes.xlsx")
    await workflow.translate_async()
    # Or use the synchronous method
@@ -413,9 +394,9 @@ async def main():

    # 5. Save the result
    workflow.save_as_xlsx(name="translated_notes.xlsx")
-    print("xlsx file has been saved.")
+    print("XLSX file saved.")

-    # You can also export the translated xlsx as binary
+    # You can also export the translated XLSX as bytes
    text_bytes = workflow.export_to_xlsx()


@@ -425,76 +406,67 @@ if __name__ == "__main__":

 ## Prerequisites and Configuration Details

-### 1. Get a Large Language Model API Key
+### 1. Get a Large Model API Key

-The translation functionality relies on large language models. You need to obtain a `base_url`, `api_key`, and
-`model_id` from the respective AI platform.
+The translation feature relies on large language models. You need to obtain a `base_url`, `api_key`, and `model_id` from the respective AI platform.

-> Recommended models: Volcengine's `doubao-seed-1-6-250615`, `doubao-seed-1-6-flash-250715`, Zhipu's `glm-4-flash`,
-> Alibaba Cloud's `qwen-plus`, `qwen-turbo`, Deepseek's `deepseek-chat`, etc.
+> Recommended models: Volcengine's `doubao-seed-1-6-flash` and `doubao-seed-1-6` series, Zhipu's `glm-4-flash`, Alibaba Cloud's `qwen-plus` and `qwen-flash`, Deepseek's `deepseek-chat`, etc.

-| Platform Name         | Get API Key                                                                                   | baseurl                                                  |
-|-----------------------|-----------------------------------------------------------------------------------------------|----------------------------------------------------------|
-| ollama                |                                                                                               | http://127.0.0.1:11434/v1                                |
-| lm studio             |                                                                                               | http://127.0.0.1:1234/v1                                 |
-| openrouter            | [Click to get](https://openrouter.ai/settings/keys)                                           | https://openrouter.ai/api/v1                             |
-| openai                | [Click to get](https://platform.openai.com/api-keys)                                          | https://api.openai.com/v1/                               |
-| gemini                | [Click to get](https://aistudio.google.com/u/0/apikey)                                        | https://generativelanguage.googleapis.com/v1beta/openai/ |
-| deepseek              | [Click to get](https://platform.deepseek.com/api_keys)                                        | https://api.deepseek.com/v1                              |
-| Zhipu AI              | [Click to get](https://open.bigmodel.cn/usercenter/apikeys)                                   | https://open.bigmodel.cn/api/paas/v4                     |
-| Tencent Hunyuan       | [Click to get](https://console.cloud.tencent.com/hunyuan/api-key)                             | https://api.hunyuan.cloud.tencent.com/v1                 |
-| Alibaba Cloud Bailian | [Click to get](https://bailian.console.aliyun.com/?tab=model#/api-key)                        | https://dashscope.aliyuncs.com/compatible-mode/v1        |
-| Volcengine            | [Click to get](https://console.volcengine.com/ark/region:ark+cn-beijing/apiKey?apikey=%7B%7D) | https://ark.cn-beijing.volces.com/api/v3                 |
-| SiliconFlow           | [Click to get](https://cloud.siliconflow.cn/account/ak)                                       | https://api.siliconflow.cn/v1                            |
-| DMXAPI                | [Click to get](https://www.dmxapi.cn/token)                                                   | https://www.dmxapi.cn/v1                                 |
+| Platform Name       | Get API Key                                                                              | Base URL                                                 |
+|:--------------------|:-----------------------------------------------------------------------------------------|:---------------------------------------------------------|
+| ollama              |                                                                                          | `http://127.0.0.1:11434/v1`                              |
+| lm studio           |                                                                                          | `http://127.0.0.1:1234/v1`                               |
+| openrouter          | [Click to get](https://openrouter.ai/settings/keys)                                      | `https://openrouter.ai/api/v1`                           |
+| openai              | [Click to get](https://platform.openai.com/api-keys)                                     | `https://api.openai.com/v1/`                             |
+| gemini              | [Click to get](https://aistudio.google.com/u/0/apikey)                                   | `https://generativelanguage.googleapis.com/v1beta/openai/` |
+| deepseek            | [Click to get](https://platform.deepseek.com/api_keys)                                   | `https://api.deepseek.com/v1`                            |
+| Zhipu AI (智谱ai)     | [Click to get](https://open.bigmodel.cn/usercenter/apikeys)                                | `https://open.bigmodel.cn/api/paas/v4`                   |
+| Tencent Hunyuan (腾讯混元) | [Click to get](https://console.cloud.tencent.com/hunyuan/api-key)                          | `https://api.hunyuan.cloud.tencent.com/v1`               |
+| Alibaba Cloud Bailian (阿里云百炼) | [Click to get](https://bailian.console.aliyun.com/?tab=model#/api-key)                     | `https://dashscope.aliyuncs.com/compatible-mode/v1`      |
+| Volcengine (火山引擎) | [Click to get](https://console.volcengine.com/ark/region:ark+cn-beijing/apiKey?apikey=%7B%7D) | `https://ark.cn-beijing.volces.com/api/v3`               |
+| SiliconFlow (硅基流动) | [Click to get](https://cloud.siliconflow.cn/account/ak)                                    | `https://api.siliconflow.cn/v1`                          |
+| DMXAPI              | [Click to get](https://www.dmxapi.cn/token)                                                | `https://www.dmxapi.cn/v1`                               |
+| Juguang AI (聚光AI)   | [Click to get](https://ai.juguang.chat/console/token)                                      | `https://ai.juguang.chat/v1`                             |

 ### 2. PDF Parsing Engine (ignore if not translating PDFs)

-### 2.1 Get a minerU Token (online PDF parsing, free, recommended)
+#### 2.1 Get a minerU Token (Online PDF parsing, free, recommended)

-If you choose `mineru` as the document parsing engine (`convert_engine="mineru"`), you need to apply for a free token.
+If you choose `mineru` as your document parsing engine (`convert_engine="mineru"`), you need to apply for a free token.

-1. Visit the [minerU official website](https://mineru.net/apiManage/docs) to register and apply for an API.
-2. Create a new API Token in the [API Token management interface](https://mineru.net/apiManage/token).
+1.  Visit the [minerU official website](https://mineru.net/apiManage/docs) to register and apply for an API.
+2.  Create a new API Token in the [API Token Management interface](https://mineru.net/apiManage/token).

-> **Note**: minerU tokens have a 14-day validity period. Please re-create them after they expire.
+> **Note**: minerU Tokens are valid for 14 days. Please create a new one after expiration.

-### 2.2. docling Engine Configuration (local PDF parsing)
+#### 2.2. docling Engine Configuration (Local PDF parsing)

-If you choose `docling` as the document parsing engine (`convert_engine="docling"`), it will download the required
-models from Hugging Face on first use.
+If you choose `docling` as your document parsing engine (`convert_engine="docling"`), it will download the required models from Hugging Face upon first use.

-> A better option is to download `docling_artifact.zip`
-> from [GitHub releases](https://github.com/xunbu/docutranslate/releases) and unzip it to your working directory.
+> A better option is to download `docling_artifact.zip` from [GitHub Releases](https://github.com/xunbu/docutranslate/releases) and extract it to your working directory.

-**Solution for network issues when downloading `docling` models:**
-
-1. **Set a Hugging Face mirror (recommended)**:
-
-* **Method A (environment variable)**: Set the system environment variable `HF_ENDPOINT` and restart your IDE or
-  terminal.
-   ```
-   HF_ENDPOINT=https://hf-mirror.com
-   ```
-* **Method B (set in code)**: Add the following code at the beginning of your Python script.
+**Solutions for network issues when downloading `docling` models:**

+1.  **Set a Hugging Face mirror (Recommended)**:
+    *   **Method A (Environment Variable)**: Set the system environment variable `HF_ENDPOINT` and restart your IDE or terminal.
+        ```
+        HF_ENDPOINT=https://hf-mirror.com
+        ```
+*   **Method B (Set in code)**: Add the following code at the beginning of your Python script.
 ```python
 import os
-
+    
 os.environ['HF_ENDPOINT'] = 'https://hf-mirror.com'
 ```
-
-2. **Offline use (download the model package in advance)**:
-
-* Download `docling_artifact.zip` from [GitHub Releases](https://github.com/xunbu/docutranslate/releases).
-* Unzip it to your project directory.
-* Specify the model path in the configuration (if the model is not in the same directory as the script):
-
+2.  **Offline Usage (Download the model package in advance)**:
+    *   Download `docling_artifact.zip` from [GitHub Releases](https://github.com/xunbu/docutranslate/releases).
+    *   Extract it into your project directory.
+*   Specify the model path in your configuration (if the model is not in the same directory as the script):
 ```python
 from docutranslate.converter.x2md.converter_docling import ConverterDoclingConfig
-
+    
 converter_config = ConverterDoclingConfig(
-    artifact="./docling_artifact",  # Point to the unzipped folder
+    artifact="./docling_artifact",  # Path to the extracted folder
    code_ocr=True,
    formula_ocr=True
 )
@@ -502,32 +474,28 @@ converter_config = ConverterDoclingConfig(

 ## FAQ

-**Q: What if port 8010 is occupied?**
+**Q: Why is the translated text still in the original language?**  
+A: Check the logs for errors. It's usually due to an overdue payment on the AI platform or network issues (check if you need to enable the system proxy).
+
+**Q: Port 8010 is already in use. What should I do?**  
 A: Use the `-p` parameter to specify a new port, or set the `DOCUTRANSLATE_PORT` environment variable.

-**Q: Does it support translation of scanned PDFs?**
+**Q: Does it support translating scanned PDFs?**  
 A: Yes. Please use the `mineru` parsing engine, which has powerful OCR capabilities.

-**Q: Why is the first PDF translation so slow?**
-A: If you are using the `docling` engine, it needs to download models from Hugging Face on its first run. Please refer
-to the "Network Issues Solution" above to speed up this process.
+**Q: Why is the first PDF translation very slow?**  
+A: If you are using the `docling` engine, it needs to download models from Hugging Face on its first run. Please refer to the "Network Issues Solutions" section above to speed up this process.

-**Q: How to use it in an intranet (offline) environment?**
-A: It is entirely possible. You need to meet the following conditions:
+**Q: How can I use it in an intranet (offline) environment?**  
+A: Absolutely. You need to meet the following conditions:
+1.  **Local LLM**: Deploy a language model locally using tools like [Ollama](https://ollama.com/) or [LM Studio](https://lmstudio.ai/), and fill in the local model's `base_url` in `TranslatorConfig`.
+2.  **Local PDF Parsing Engine** (only for parsing PDFs): Use the `docling` engine and download the model package in advance as described in the "Offline Usage" section above.

-1. **Local LLM**: Use tools like [Ollama](https://ollama.com/) or [LM Studio](https://lmstudio.ai/) to deploy a language
-   model locally, and fill in the `base_url` of the local model in `TranslatorConfig`.
-2. **Local PDF parsing engine** (only needed for parsing PDFs): Use the `docling` engine and follow the "Offline use"
-   instructions above to download the model package in advance.
+**Q: How does the PDF parsing cache mechanism work?**  
+A: `MarkdownBasedWorkflow` automatically caches the results of document parsing (file-to-Markdown conversion) to avoid repetitive, time-consuming parsing. The cache is stored in memory by default and records the last 10 parses. You can change the cache size using the `DOCUTRANSLATE_CACHE_NUM` environment variable.

-**Q: How does the PDF parsing cache mechanism work?**
-A: `MarkdownBasedWorkflow` automatically caches the results of document parsing (file to Markdown conversion) to avoid
-repeated parsing that consumes time and resources. The cache is stored in memory by default and records the last 10
-parses. You can modify the cache size using the `DOCUTRANSLATE_CACHE_NUM` environment variable.
-
-**Q: How to make the software go through a proxy?**
-A: The software does not use a proxy by default. You can enable it by setting the environment variable
-`DOCUTRANSLATE_PROXY_ENABLED` to `true`.
+**Q: How can I make the software use a proxy?**  
+A: By default, the software does not use the system proxy. You can enable it by setting `system_proxy_enable=True` in `TranslatorConfig`.

 ## Star History

@@ -539,10 +507,10 @@ A: The software does not use a proxy by default. You can enable it by setting th
 </picture>
 </a>

-## Support the Author
+## Sponsorship

-Welcome to support the author. Please kindly mention the reason for your appreciation in the notes. ❤
+Your support is welcome! Please mention the reason for your donation in the memo.

 <p align="center">
-  <img src="./images/赞赏码.jpg" alt="赞赏码" style="width: 250px;">
+  <img src="./images/赞赏码.jpg" alt="Sponsorship QR Code" style="width: 250px;">
 </p>