# OCR

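> OCR (Optical Character Recognition) is the process by which an electronic device (such as a scanner or digital camera) examines characters printed on paper, determines their shapes by detecting patterns of dark and light, and then translates those shapes into computer text using character recognition methods. In other words, for printed characters, it is the technology of optically converting the text of a paper document into a black-and-white bitmap image file, then using recognition software to convert the text in that image into a text format that word-processing software can edit further. How to correct errors or use auxiliary information to improve recognition accuracy is the most important problem in OCR, and it is also the origin of the term ICR (Intelligent Character Recognition). The main indicators for judging an OCR system's performance are rejection rate, misrecognition rate, recognition speed, user-interface friendliness, product stability, ease of use, and feasibility.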

Recognizes the text in an image and turns it into editable text.
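
The rejection rate and misrecognition rate named in the definition above can be illustrated with a small calculation. This is a minimal sketch, not part of fooView: the `OcrCounts` type and both functions are hypothetical names, and it assumes both rates are measured against the total number of characters submitted (some sources divide the misrecognition rate by accepted characters only).

```python
from dataclasses import dataclass


@dataclass
class OcrCounts:
    """Hypothetical per-character tallies from an OCR evaluation run."""
    correct: int        # characters recognized correctly
    misrecognized: int  # characters recognized as the wrong character
    rejected: int       # characters the engine declined to recognize

    @property
    def total(self) -> int:
        return self.correct + self.misrecognized + self.rejected


def rejection_rate(counts: OcrCounts) -> float:
    """Share of all submitted characters that the engine rejected."""
    if counts.total == 0:
        raise ValueError("no characters were evaluated")
    return counts.rejected / counts.total


def misrecognition_rate(counts: OcrCounts) -> float:
    """Share of all submitted characters recognized incorrectly.

    Assumption: the denominator is the total character count.
    """
    if counts.total == 0:
        raise ValueError("no characters were evaluated")
    return counts.misrecognized / counts.total


if __name__ == "__main__":
    sample = OcrCounts(correct=90, misrecognized=6, rejected=4)
    print(f"rejection rate: {rejection_rate(sample):.2%}")
    print(f"misrecognition rate: {misrecognition_rate(sample):.2%}")
```

Keeping the two rates separate matters because an engine can lower its misrecognition rate simply by rejecting more characters, so the two numbers are best read together.
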

## 1. Basic operations

Take a region screenshot of an area that contains text. OCR then runs in the ![](/files/-M2wcU7BYfHUeZs5b5uy) area of the `results page`.

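* Tap the reload button to run recognition again using the online service.
* On the text extraction results page, tap an entry to edit it, or tap the magnifier button to enter word segmentation.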

### Service type

Under `Settings -> Screenshot -> Automatic text recognition`, choose either the local or the online service.

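* The local service requires downloading recognition libraries. You can select up to three, and more than 70 languages are supported.
* The online service requires you to select one language.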

![](http://ww1.sinaimg.cn/large/6b1dd0a7ly1fzr8u7yu4sj20u0103dr9.jpg)

