【方案三】JAVA中使用ocr（Umi-OCR）

embedded/2024/12/4 15:19:17/

前言：

需求：

代码：

难点：

参考文档：

前言：

前两个方案都是自己做着玩儿的，实际运用到上线项目是要收费的，该方案使用的是免费开源的工具，就算运用到商业项目也不会侵权，建议使用这个方案。

需求：

上传图片，识别出内容

代码：

java">@Value("${ocr.url}")
//"http://121.229.10.211:32009/umi/api/ocr"
private String ocrUrl;@PostMapping("/ocr")
@ApiOperationSupport(order = 3)
@ApiOperation(value = "识别图像", notes = "上传图像")
public R<FarmerApplyEntity> ocr(@RequestBody MultipartFile file) throws IOException, ParseException {// 创建临时文件File tfile = File.createTempFile("tempfile", file.getOriginalFilename());// 写入数据file.transferTo(tfile);// 构造请求体JSONObject inputObject = new JSONObject();String base64 = Base64.encode(tfile);inputObject.put("base64", base64.replace("data:image/png;base64,", ""));JSONObject options = new JSONObject();options.put("ocr.language", "models/config_chinese.txt");options.put("ocr.cls", true);options.put("ocr.limit_side_len", 4320);options.put("tbpu.parser", "multi_none");options.put("data.format", "txt");inputObject.put("options", options);String jsonInputString = inputObject.toString();// 发送 POST 请求HttpResponse response = HttpUtil.createPost(ocrUrl).contentType("application/json") // 设置请求头，指定内容类型为 JSON.body(jsonInputString) // 设置请求体.execute();System.out.printf(JSON.parseObject(response.body()).toJSONString());JSONObject retObject = JSON.parseObject(response.body());JSONArray data = retObject.getJSONArray("data");// 将JSONArray里的每一个object里的text串联起来// 用于存储串联结果的StringBuilderStringBuilder concatenatedText = new StringBuilder();// 遍历JSONArrayfor (int i = 0; i < data.size(); i++) {// 获取每个JSONObjectJSONObject jsonObject = data.getJSONObject(i);// 获取text属性String text = jsonObject.getString("text");// 将text添加到StringBuilderconcatenatedText.append(text);}tfile.delete();return R.data(contractService.boxOcrResult(concatenatedText.toString()));}