龙空技术网

超强合集:OCR 文本检测干货汇总(含论文、源码、demo 等资源)

胖头ai 1218

前言:

此时各位老铁们对“ocrjs”大概比较关怀,同学们都需要学习一些“ocrjs”的相关资讯。那么小编也在网络上收集了一些关于“ocrjs””的相关资讯,希望大家能喜欢,看官们一起来学习一下吧!

awesome-deep-text-detection-recognition

A curated list of awesome deep learning based papers on text detection and recognition.

资源链接:

Papers

Multi-digit Number Recognition from Street View Imagery using Deep Convolutional Neural Networks

intro: Google. Ian J. Goodfellowarxiv:

End-to-End Text Recognition with Convolutional Neural Networks

paper: thesis:

Word Spotting and Recognition with Embedded Attributes

paper:

Reading Text in the Wild with Convolutional Neural Networks

arxiv: : : :

Deep structured output learning for unconstrained text recognition

intro: "propose an architecture consisting of a character sequence CNN and an N-gram encoding CNN which act on an input image in parallel and whose outputs are utilized along with a CRF model to recognize the text content present within the image."arxiv:

Deep Features for Text Spotting

paper: : :

Reading Scene Text in Deep Convolutional Sequences

intro: AAAI 2016arxiv:

DeepFont: Identify Your Font from An Image

arxiv:

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

intro: Convolutional Recurrent Neural Network (CRNN)arxiv: : :

Recursive Recurrent Nets with Attention Modeling for OCR in the Wild

arxiv: Feature Learning for Offline Signature Verification using Deep Convolutional Neural Networksarxiv:

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

arxiv:

End-to-End Interpretation of the French Street Name Signs Dataset

paper: :

End-to-End Subtitle Detection and Recognition for Videos in East Asian Languages via CNN Ensemble with Near-Human-Level Performance

arxiv:

Smart Library: Identifying Books in a Library using Richly Supervised Deep Scene Text Reading

arxiv: Text Proposals for Scene Images with Fully Convolutional Networksintro: Universitat Autonoma de Barcelona (UAB) & University of Florenceintro: International Conference on Pattern Recognition (ICPR) - DLPR (Deep Learning for Pattern Recognition) workshoparxiv:

Scene Text Eraser

Attention-based Extraction of Structured Information from Street View Imagery

intro: University College London & Google Incarxiv: :

Implicit Language Model in LSTM for OCR

Detection

Object Proposals for Text Extraction in the Wild

intro: ICDAR 2015arxiv: :

Text-Attentional Convolutional Neural Networks for Scene Text Detection

arxiv:

Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network

arxiv:

Synthetic Data for Text Localisation in Natural Images

intro: CVPR 2016project page: : : :

Scene Text Detection via Holistic, Multi-Channel Prediction

arxiv:

Detecting Text in Natural Image with Connectionist Text Proposal Network

intro: ECCV 2016arxiv: : (CUDA8.0 support): : :

TextBoxes: A Fast Text Detector with a Single Deep Neural Network

intro: AAAI 2017arxiv: : :

TextBoxes++: A Single-Shot Oriented Scene Text Detector

intro: TIP 2018. University of Science and Technology(HUST)arxiv: (official, Caffe):

Arbitrary-Oriented Scene Text Detection via Rotation Proposals

intro: IEEE Transactions on Multimediakeywords: RRPNarxiv: : :

Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection

intro: CVPR 2017intro: F-measure 70.64%, outperforming the existing state-of-the-art method with F-measure 63.76%arxiv:

Detecting Oriented Text in Natural Images by Linking Segments

intro: CVPR 2017arxiv: :

Deep Direct Regression for Multi-Oriented Scene Text Detection

arxiv:

Cascaded Segmentation-Detection Networks for Word-Level Text Spotting

Text-Detection-using-py-faster-rcnn-framework

github:

WordFence: Text Detection in Natural Images with Border Awarenessintro: ICIP 2017arcxiv:

SSD-text detection: Text Detector

intro: A modified SSD model for text detectiongithub:

R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

intro: Samsung R&D Institute Chinaarxiv:

R-PHOC: Segmentation-Free Word Spotting using CNN

intro: ICDAR 2017arxiv:

Towards End-to-end Text Spotting with Convolutional Recurrent Neural Networks

intro: ICCV 2017arxiv:

EAST: An Efficient and Accurate Scene Text Detector

intro: CVPR 2017. Megviiarxiv: : :

Deep Scene Text Detection with Connected Component Proposals

intro: Amap Vision Lab, Alibaba Grouparxiv:

Single Shot Text Detector with Regional Attention

intro: ICCV 2017arxiv: : :

Fused Text Segmentation Networks for Multi-oriented Scene Text Detection

Deep Residual Text Detection Network for Scene Text

intro: IAPR International Conference on Document Analysis and Recognition (ICDAR) 2017. Samsung R&D Institute of China, Beijingarxiv:

Feature Enhancement Network: A Refined Scene Text Detector

intro: AAAI 2018arxiv:

ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene

Detecting Curve Text in the Wild: New Dataset and New Solution

arxiv: :

FOTS: Fast Oriented Text Spotting with a Unified Network

PixelLink: Detecting Scene Text via Instance Segmentation

intro: AAAI 2018arxiv:

PixelLink: Detecting Scene Text via Instance Segmentation

intro: AAAI 2018. Zhejiang University & Chinese Academy of Sciencesarxiv:

Sliding Line Point Regression for Shape Robust Scene Text Detection

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

intro: CVPR 2018arxiv:

Single Shot TextSpotter with Explicit Alignment and Attention

intro: CVPR 2018arxiv:

Rotation-Sensitive Regression for Oriented Scene Text Detection

intro: CVPR 2018arxiv:

Detecting Multi-Oriented Text with Corner-based Region Proposals

arxiv: :

An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches

IncepText: A New Inception-Text Module with Deformable PSROI Pooling for Multi-Oriented Scene Text Detection

intro: IJCAI 2018. Alibaba Grouparxiv:

Boosting up Scene Text Detectors with Guided CNN

Shape Robust Text Detection with Progressive Scale Expansion Network

arxiv: :

A Single Shot Text Detector with Scale-adaptive Anchors

TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes

intro: ECCV 2018arxiv:

Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes

intro: ECCV 2018. Huazhong University of Science and Technology & Megvii (Face++) Technologyarxiv:

Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping

intro: ECCV 2018arxiv:

TextContourNet: a Flexible and Effective Framework for Improving Scene Text Detection Architecture with a Multi-task Cascade

Correlation Propagation Networks for Scene Text Detection

Scene Text Detection with Supervised Pyramid Context Network

intro: AAAI 2019arxiv:

Improving Rotated Text Detection with Rotation Region Proposal Networks

Pixel-Anchor: A Fast Oriented Scene Text Detector with Combined Networks

Mask R-CNN with Pyramid Attention Network for Scene Text Detection

intro: WACV 2019arxiv:

TextField: Learning A Deep Direction Field for Irregular Scene Text Detection

intro: Huazhong University of Science and Technology (HUST) & Alibaba Grouparxiv:

Detecting Text in the Wild with Deep Character Embedding Network

intro: ACCV 2018intro: Baiduarxiv:

Text Recognition

Sequence to sequence learning for unconstrained scene text recognition

intro: master thesisarxiv:

Drawing and Recognizing Chinese Characters with Recurrent Neural Network

arxiv:

Learning Spatial-Semantic Context with Fully Convolutional Recurrent Network for Online Handwritten Chinese Text Recognition

intro: correct rates: Dataset-CASIA 97.10% and Dataset-ICDAR 97.15%arxiv:

Stroke Sequence-Dependent Deep Convolutional Neural Network for Online Handwritten Chinese Character Recognition

arxiv:

Visual attention models for scene text recognition

Focusing Attention: Towards Accurate Text Recognition in Natural Images

intro: ICCV 2017arxiv:

Scene Text Recognition with Sliding Convolutional Character Models

AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition

A New Hybrid-parameter Recurrent Neural Networks for Online Handwritten Chinese Character Recognition

AON: Towards Arbitrarily-Oriented Text Recognition

arxiv: :

Arbitrarily-Oriented Text Recognition

intro: A method used in ICDAR 2017 word recognition competitionsarxiv:

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

Edit Probability for Scene Text Recognition

intro: Fudan University & Hikvision Research Institutearxiv:

SCAN: Sliding Convolutional Attention Network for Scene Text Recognition

Adaptive Adversarial Attack on Scene Text Recognition

intro: University of Floridaarxiv:

ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification

Text Detection + Recognition

STN-OCR: A single Neural Network for Text Detection and Text Recognition

arxiv: :

Deep TextSpotter: An End-to-End Trainable Scene Text Localization and Recognition Framework

intro: ICCV 2017arxiv:

FOTS: Fast Oriented Text Spotting with a Unified Network

Single Shot TextSpotter with Explicit Alignment and Attention

An end-to-end TextSpotter with Explicit Alignment and Attention

intro: CVPR 2018arxiv: (official, Caffe):

Verisimilar Image Synthesis for Accurate Detection and Recognition of Texts in Scenes

intro: ECCV 2018arxiv: :

Scene Text Detection and Recognition: The Deep Learning Era

arxiv: :

A Novel Integrated Framework for Learning both Text Detection and Recognition

intro: Alibabaarxiv:

Breaking Captcha

Using deep learning to break a Captcha system

intro: "Using Torch code to break simplecaptcha with 92% accuracy"blog: :

Breaking reddit captcha with 96% accuracy

blog: :

I’m not a human: Breaking the Google reCAPTCHA

paper:

Neural Net CAPTCHA Cracker

slides: : :

Recurrent neural networks for decoding CAPTCHAS

blog: : :

Reading irctc captchas with 95% accuracy using deep learning

github:

端到端的OCR:基于CNN的实现

blog:

I Am Robot: (Deep) Learning to Break Semantic Image CAPTCHAs

intro: automatically solving 70.78% of the image reCaptchachallenges, while requiring only 19 seconds per challenge. apply to the Facebook image captcha and achieve an accuracy of 83.5%paper:

SimGAN-Captcha

intro: Solve captcha without manually labeling a training setgithub:

Handwritten Recognition

High Performance Offline Handwritten Chinese Character Recognition Using GoogLeNet and Directional Feature Maps

arxiv: :

Recognize your handwritten numbers

Handwritten Digit Recognition using Convolutional Neural Networks in Python with Keras

blog:

MNIST Handwritten Digit Classifier

github:

如何用卷积神经网络CNN识别手写数字集?

blog:

LeNet – Convolutional Neural Network in Python

blog:

Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention

arxiv:

MLPaint: the Real-Time Handwritten Digit Recognizer

blog: : :

Training a Computer to Recognize Your Handwriting

Using TensorFlow to create your own handwriting recognition engine

blog: :

Building a Deep Handwritten Digits Classifier using Microsoft Cognitive Toolkit

blog: :

Hand Writing Recognition Using Convolutional Neural Networks

intro: This CNN-based model for recognition of hand written digits attains a validation accuracy of 99.2% after training for 12 epochs. Its trained on the MNIST dataset on Kaggle.github:

Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling

intro: 0.57 MB, performance is decreased only by 0.91%.arxiv:

Handwritten digit string recognition by combination of residual network and RNN-CTC

Plate Recognition

Reading Car License Plates Using Deep Convolutional Neural Networks and LSTMs

arxiv:

Number plate recognition with Tensorflow

blog: (Deep ANPR):

end-to-end-for-plate-recognition

github:

Segmentation-free Vehicle License Plate Recognition using ConvNet-RNN

intro: International Workshop on Advanced Image Technology, January, 8-10, 2017. Penang, Malaysia. Proceeding IWAIT2017arxiv:

License Plate Detection and Recognition Using Deeply Learned Convolutional Neural Networks

arxiv: :

Adversarial Generation of Training Examples for Vehicle License Plate Recognition

Towards End-to-End Car License Plates Detection and Recognition with Deep Neural Networks

arxiv:

Towards End-to-End License Plate Detection and Recognition: A Large Dataset and Baseline

paper: : :

High Accuracy Chinese Plate Recognition Framework

intro: 基于深度学习高性能中文车牌识别 High Performance Chinese License Plate Recognition Framework.gihtub:

LPRNet: License Plate Recognition via Deep Neural Networks

intrp=o: Intel IOTG Computer Vision Groupintro: works in real-time with recognition accuracy up to 95% for Chinese license plates: 3 ms/plate on nVIDIAR GeForceTMGTX 1080 and 1.3 ms/plate on IntelR CoreTMi7-6700K CPU.arxiv:

How many labeled license plates are needed?

intro: Chinese Conference on Pattern Recognition and Computer Visionarxiv:

Blogs

Applying OCR Technology for Receipt Recognition

blog: :

Hacking MNIST in 30 lines of Python

blog: :

Optical Character Recognition Using One-Shot Learning, RNN, and TensorFlow

Creating a Modern OCR Pipeline Using Computer Vision and Deep Learning

Projects

ocropy: Python-based tools for document analysis and OCR

github:

Extracting text from an image using Ocropus

blog:

CLSTM : A small C++ implementation of LSTM networks, focused on OCR

github:

OCR text recognition using tensorflow with attention

github: :

Digit Recognition via CNN: digital meter numbers detection

github(caffe):

Attention-OCR: Visual Attention based OCR

github:

umaru: An OCR-system based on torch using the technique of LSTM/GRU-RNN, CTC and referred to the works of rnnlib and clstm

github:

Tesseract.js: Pure Javascript OCR for 62 Languages

homepage: :

DeepHCCR: Offline Handwritten Chinese Character Recognition based on GoogLeNet and AlexNet (With CaffeModel)

github:

deep ocr: make a better chinese character recognition OCR than tesseract

Practical Deep OCR for scene text using CTPN + CRNN

Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

SSD_scene-text-detection

github: :

Videos

LSTMs for OCR

youtube:

Resources

Deep Learning for OCR

Scene Text Localization & Recognition Resources

intro: A curated list of resources dedicated to scene text localization and recognitiongithub:

Scene Text Localization & Recognition Resources

intro: 图像文本位置感知与识别的论文资源汇总github:

awesome-ocr: A curated list of promising OCR resources

标签: #ocrjs