Yuantao Feng
commited on
Commit
·
7295415
1
Parent(s):
85b92e6
Renaming model files to have more information on architecture, training data and more (#7)
Browse files* add suffix of training dataset, arch & upload time to each model
* update DB-IC15 benchmark results
README.md
CHANGED
@@ -2,11 +2,13 @@
|
|
2 |
|
3 |
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
|
4 |
|
5 |
-
|
|
|
|
|
6 |
|
7 |
## Demo
|
8 |
|
9 |
-
***NOTE
|
10 |
|
11 |
Run the following command to try the demo:
|
12 |
```shell
|
|
|
2 |
|
3 |
An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition
|
4 |
|
5 |
+
Note:
|
6 |
+
- Model source: https://docs.opencv.org/4.5.2/d9/d1e/tutorial_dnn_OCR.html.
|
7 |
+
- For details on training this model, please visit https://github.com/zihaomu/deep-text-recognition-benchmark, which can only recognize english words.
|
8 |
|
9 |
## Demo
|
10 |
|
11 |
+
***NOTE***: This demo uses [text_detection_db](../text_detection_db) as text detector.
|
12 |
|
13 |
Run the following command to try the demo:
|
14 |
```shell
|
demo.py
CHANGED
@@ -26,7 +26,7 @@ def str2bool(v):
|
|
26 |
parser = argparse.ArgumentParser(
|
27 |
description="An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition (https://arxiv.org/abs/1507.05717)")
|
28 |
parser.add_argument('--input', '-i', type=str, help='Path to the input image. Omit for using default camera.')
|
29 |
-
parser.add_argument('--model', '-m', type=str, default='
|
30 |
parser.add_argument('--width', type=int, default=736,
|
31 |
help='The width of input image being sent to the text detector.')
|
32 |
parser.add_argument('--height', type=int, default=736,
|
|
|
26 |
parser = argparse.ArgumentParser(
|
27 |
description="An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition (https://arxiv.org/abs/1507.05717)")
|
28 |
parser.add_argument('--input', '-i', type=str, help='Path to the input image. Omit for using default camera.')
|
29 |
+
parser.add_argument('--model', '-m', type=str, default='text_recognition_CRNN_VGG_BiLSTM_CTC.onnx', help='Path to the model.')
|
30 |
parser.add_argument('--width', type=int, default=736,
|
31 |
help='The width of input image being sent to the text detector.')
|
32 |
parser.add_argument('--height', type=int, default=736,
|