Pythonç¨æ·±å±¤å¦ç¿ãã¼ã¹OCRã½ãªã¥ã¼ã·ã§ã³
docTRã§ç»åããé«ç²¾åº¦ãªããã¹ãæ½åºã»èªèãå®ç¾
Pythonç¨docTR APIã¨ã¯
docTR(Document Text Recognition)ã¯ãPythonç¨ã®æ·±å±¤å¦ç¿ãã¼ã¹å 妿åèªè(OCR)ãªã¼ãã³ã½ã¼ã¹ã©ã¤ãã©ãªã§ããã¹ãã£ã³ææ¸ã»ç»åã»PDFã«å¯¾ãã¦æå 端ã®ããã¹ãæ¤åºã»èªèæ©è½ãæä¾ãã¾ããç¾ä»£çãªæ·±å±¤å¦ç¿ã¢ã¼ããã¯ãã£ãæ¡ç¨ããææ¸æ§é ãä¿æããã¾ã¾é«ç²¾åº¦ãªããã¹ãæ½åºãå®ç¾ãã¾ãã
ææ¸ãã¸ã¿ã«åãèªåãã¼ã¿æ½åºãAIããã¹ãèªèã¢ããªã±ã¼ã·ã§ã³ã«åºãå©ç¨ããã¦ãããå¤è¨èªå¯¾å¿ãææ¸ãæåèªèãGPUã¢ã¯ã»ã©ã¬ã¼ã·ã§ã³ã«ããé«éå¦çããµãã¼ããã¦ãã¾ãã
docTR APIã®ä¸»è¦æ©è½
- å é²çãªæ·±å±¤å¦ç¿OCR: ãã¥ã¼ã©ã«ãããã¯ã¼ã¯ã«ããç²¾å¯ãªããã¹ãæ¤åºã»èªè
- å¤å½¢å¼å¯¾å¿: ç»å/PDF/ã¹ãã£ã³ææ¸ãã·ã¼ã ã¬ã¹ã«å¦ç
- ææ¸ãæåèªè: é«ãç²¾åº¦ã§ææ¸ãããã¹ããèªèã»æ½åº
- å¤è¨èªå¯¾å¿: æ§ã ãªè¨èªã»æåä½ç³»ããµãã¼ã
- é«éå¦ç: GPUã¢ã¯ã»ã©ã¬ã¼ã·ã§ã³ã«ããå¹ççãªããã¹ãæ½åº
- ã¬ã¤ã¢ã¦ãä¿æ: ããã¹ãèªèæã«ææ¸æ§é ãç¶æ
- ã¹ã±ã¼ã©ãã«ãªãªã¼ãã³ã½ã¼ã¹: ç¡æã§å©ç¨å¯è½ãç¶ç¶çã«æ¹å
docTR APIã®ä½¿ç¨æ¹æ³
docTRãã¤ã³ã¹ãã¼ã«ããã«ã¯ã以ä¸ã®pipã³ãã³ããå®è¡ãã¦ãã ãã:
docTRã¤ã³ã¹ãã¼ã«
pip install python-doctr
ããé«éãªå¦çã®ããã«GPUã¢ã¯ã»ã©ã¬ã¼ã·ã§ã³ãæå¹ã«ããå ´åã¯ã追å ã§ä»¥ä¸ãã¤ã³ã¹ãã¼ã«ãã¦ãã ãã:
GPUé¢é£ããã±ã¼ã¸
pip install tensorflow-gpu torch torchvision
docTR API使ç¨ä¾
docTRã使ç¨ããããã¹ãæ½åºã®å®è£ ä¾ããç´¹ä»ãã¾ãã

ä¾1: ç»åããã®ããã¹ãæ½åº
ç»åãèªã¿è¾¼ã¿ãdocTRã§OCRå¦çãå®è¡ããããã¹ããæ½åºããä¾ã§ããä½ç½®æ å ±ä»ãã§ããã¹ããæ½åºã§ãããããæ§é åææ¸å¦çã«é©ãã¦ãã¾ãã
ç»åããã®ããã¹ãæ½åº
from doctr.io import DocumentFile
from doctr.models import ocr_predictor
doc = DocumentFile.from_images("sample.png")
model = ocr_predictor(pretrained=True)
result = model(doc)
print(result.export())
ä¾2: è¤æ°ãã¼ã¸PDFã®å¦ç
è¤æ°ãã¼ã¸ã®PDFããããã¹ããæ½åºããä¾ã§ããdocTRãåãã¼ã¸ãèªåã§å¦çãã¾ãã
PDFããã®ããã¹ãæ½åº
from doctr.io import DocumentFile
from doctr.models import ocr_predictor
doc = DocumentFile.from_pdf("sample.pdf")
model = ocr_predictor(pretrained=True)
result = model(doc)
print(result.export())
ä¾3: ææ¸ãæåã®èªè
ææ¸ãææ¸ããããã¹ããæ½åºããä¾ã§ããææ¸ãã¡ã¢ãæ´å²çææ¸ã®ãã¸ã¿ã«åã«æé©ã§ãã
ææ¸ãæåèªè
from doctr.models import ocr_predictor
from doctr.datasets import synthetic_documents
doc = synthetic_documents()[0]
model = ocr_predictor(pretrained=True)
result = model(doc)
print(result.export())
ã¾ã¨ã
docTR APIã¯ãç»åã»PDFã»ææ¸ãææ¸ããããã¹ããæ½åºããå¼·åãªæ·±å±¤å¦ç¿ãã¼ã¹OCRã½ãªã¥ã¼ã·ã§ã³ã§ããææ¸æ§é ãä¿æããã¾ã¾é«ç²¾åº¦ãªããã¹ãèªèãå®ç¾ããAIææ¸å¦çã»èªååã»ãã¼ã¿æ½åºã«æé©ã§ãã
ææ¸ãã¸ã¿ã«åãèªåãã¼ã¿å ¥åãAIããã¹ãèªèãªã©ãæ§ã ãªç¨éã«æè»ã«å¯¾å¿ãã¾ãã