cudanexus
/
ocr-surya
Surya is a document OCR toolkit that does:
Prediction
cudanexus/ocr-surya:7ab5bedeIDboxpbijbbkpujtmdcgxz5q5ofmStatusSucceededSourceWebHardwareT4Total durationCreatedInput
{ "image": "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" }
Install Replicate’s Node.js client library:npm install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", { input: { image: "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", action: "Run OCR", page_number: 1, languages_input: "English", languages_choices: "English" } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:import replicate
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", input={ "image": "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", "input": { "image": "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2024-02-28T19:55:07.347179Z", "created_at": "2024-02-28T19:50:52.872348Z", "data_removed": false, "error": null, "id": "boxpbijbbkpujtmdcgxz5q5ofm", "input": { "image": "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" }, "logs": "Detecting bboxes: 0%| | 0/1 [00:00<?, ?it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:01<00:00, 1.17s/it]\nDetecting bboxes: 100%|██████████| 1/1 [00:01<00:00, 1.17s/it]\nRecognizing Text: 0%| | 0/1 [00:00<?, ?it/s]/root/.pyenv/versions/3.11.8/lib/python3.11/site-packages/transformers/generation/utils.py:1518: UserWarning: You have modified the pretrained model configuration to control generation. This is a deprecated strategy to control generation and will be removed soon, in a future version. Please use and modify the model generation configuration (see https://huggingface.co/docs/transformers/generation_strategies#default-text-generation-configuration )\nwarnings.warn(\nRecognizing Text: 100%|██████████| 1/1 [00:04<00:00, 4.77s/it]\nRecognizing Text: 100%|██████████| 1/1 [00:04<00:00, 4.77s/it]\n<tempfile._TemporaryFileWrapper object at 0x7dbfd272e190> /tmp/tmpup9030rb.jpg", "metrics": { "predict_time": 7.989043, "total_time": 254.474831 }, "output": { "text": "OCR completed.", "image": "https://replicate.delivery/pbxt/pz8BUQIXW6KPJZGw98JlWzpGP59JFKQr6e2SYvshNTfaGUbSA/tmpup9030rb.jpg", "jsona": "https://replicate.delivery/pbxt/fJj6xOVoedjMXkVVWepkFWST2zPdeMK7q6Eqif4bMOqbzgaTC/ocr_text.txt" }, "started_at": "2024-02-28T19:54:59.358136Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/boxpbijbbkpujtmdcgxz5q5ofm", "cancel": "https://api.replicate.com/v1/predictions/boxpbijbbkpujtmdcgxz5q5ofm/cancel" }, "version": "6bbfc505be178d4c78abd48f0facb5f17dc8adbdc3f8401b708c78365d122318" }
Generated inDetecting bboxes: 0%| | 0/1 [00:00<?, ?it/s] Detecting bboxes: 100%|██████████| 1/1 [00:01<00:00, 1.17s/it] Detecting bboxes: 100%|██████████| 1/1 [00:01<00:00, 1.17s/it] Recognizing Text: 0%| | 0/1 [00:00<?, ?it/s]/root/.pyenv/versions/3.11.8/lib/python3.11/site-packages/transformers/generation/utils.py:1518: UserWarning: You have modified the pretrained model configuration to control generation. This is a deprecated strategy to control generation and will be removed soon, in a future version. Please use and modify the model generation configuration (see https://huggingface.co/docs/transformers/generation_strategies#default-text-generation-configuration ) warnings.warn( Recognizing Text: 100%|██████████| 1/1 [00:04<00:00, 4.77s/it] Recognizing Text: 100%|██████████| 1/1 [00:04<00:00, 4.77s/it] <tempfile._TemporaryFileWrapper object at 0x7dbfd272e190> /tmp/tmpup9030rb.jpg
Prediction
cudanexus/ocr-surya:7ab5bedeIDg7gp2ejbefzmbe6uuzbjhwprkqStatusSucceededSourceWebHardwareT4Total durationCreatedInput
{ "image": "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" }
Install Replicate’s Node.js client library:npm install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", { input: { image: "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", action: "Run Text Detection", page_number: 1, languages_input: "English", languages_choices: "English" } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:import replicate
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", input={ "image": "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", "input": { "image": "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2024-02-28T20:13:42.327164Z", "created_at": "2024-02-28T20:13:41.458962Z", "data_removed": false, "error": null, "id": "g7gp2ejbefzmbe6uuzbjhwprkq", "input": { "image": "https://replicate.delivery/pbxt/KU3ZDwmFqwo7tsfY5m8OsN0XDJLqk2lvgSKOT5s7HFZOqkNq/D5300-1.jpg", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" }, "logs": "Detecting bboxes: 0%| | 0/1 [00:00<?, ?it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.78it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.76it/s]\n<tempfile._TemporaryFileWrapper object at 0x7be1980f0ad0> /tmp/tmpul0fjgo_.jpg", "metrics": { "predict_time": 0.849333, "total_time": 0.868202 }, "output": { "image": "https://replicate.delivery/pbxt/fdUwKIme1ooQF0UBJC3WLOJUbaWZBHNIe9pcl70Dtwttvo2kA/tmpul0fjgo_.jpg", "Status": "Text detection completed." }, "started_at": "2024-02-28T20:13:41.477831Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/g7gp2ejbefzmbe6uuzbjhwprkq", "cancel": "https://api.replicate.com/v1/predictions/g7gp2ejbefzmbe6uuzbjhwprkq/cancel" }, "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce" }
Generated inDetecting bboxes: 0%| | 0/1 [00:00<?, ?it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.78it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.76it/s] <tempfile._TemporaryFileWrapper object at 0x7be1980f0ad0> /tmp/tmpul0fjgo_.jpg
Prediction
cudanexus/ocr-surya:7ab5bedeIDok3y7vjbfvrpj6hkmxu5axszyaStatusSucceededSourceWebHardwareT4Total durationCreatedInput
- image
- nougat.pdf
- action
- Run OCR
- page_number
- 1
- languages_input
- English
- languages_choices
- English
{ "image": "https://replicate.delivery/pbxt/KU59tfDSZgrg3LGXzGkZovCun813YKQJssN7FJOJNAWCgmxT/nougat.pdf", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" }
Install Replicate’s Node.js client library:npm install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", { input: { image: "https://replicate.delivery/pbxt/KU59tfDSZgrg3LGXzGkZovCun813YKQJssN7FJOJNAWCgmxT/nougat.pdf", action: "Run OCR", page_number: 1, languages_input: "English", languages_choices: "English" } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:import replicate
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", input={ "image": "https://replicate.delivery/pbxt/KU59tfDSZgrg3LGXzGkZovCun813YKQJssN7FJOJNAWCgmxT/nougat.pdf", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", "input": { "image": "https://replicate.delivery/pbxt/KU59tfDSZgrg3LGXzGkZovCun813YKQJssN7FJOJNAWCgmxT/nougat.pdf", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2024-02-28T20:14:20.770314Z", "created_at": "2024-02-28T20:14:11.866079Z", "data_removed": false, "error": null, "id": "ok3y7vjbfvrpj6hkmxu5axszya", "input": { "image": "https://replicate.delivery/pbxt/KU59tfDSZgrg3LGXzGkZovCun813YKQJssN7FJOJNAWCgmxT/nougat.pdf", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" }, "logs": "Detecting bboxes: 0%| | 0/1 [00:00<?, ?it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.77it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.75it/s]\nRecognizing Text: 0%| | 0/1 [00:00<?, ?it/s]\nRecognizing Text: 100%|██████████| 1/1 [00:04<00:00, 4.39s/it]\nRecognizing Text: 100%|██████████| 1/1 [00:04<00:00, 4.39s/it]\n<tempfile._TemporaryFileWrapper object at 0x7be197f934d0> /tmp/tmp0cx01tzw.jpg", "metrics": { "predict_time": 8.887425, "total_time": 8.904235 }, "output": { "image": "https://replicate.delivery/pbxt/NMYlJjzCWWLxHFcNyFY5QfGVYvYw9soAiNCRoEn85VKOMqNJA/tmp0cx01tzw.jpg", "Status": "OCR completed.", "text_file": "https://replicate.delivery/pbxt/3RwFWIkd1pLjC5HjMg0epf9V5xqOsGQqQlXRl2EerOj4wo2kA/ocr_text.txt" }, "started_at": "2024-02-28T20:14:11.882889Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/ok3y7vjbfvrpj6hkmxu5axszya", "cancel": "https://api.replicate.com/v1/predictions/ok3y7vjbfvrpj6hkmxu5axszya/cancel" }, "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce" }
Generated inDetecting bboxes: 0%| | 0/1 [00:00<?, ?it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.77it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.75it/s] Recognizing Text: 0%| | 0/1 [00:00<?, ?it/s] Recognizing Text: 100%|██████████| 1/1 [00:04<00:00, 4.39s/it] Recognizing Text: 100%|██████████| 1/1 [00:04<00:00, 4.39s/it] <tempfile._TemporaryFileWrapper object at 0x7be197f934d0> /tmp/tmp0cx01tzw.jpg
Prediction
cudanexus/ocr-surya:7ab5bedeIDwfoktkbbr67vhvmwqowd26c6aaStatusSucceededSourceWebHardwareT4Total durationCreatedInput
- image
- nougat.pdf
- action
- Run Text Detection
- page_number
- 1
- languages_input
- English
- languages_choices
- English
{ "image": "https://replicate.delivery/pbxt/KU5ADqkUo7mV1GE2U863GGqvvVaaefkwl7g5IFKciks2adrs/nougat.pdf", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" }
Install Replicate’s Node.js client library:npm install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", { input: { image: "https://replicate.delivery/pbxt/KU5ADqkUo7mV1GE2U863GGqvvVaaefkwl7g5IFKciks2adrs/nougat.pdf", action: "Run Text Detection", page_number: 1, languages_input: "English", languages_choices: "English" } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:import replicate
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", input={ "image": "https://replicate.delivery/pbxt/KU5ADqkUo7mV1GE2U863GGqvvVaaefkwl7g5IFKciks2adrs/nougat.pdf", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", "input": { "image": "https://replicate.delivery/pbxt/KU5ADqkUo7mV1GE2U863GGqvvVaaefkwl7g5IFKciks2adrs/nougat.pdf", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2024-02-28T20:14:33.890384Z", "created_at": "2024-02-28T20:14:32.656793Z", "data_removed": false, "error": null, "id": "wfoktkbbr67vhvmwqowd26c6aa", "input": { "image": "https://replicate.delivery/pbxt/KU5ADqkUo7mV1GE2U863GGqvvVaaefkwl7g5IFKciks2adrs/nougat.pdf", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" }, "logs": "Detecting bboxes: 0%| | 0/1 [00:00<?, ?it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.77it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.75it/s]\n<tempfile._TemporaryFileWrapper object at 0x7be247fb2710> /tmp/tmpwcx1a7di.jpg", "metrics": { "predict_time": 1.215204, "total_time": 1.233591 }, "output": { "image": "https://replicate.delivery/pbxt/7eclQz5Z30U3FCeBIQZhWb2UY5bggLKg0S5oU12mP7DpYUbSA/tmpwcx1a7di.jpg", "Status": "Text detection completed." }, "started_at": "2024-02-28T20:14:32.675180Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/wfoktkbbr67vhvmwqowd26c6aa", "cancel": "https://api.replicate.com/v1/predictions/wfoktkbbr67vhvmwqowd26c6aa/cancel" }, "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce" }
Generated inDetecting bboxes: 0%| | 0/1 [00:00<?, ?it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.77it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.75it/s] <tempfile._TemporaryFileWrapper object at 0x7be247fb2710> /tmp/tmpwcx1a7di.jpg
Prediction
cudanexus/ocr-surya:7ab5bedeIDk2ksa6zb2pyuljzq3m6hn3mwomStatusSucceededSourceWebHardwareT4Total durationCreatedInput
- action
- Run Text Detection
- page_number
- 1
- languages_input
- English
- languages_choices
- English
{ "image": "https://replicate.delivery/pbxt/KU5AVFEyFfwzpK2LQzUByW9TAWlFZfDseNCDwzr8ppBN2d5W/calculus00marciala_0136.pdf", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" }
Install Replicate’s Node.js client library:npm install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", { input: { image: "https://replicate.delivery/pbxt/KU5AVFEyFfwzpK2LQzUByW9TAWlFZfDseNCDwzr8ppBN2d5W/calculus00marciala_0136.pdf", action: "Run Text Detection", page_number: 1, languages_input: "English", languages_choices: "English" } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:import replicate
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", input={ "image": "https://replicate.delivery/pbxt/KU5AVFEyFfwzpK2LQzUByW9TAWlFZfDseNCDwzr8ppBN2d5W/calculus00marciala_0136.pdf", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", "input": { "image": "https://replicate.delivery/pbxt/KU5AVFEyFfwzpK2LQzUByW9TAWlFZfDseNCDwzr8ppBN2d5W/calculus00marciala_0136.pdf", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2024-02-28T20:14:50.299194Z", "created_at": "2024-02-28T20:14:49.338949Z", "data_removed": false, "error": null, "id": "k2ksa6zb2pyuljzq3m6hn3mwom", "input": { "image": "https://replicate.delivery/pbxt/KU5AVFEyFfwzpK2LQzUByW9TAWlFZfDseNCDwzr8ppBN2d5W/calculus00marciala_0136.pdf", "action": "Run Text Detection", "page_number": 1, "languages_input": "English", "languages_choices": "English" }, "logs": "Detecting bboxes: 0%| | 0/1 [00:00<?, ?it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.77it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.74it/s]\n<tempfile._TemporaryFileWrapper object at 0x7be1986ccc90> /tmp/tmpnkpp104h.jpg", "metrics": { "predict_time": 0.944825, "total_time": 0.960245 }, "output": { "image": "https://replicate.delivery/pbxt/mrXhGF1nf40Fc6JrlSpxwmB94eC1s9vqJ5yFmzMqkQ76YUbSA/tmpnkpp104h.jpg", "Status": "Text detection completed." }, "started_at": "2024-02-28T20:14:49.354369Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/k2ksa6zb2pyuljzq3m6hn3mwom", "cancel": "https://api.replicate.com/v1/predictions/k2ksa6zb2pyuljzq3m6hn3mwom/cancel" }, "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce" }
Generated inDetecting bboxes: 0%| | 0/1 [00:00<?, ?it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.77it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.74it/s] <tempfile._TemporaryFileWrapper object at 0x7be1986ccc90> /tmp/tmpnkpp104h.jpg
Prediction
cudanexus/ocr-surya:7ab5bedeIDsjhwlkjbqf2o72tr54d3lcusd4StatusSucceededSourceWebHardwareT4Total durationCreatedInput
- action
- Run OCR
- page_number
- 1
- languages_input
- English
- languages_choices
- English
{ "image": "https://replicate.delivery/pbxt/KU3d6Hvdihst8NTXv9PcgQhhA8zzaSJQsEU7XtebaB3j23q4/calculus00marciala_0136.pdf", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" }
Install Replicate’s Node.js client library:npm install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import and set up the client:import Replicate from "replicate"; const replicate = new Replicate({ auth: process.env.REPLICATE_API_TOKEN, });
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
const output = await replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", { input: { image: "https://replicate.delivery/pbxt/KU3d6Hvdihst8NTXv9PcgQhhA8zzaSJQsEU7XtebaB3j23q4/calculus00marciala_0136.pdf", action: "Run OCR", page_number: 1, languages_input: "English", languages_choices: "English" } } ); console.log(output);
To learn more, take a look at the guide on getting started with Node.js.
Install Replicate’s Python client library:pip install replicate
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Import the client:import replicate
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
output = replicate.run( "cudanexus/ocr-surya:7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", input={ "image": "https://replicate.delivery/pbxt/KU3d6Hvdihst8NTXv9PcgQhhA8zzaSJQsEU7XtebaB3j23q4/calculus00marciala_0136.pdf", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" } ) print(output)
To learn more, take a look at the guide on getting started with Python.
Set theREPLICATE_API_TOKEN
environment variable:export REPLICATE_API_TOKEN=<paste-your-token-here>
Find your API token in your account settings.
Run cudanexus/ocr-surya using Replicate’s API. Check out the model's schema for an overview of inputs and outputs.
curl -s -X POST \ -H "Authorization: Bearer $REPLICATE_API_TOKEN" \ -H "Content-Type: application/json" \ -H "Prefer: wait" \ -d $'{ "version": "7ab5bedee2cd1f0c82b2df6718d19bf0b473f738f9db062f122e47e1467f96ce", "input": { "image": "https://replicate.delivery/pbxt/KU3d6Hvdihst8NTXv9PcgQhhA8zzaSJQsEU7XtebaB3j23q4/calculus00marciala_0136.pdf", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" } }' \ https://api.replicate.com/v1/predictions
To learn more, take a look at Replicate’s HTTP API reference docs.
Output
{ "completed_at": "2024-02-28T18:34:11.999201Z", "created_at": "2024-02-28T18:34:08.157416Z", "data_removed": false, "error": null, "id": "sjhwlkjbqf2o72tr54d3lcusd4", "input": { "image": "https://replicate.delivery/pbxt/KU3d6Hvdihst8NTXv9PcgQhhA8zzaSJQsEU7XtebaB3j23q4/calculus00marciala_0136.pdf", "action": "Run OCR", "page_number": 1, "languages_input": "English", "languages_choices": "English" }, "logs": "Detecting bboxes: 0%| | 0/1 [00:00<?, ?it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.76it/s]\nDetecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.74it/s]\nRecognizing Text: 0%| | 0/1 [00:00<?, ?it/s]\nRecognizing Text: 100%|██████████| 1/1 [00:01<00:00, 1.73s/it]\nRecognizing Text: 100%|██████████| 1/1 [00:01<00:00, 1.73s/it]\n<tempfile._TemporaryFileWrapper object at 0x788a9234b290> /tmp/tmpar2uafk9.jpg", "metrics": { "predict_time": 3.82614, "total_time": 3.841785 }, "output": [ "https://replicate.delivery/pbxt/Ue5XJomNnY1wWqYZ03VVefG8blNqZi7cpfMJUTCFZbiNqLtJB/tmpar2uafk9.jpg", "OCR completed.", "https://replicate.delivery/pbxt/afKmPfw6J2ux4ELuRvxzVpKpk6q7YoQXLZB2DnSrWXUj6SbSA/ocr_text.txt" ], "started_at": "2024-02-28T18:34:08.173061Z", "status": "succeeded", "urls": { "get": "https://api.replicate.com/v1/predictions/sjhwlkjbqf2o72tr54d3lcusd4", "cancel": "https://api.replicate.com/v1/predictions/sjhwlkjbqf2o72tr54d3lcusd4/cancel" }, "version": "598f8fef138fda202918b8024d0113177e79f3d4590e83b00c5b3089fa011487" }
Generated inDetecting bboxes: 0%| | 0/1 [00:00<?, ?it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.76it/s] Detecting bboxes: 100%|██████████| 1/1 [00:00<00:00, 6.74it/s] Recognizing Text: 0%| | 0/1 [00:00<?, ?it/s] Recognizing Text: 100%|██████████| 1/1 [00:01<00:00, 1.73s/it] Recognizing Text: 100%|██████████| 1/1 [00:01<00:00, 1.73s/it] <tempfile._TemporaryFileWrapper object at 0x788a9234b290> /tmp/tmpar2uafk9.jpg
Want to make some of these yourself?
Run this model