Skip to content

Instantly share code, notes, and snippets.

@incubated-geek-cc
Created February 14, 2024 21:21
Show Gist options
  • Save incubated-geek-cc/c4c6ae9cd2aff4b6161a6137e829e348 to your computer and use it in GitHub Desktop.
Save incubated-geek-cc/c4c6ae9cd2aff4b6161a6137e829e348 to your computer and use it in GitHub Desktop.
HOCR output produced by Tess4J V4, a Tesseract OCR wrapper for Java.
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html>
<head>
<title></title>
<meta http-equiv="Content-Type" content="text/html;charset=utf-8" />
<meta name='ocr-system' content='tesseract'/>
</head>
<body>
<div class='ocr_page' id='page_1' title='image ""; bbox 0 0 2480 3507; ppageno 0'>
<div class='ocr_carea' id='block_1_1' title="bbox 301 311 2176 1586">
<p class='ocr_par' id='par_1_1' lang='eng' title="bbox 301 311 2176 1586">
<span class='ocr_line' id='line_1_1' title="bbox 304 311 954 357; baseline 0.002 -10; x_size 46; x_descenders 9; x_ascenders 9">
<span class='ocrx_word' id='word_1_1' title='bbox 304 311 536 348; x_wconf 96'>Placement</span>
<span class='ocrx_word' id='word_1_2' title='bbox 554 311 805 357; x_wconf 96'>Preparation</span>
<span class='ocrx_word' id='word_1_3' title='bbox 825 311 954 348; x_wconf 96'>Guide</span>
</span>
<span class='ocr_line' id='line_1_2' title="bbox 301 415 2176 462; baseline 0 -10; x_size 46; x_descenders 10; x_ascenders 9">
<span class='ocrx_word' id='word_1_4' title='bbox 301 416 393 452; x_wconf 95'>This</span>
<span class='ocrx_word' id='word_1_5' title='bbox 411 416 529 462; x_wconf 96'>guide</span>
<span class='ocrx_word' id='word_1_6' title='bbox 545 416 610 452; x_wconf 96'>will</span>
<span class='ocrx_word' id='word_1_7' title='bbox 629 425 750 452; x_wconf 96'>cover</span>
<span class='ocrx_word' id='word_1_8' title='bbox 765 416 991 462; x_wconf 96'>everything</span>
<span class='ocrx_word' id='word_1_9' title='bbox 1010 425 1086 462; x_wconf 96'>you</span>
<span class='ocrx_word' id='word_1_10' title='bbox 1106 416 1211 452; x_wconf 96'>need</span>
<span class='ocrx_word' id='word_1_11' title='bbox 1229 417 1268 452; x_wconf 96'>to</span>
<span class='ocrx_word' id='word_1_12' title='bbox 1287 425 1454 462; x_wconf 96'>prepare</span>
<span class='ocrx_word' id='word_1_13' title='bbox 1470 415 1529 452; x_wconf 96'>for</span>
<span class='ocrx_word' id='word_1_14' title='bbox 1543 425 1640 462; x_wconf 96'>your</span>
<span class='ocrx_word' id='word_1_15' title='bbox 1655 416 1792 452; x_wconf 96'>dream</span>
<span class='ocrx_word' id='word_1_16' title='bbox 1806 416 1885 462; x_wconf 96'>job.</span>
<span class='ocrx_word' id='word_1_17' title='bbox 1904 416 1987 452; x_wconf 96'>The</span>
<span class='ocrx_word' id='word_1_18' title='bbox 2006 416 2119 452; x_wconf 92'>basic</span>
<span class='ocrx_word' id='word_1_19' title='bbox 2137 425 2176 452; x_wconf 91'>s-</span>
</span>
<span class='ocr_line' id='line_1_3' title="bbox 301 473 1074 510; baseline 0 0; x_size 47.212814; x_descenders 10.212815; x_ascenders 10">
<span class='ocrx_word' id='word_1_20' title='bbox 301 474 468 510; x_wconf 92'>tructure</span>
<span class='ocrx_word' id='word_1_21' title='bbox 485 473 527 510; x_wconf 96'>of</span>
<span class='ocrx_word' id='word_1_22' title='bbox 540 473 615 510; x_wconf 96'>this</span>
<span class='ocrx_word' id='word_1_23' title='bbox 633 483 779 510; x_wconf 95'>course</span>
<span class='ocrx_word' id='word_1_24' title='bbox 798 473 829 510; x_wconf 95'>is</span>
<span class='ocrx_word' id='word_1_25' title='bbox 846 483 896 510; x_wconf 96'>as</span>
<span class='ocrx_word' id='word_1_26' title='bbox 912 473 1074 510; x_wconf 96'>follows:</span>
</span>
<span class='ocr_line' id='line_1_4' title="bbox 416 531 787 577; baseline 0 -10; x_size 46; x_descenders 10; x_ascenders 9">
<span class='ocrx_word' id='word_1_27' title='bbox 416 531 597 567; x_wconf 96'>Resume</span>
<span class='ocrx_word' id='word_1_28' title='bbox 616 531 787 577; x_wconf 95'>Building</span>
</span>
<span class='ocr_line' id='line_1_5' title="bbox 413 588 899 634; baseline 0.002 -10; x_size 47; x_descenders 10; x_ascenders 10">
<span class='ocrx_word' id='word_1_29' title='bbox 413 588 654 634; x_wconf 95'>Application</span>
<span class='ocrx_word' id='word_1_30' title='bbox 675 588 899 625; x_wconf 96'>Procedure</span>
</span>
<span class='ocr_line' id='line_1_6' title="bbox 416 646 1164 682; baseline 0 0; x_size 46.212814; x_descenders 10.212815; x_ascenders 9">
<span class='ocrx_word' id='word_1_31' title='bbox 416 646 564 682; x_wconf 96'>Details</span>
<span class='ocrx_word' id='word_1_32' title='bbox 581 646 704 682; x_wconf 96'>about</span>
<span class='ocrx_word' id='word_1_33' title='bbox 719 646 786 682; x_wconf 96'>the</span>
<span class='ocrx_word' id='word_1_34' title='bbox 805 646 998 682; x_wconf 96'>interview</span>
<span class='ocrx_word' id='word_1_35' title='bbox 1015 646 1164 682; x_wconf 96'>rounds</span>
</span>
<span class='ocr_line' id='line_1_7' title="bbox 417 703 900 749; baseline 0.002 -10; x_size 46; x_descenders 9; x_ascenders 10">
<span class='ocrx_word' id='word_1_36' title='bbox 417 703 612 740; x_wconf 96'>Interview</span>
<span class='ocrx_word' id='word_1_37' title='bbox 630 703 900 749; x_wconf 95'>Experiences</span>
</span>
<span class='ocr_line' id='line_1_8' title="bbox 416 761 756 797; baseline 0 0; x_size 46.212814; x_descenders 10.212815; x_ascenders 9">
<span class='ocrx_word' id='word_1_38' title='bbox 416 761 591 797; x_wconf 96'>Practice</span>
<span class='ocrx_word' id='word_1_39' title='bbox 608 761 756 797; x_wconf 96'>Tracks</span>
</span>
<span class='ocr_line' id='line_1_9' title="bbox 416 818 667 855; baseline 0.004 -1; x_size 47.212814; x_descenders 10.212815; x_ascenders 10">
<span class='ocrx_word' id='word_1_40' title='bbox 416 818 532 855; x_wconf 96'>Mock</span>
<span class='ocrx_word' id='word_1_41' title='bbox 547 818 667 855; x_wconf 95'>Tests</span>
</span>
<span class='ocr_line' id='line_1_10' title="bbox 417 875 1033 922; baseline 0 -10; x_size 46; x_descenders 10; x_ascenders 9">
<span class='ocrx_word' id='word_1_42' title='bbox 417 875 539 914; x_wconf 95'>FAQs</span>
<span class='ocrx_word' id='word_1_43' title='bbox 558 876 762 922; x_wconf 95'>regarding</span>
<span class='ocrx_word' id='word_1_44' title='bbox 783 876 1033 922; x_wconf 96'>placements</span>
</span>
<span class='ocr_line' id='line_1_11' title="bbox 413 933 883 970; baseline 0.002 -1; x_size 47.212814; x_descenders 10.212815; x_ascenders 10">
<span class='ocrx_word' id='word_1_45' title='bbox 413 933 629 970; x_wconf 96'>Additional</span>
<span class='ocrx_word' id='word_1_46' title='bbox 650 933 883 970; x_wconf 96'>Resources</span>
</span>
<span class='ocr_line' id='line_1_12' title="bbox 304 990 2149 1037; baseline 0 -10; x_size 46; x_descenders 10; x_ascenders 9">
<span class='ocrx_word' id='word_1_47' title='bbox 304 991 536 1027; x_wconf 96'>Placement</span>
<span class='ocrx_word' id='word_1_48' title='bbox 554 991 800 1037; x_wconf 96'>preparation</span>
<span class='ocrx_word' id='word_1_49' title='bbox 818 991 944 1037; x_wconf 96'>solely</span>
<span class='ocrx_word' id='word_1_50' title='bbox 960 991 1149 1037; x_wconf 96'>depends</span>
<span class='ocrx_word' id='word_1_51' title='bbox 1166 1000 1216 1027; x_wconf 96'>on</span>
<span class='ocrx_word' id='word_1_52' title='bbox 1235 991 1301 1027; x_wconf 96'>the</span>
<span class='ocrx_word' id='word_1_53' title='bbox 1319 1000 1519 1037; x_wconf 96'>company</span>
<span class='ocrx_word' id='word_1_54' title='bbox 1534 990 1593 1027; x_wconf 96'>for</span>
<span class='ocrx_word' id='word_1_55' title='bbox 1606 991 1730 1027; x_wconf 95'>which</span>
<span class='ocrx_word' id='word_1_56' title='bbox 1748 1000 1824 1037; x_wconf 95'>you</span>
<span class='ocrx_word' id='word_1_57' title='bbox 1843 1000 1912 1027; x_wconf 96'>are</span>
<span class='ocrx_word' id='word_1_58' title='bbox 1931 991 2149 1037; x_wconf 96'>preparing.</span>
</span>
<span class='ocr_line' id='line_1_13' title="bbox 301 1048 1923 1095; baseline -0.001 -10; x_size 48; x_descenders 11; x_ascenders 11">
<span class='ocrx_word' id='word_1_59' title='bbox 301 1048 429 1085; x_wconf 96'>There</span>
<span class='ocrx_word' id='word_1_60' title='bbox 447 1058 515 1085; x_wconf 96'>are</span>
<span class='ocrx_word' id='word_1_61' title='bbox 534 1048 722 1095; x_wconf 96'>basically</span>
<span class='ocrx_word' id='word_1_62' title='bbox 737 1048 848 1085; x_wconf 96'>three</span>
<span class='ocrx_word' id='word_1_63' title='bbox 866 1048 1044 1085; x_wconf 96'>different</span>
<span class='ocrx_word' id='word_1_64' title='bbox 1060 1048 1287 1095; x_wconf 96'>categories</span>
<span class='ocrx_word' id='word_1_65' title='bbox 1306 1048 1381 1085; x_wconf 96'>into</span>
<span class='ocrx_word' id='word_1_66' title='bbox 1397 1048 1521 1085; x_wconf 96'>which</span>
<span class='ocrx_word' id='word_1_67' title='bbox 1539 1058 1600 1085; x_wconf 96'>we</span>
<span class='ocrx_word' id='word_1_68' title='bbox 1618 1058 1693 1085; x_wconf 96'>can</span>
<span class='ocrx_word' id='word_1_69' title='bbox 1712 1048 1839 1085; x_wconf 96'>divide</span>
<span class='ocrx_word' id='word_1_70' title='bbox 1856 1048 1923 1085; x_wconf 96'>the</span>
</span>
<span class='ocr_line' id='line_1_14' title="bbox 302 1105 2085 1152; baseline 0 -10; x_size 46; x_descenders 10; x_ascenders 9">
<span class='ocrx_word' id='word_1_71' title='bbox 302 1106 540 1152; x_wconf 96'>companies</span>
<span class='ocrx_word' id='word_1_72' title='bbox 557 1106 705 1152; x_wconf 96'>visiting</span>
<span class='ocrx_word' id='word_1_73' title='bbox 724 1115 949 1152; x_wconf 96'>campuses</span>
<span class='ocrx_word' id='word_1_74' title='bbox 965 1105 1024 1142; x_wconf 96'>for</span>
<span class='ocrx_word' id='word_1_75' title='bbox 1040 1106 1291 1152; x_wconf 95'>placements</span>
<span class='ocrx_word' id='word_1_76' title='bbox 1310 1106 1439 1142; x_wconf 96'>based</span>
<span class='ocrx_word' id='word_1_77' title='bbox 1458 1115 1509 1142; x_wconf 96'>on</span>
<span class='ocrx_word' id='word_1_78' title='bbox 1527 1106 1624 1142; x_wconf 96'>their</span>
<span class='ocrx_word' id='word_1_79' title='bbox 1640 1106 1886 1142; x_wconf 96'>recruitment</span>
<span class='ocrx_word' id='word_1_80' title='bbox 1904 1115 2085 1152; x_wconf 96'>process.</span>
</span>
<span class='ocr_line' id='line_1_15' title="bbox 343 1194 772 1231; baseline 0.002 -1; x_size 47.212814; x_descenders 10.212815; x_ascenders 10">
<span class='ocrx_word' id='word_1_81' title='bbox 343 1194 375 1230; x_wconf 96'>1.</span>
<span class='ocrx_word' id='word_1_82' title='bbox 416 1194 530 1231; x_wconf 95'>Mass</span>
<span class='ocrx_word' id='word_1_83' title='bbox 550 1194 772 1231; x_wconf 95'>Recruiters</span>
</span>
<span class='ocr_line' id='line_1_16' title="bbox 339 1251 680 1288; baseline 0 0; x_size 47.212814; x_descenders 10.212815; x_ascenders 10">
<span class='ocrx_word' id='word_1_84' title='bbox 339 1252 375 1288; x_wconf 96'>2.</span>
<span class='ocrx_word' id='word_1_85' title='bbox 414 1252 520 1288; x_wconf 96'>Tech</span>
<span class='ocrx_word' id='word_1_86' title='bbox 540 1251 680 1288; x_wconf 96'>Giants</span>
</span>
<span class='ocr_line' id='line_1_17' title="bbox 340 1309 806 1355; baseline 0 -9; x_size 46; x_descenders 9; x_ascenders 10">
<span class='ocrx_word' id='word_1_87' title='bbox 340 1309 375 1346; x_wconf 95'>3.</span>
<span class='ocrx_word' id='word_1_88' title='bbox 415 1309 561 1346; x_wconf 78'>Others</span>
<span class='ocrx_word' id='word_1_89' title='bbox 565 1305 584 1359; x_wconf 81'>/</span>
<span class='ocrx_word' id='word_1_90' title='bbox 607 1309 806 1355; x_wconf 95'>Start-ups</span>
</span>
<span class='ocr_line' id='line_1_18' title="bbox 303 1424 2174 1471; baseline -0.001 -10; x_size 47; x_descenders 10; x_ascenders 10">
<span class='ocrx_word' id='word_1_91' title='bbox 303 1424 551 1470; x_wconf 95'>Companies</span>
<span class='ocrx_word' id='word_1_92' title='bbox 570 1424 780 1471; x_wconf 96'>belonging</span>
<span class='ocrx_word' id='word_1_93' title='bbox 798 1426 837 1461; x_wconf 96'>to</span>
<span class='ocrx_word' id='word_1_94' title='bbox 854 1424 920 1461; x_wconf 96'>the</span>
<span class='ocrx_word' id='word_1_95' title='bbox 938 1424 1070 1461; x_wconf 96'>above</span>
<span class='ocrx_word' id='word_1_96' title='bbox 1088 1424 1315 1471; x_wconf 96'>categories</span>
<span class='ocrx_word' id='word_1_97' title='bbox 1334 1424 1437 1461; x_wconf 96'>have</span>
<span class='ocrx_word' id='word_1_98' title='bbox 1454 1424 1551 1461; x_wconf 96'>their</span>
<span class='ocrx_word' id='word_1_99' title='bbox 1565 1434 1652 1461; x_wconf 96'>own</span>
<span class='ocrx_word' id='word_1_100' title='bbox 1672 1424 1919 1461; x_wconf 96'>recruitment</span>
<span class='ocrx_word' id='word_1_101' title='bbox 1937 1434 2118 1470; x_wconf 96'>process.</span>
<span class='ocrx_word' id='word_1_102' title='bbox 2141 1424 2174 1460; x_wconf 96'>In</span>
</span>
<span class='ocr_line' id='line_1_19' title="bbox 301 1481 2098 1528; baseline 0 -10; x_size 46; x_descenders 10; x_ascenders 9">
<span class='ocrx_word' id='word_1_103' title='bbox 301 1482 376 1518; x_wconf 96'>this</span>
<span class='ocrx_word' id='word_1_104' title='bbox 394 1491 551 1525; x_wconf 95'>course,</span>
<span class='ocrx_word' id='word_1_105' title='bbox 570 1491 631 1518; x_wconf 95'>we</span>
<span class='ocrx_word' id='word_1_106' title='bbox 648 1482 713 1518; x_wconf 96'>will</span>
<span class='ocrx_word' id='word_1_107' title='bbox 731 1483 786 1528; x_wconf 96'>try</span>
<span class='ocrx_word' id='word_1_108' title='bbox 800 1483 840 1518; x_wconf 96'>to</span>
<span class='ocrx_word' id='word_1_109' title='bbox 857 1491 978 1518; x_wconf 95'>cover</span>
<span class='ocrx_word' id='word_1_110' title='bbox 993 1491 1113 1528; x_wconf 95'>every</span>
<span class='ocrx_word' id='word_1_111' title='bbox 1130 1482 1309 1528; x_wconf 95'>possible</span>
<span class='ocrx_word' id='word_1_112' title='bbox 1326 1482 1441 1518; x_wconf 95'>detail</span>
<span class='ocrx_word' id='word_1_113' title='bbox 1461 1482 1638 1528; x_wconf 96'>required</span>
<span class='ocrx_word' id='word_1_114' title='bbox 1656 1483 1695 1518; x_wconf 96'>to</span>
<span class='ocrx_word' id='word_1_115' title='bbox 1715 1482 1828 1518; x_wconf 96'>know</span>
<span class='ocrx_word' id='word_1_116' title='bbox 1843 1481 1902 1518; x_wconf 96'>for</span>
<span class='ocrx_word' id='word_1_117' title='bbox 1917 1482 2098 1528; x_wconf 96'>cracking</span>
</span>
<span class='ocr_line' id='line_1_20' title="bbox 303 1539 1755 1586; baseline 0.001 -11; x_size 47; x_descenders 10; x_ascenders 10">
<span class='ocrx_word' id='word_1_118' title='bbox 303 1539 497 1576; x_wconf 96'>interview</span>
<span class='ocrx_word' id='word_1_119' title='bbox 512 1539 554 1576; x_wconf 95'>of</span>
<span class='ocrx_word' id='word_1_120' title='bbox 568 1539 635 1576; x_wconf 95'>the</span>
<span class='ocrx_word' id='word_1_121' title='bbox 652 1539 890 1585; x_wconf 95'>companies</span>
<span class='ocrx_word' id='word_1_122' title='bbox 906 1539 1033 1586; x_wconf 96'>falling</span>
<span class='ocrx_word' id='word_1_123' title='bbox 1054 1539 1086 1575; x_wconf 96'>in</span>
<span class='ocrx_word' id='word_1_124' title='bbox 1105 1539 1208 1576; x_wconf 96'>each</span>
<span class='ocrx_word' id='word_1_125' title='bbox 1227 1539 1268 1576; x_wconf 96'>of</span>
<span class='ocrx_word' id='word_1_126' title='bbox 1282 1539 1349 1576; x_wconf 96'>the</span>
<span class='ocrx_word' id='word_1_127' title='bbox 1366 1539 1498 1576; x_wconf 96'>above</span>
<span class='ocrx_word' id='word_1_128' title='bbox 1516 1539 1755 1586; x_wconf 96'>categories.</span>
</span>
</p>
</div>
</div>
</body>
</html>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment