|
1 |
| -## SimpleOCR |
| 1 | +# SimpleOCR |
2 | 2 |
|
3 |
| -SimpleOCR is a Simba plugin for reading text in Old School RuneScape. |
| 3 | +SimpleOCR is a Simba plugin for reading text in Old School RuneScape originally developed by [@slackydev](https://github.com/slackydev/SimpleOCR). |
4 | 4 |
|
5 |
| -Results: |
| 5 | +The algorithm is very much designed for the games "blocky" text where no anti aliasing is applied. Every pixel of the glyph must match for for a character to be recognized. |
6 | 6 |
|
7 |
| - |
8 |
| -``` |
9 |
| -Recognized: Talk-to Grand Exchange Clerk / 262 more options |
10 |
| -```` |
| 7 | +The actual character recognition is quite similar to calling Simba's `FindBitmap` for every character in the fontset. |
11 | 8 |
|
12 |
| - |
13 |
| -``` |
14 |
| -Recognized: Climb-up Ladder / 2 more option |
| 9 | +---- |
| 10 | + |
| 11 | +## Exported Methods |
| 12 | + |
| 13 | +```pascal |
| 14 | +function TSimpleOCR.Recognize(Area: TBox; Filter: TOCRFilter; Font: TFontSet): String; |
| 15 | +function TSimpleOCR.RecognizeStatic(Area: TBox; Filter: TOCRFilter; Font: TFontSet; MaxWalk: Integer = 20): String; |
| 16 | +function TSimpleOCR.RecognizeLines(Area: TBox; Filter: TOCRFilter; Font: TFontSet; out TextBounds: TBoxArray): TStringArray; overload; |
| 17 | +function TSimpleOCR.RecognizeLines(Area: TBox; Filter: TOCRFilter; Font: TFontSet): TStringArray; overload; |
| 18 | +function TSimpleOCR.RecognizeUpText(Area: TBox; Filter: TOCRFilter; Font: TFontSet; MaxWalk: Integer = 20): String; |
| 19 | +function TSimpleOCR.RecognizeNumber(Area: TBox; Filter: TOCRFilter; Font: TFontSet): Int64; |
| 20 | +
|
| 21 | +function TSimpleOCR.LocateText(Area: TBox; Font: TFontSet; Filter: TOCRFilter; out Bounds: TBox): Single; overload; |
| 22 | +function TSimpleOCR.LocateText(Area: TBox; Font: TFontSet; out Bounds: TBox): Single; overload; |
15 | 23 | ```
|
16 | 24 |
|
17 |
| ------ |
| 25 | +---- |
18 | 26 |
|
19 |
| -## Exports |
| 27 | +## Filters |
20 | 28 |
|
21 |
| -```pascal |
22 |
| -procedure TFontSet.Load(constref Font: String; constref Space: Int32 = 4); |
23 |
| -function TSimpleOCR.DrawText(constref Text: String; constref FontSet: TFontSet): T2DIntegerArray; |
24 |
| -function TSimpleOCR.Recognize(constref B: TBox; constref Filter: TCompareRules; constref Font: TFontSet; constref IsStatic: Boolean = False; constref MaxWalk: Int32 = 40): String; overload; |
25 |
| -function TSimpleOCR.RecognizeNumber(constref B: TBox; constref Filter: TCompareRules; constref Font: TFontSet; constref IsStatic: Boolean = False; constref MaxWalk: Int32 = 40): Int64; |
26 |
| -function TSimpleOCR.LocateText(constref B: TBox; constref Text: String; constref Font: TFontSet; constref Filter: TCompareRules; out Bounds: TBox): Single; overload; |
27 |
| -function TSimpleOCR.LocateText(constref B: TBox; constref Text: String; constref Font: TFontSet; constref Filter: TCompareRules; constref MinMatch: Single = 1): Boolean; overload; |
28 |
| -``` |
| 29 | +- ### Color filter |
29 | 30 |
|
30 |
| ------ |
| 31 | + Basic color finding. |
| 32 | + ```pascal |
| 33 | + function TOCRColorFilter.Create(Colors, Tolerances: TIntegerArray): TOCRColorFilter; static; |
| 34 | + function TOCRColorFilter.Create(Colors: TIntegerArray): TOCRColorFilter; static; overload; |
| 35 | + ``` |
| 36 | + Example: |
| 37 | + ```pascal |
| 38 | + TOCRColorFilter.Create([255]); // Find color red |
| 39 | + ``` |
| 40 | +  |
31 | 41 |
|
32 |
| -`Filter` parameter: |
| 42 | + --- |
33 | 43 |
|
34 |
| -```pascal |
35 |
| -TCompareRules = packed record |
36 |
| - Color, Tolerance: Int32; // Color and tolerance. Color can be -1 to match any color. |
37 |
| - UseShadow: Boolean; // If the fontset has a shadow, it can be used to improve recognition. |
38 |
| - ShadowMaxValue: Int32; // Max brightness of shadow, Shadows are black so this is often low. |
39 |
| - Threshold: Boolean; // Threshold the image? If so all above fields are ignored. |
40 |
| - ThresholdAmount: Int32; // Threshold amount. |
41 |
| - ThresholdInvert: Boolean; // Threshold invert? |
42 |
| - UseShadowForColor: Boolean; // Find and offset shadows by (-1,-1) and use most common color to recognize text with. |
43 |
| - MinCharacterMatch: Int32; // Minimum hits required to match a character. Useful to remove smaller characters (like dots) that are often misread. |
44 |
| -end; |
45 |
| -``` |
| 44 | +- ### Invert Color Filter |
| 45 | + |
| 46 | + Basic color finding but inverted. |
| 47 | + ```pascal |
| 48 | + function TOCRInvertColorFilter.Create(Colors, Tolerances: TIntegerArray): TOCRInvertColorFilter; static; overload; |
| 49 | + function TOCRInvertColorFilter.Create(Colors: TIntegerArray): TOCRInvertColorFilter; static; overload; |
| 50 | + ``` |
| 51 | + Example: |
| 52 | + ```pascal |
| 53 | + TOCRInvertColorFilter.Create([3358536, 0], [3, 10]); // Everything but brown and black (text shadow) |
| 54 | + ``` |
| 55 | +  |
| 56 | + |
| 57 | + --- |
46 | 58 |
|
47 |
| ------ |
| 59 | +- ### Threshold Filter |
48 | 60 |
|
49 |
| -`IsStatic` parameter: |
| 61 | + If a pixel brightness value is greater than a threshold amount, it is assigned white, else it is assigned black. |
| 62 | + ```pascal |
| 63 | + function TOCRThresholdFilter.Create(Amount: Integer; Invert: Boolean = False): TOCRThresholdFilter; static; |
| 64 | + ``` |
| 65 | + Example: |
| 66 | + ```pascal |
| 67 | + TOCRThresholdFilter.Create(10); // Orange and red are brighter than the brown background |
| 68 | + ``` |
| 69 | +  |
50 | 70 |
|
51 |
| -If the starting position (X1,Y1) of the text never changes the text is static which greatly increases accuracy and speed. If this is the case you must pass the *pixel perfect* bounds of the text you want to read. |
| 71 | + --- |
52 | 72 |
|
53 |
| ------ |
| 73 | +- ### Shadow Filter |
54 | 74 |
|
55 |
| -`MaxWalk` parameter: |
| 75 | + First shadows are found using average R,G,B value. `Shadow = (R+G+B div 3) < MaxShadowValue` |
56 | 76 |
|
57 |
| -How far the OCR looks on the X axis before giving up. By default this is `40` so if no characaters are matched in 40 pixels the function finishes. |
| 77 | + Next shadow pixels are offset by -1,-1. The most common color from the offset pixels is used for the search color. |
| 78 | + ```pascal |
| 79 | + function TOCRShadowFilter.Create(MaxShadowValue: Integer = 25; Tolerance: Integer = 5): TOCRShadowFilter; static; |
| 80 | + ``` |
| 81 | + Example: |
| 82 | + ```pascal |
| 83 | + TOCRShadowFilter.Create(); |
| 84 | + ``` |
| 85 | +  |
| 86 | + |
| 87 | +---- |
58 | 88 |
|
59 |
| ------ |
| 89 | +## Locate Text |
60 | 90 |
|
61 |
| -`LocateText` function: |
| 91 | +The `LocateText` function does **not** OCR! An image of the desired text is generated and searched for. Again quite similar to Simba's `FindBitmap`. |
| 92 | +- A filter is not required if each pixel of the text is the exact same color. Although this is quite a lot slower. |
| 93 | +- Will not work if the spacing between characters is dynamic. |
62 | 94 |
|
63 |
| -LocateText does not OCR! In simple terms a bitmap of the text is created (Using DrawText) and searched for. Color can be `-1` though each pixel of the text must be the exact same color. |
|
0 commit comments