New Beta 3.2.98

jirou · Apr 8, 2025

Hello, since the new update to the OCR, the Screen OCR seems to be giving me this message no matter what. Before I could just drag the little window over whatever I wanted the OCR to parse. Note how actual characters are blocked out, but clickable as with 进士 in my screenshot.

Am I misunderstanding something about the new update or is this a bug of some kind? I'm on Google Pixel 7. Hope I'm not repeating anything, as far as I could tell the already discussed issues were different. Screen Reader seems to work as always, if not better.

mikelove · Apr 8, 2025

Thanks. Does the message go away if you disable the option to hide recognized text? (We actually were planning to not offer that option with the new engine, since unfortunately while it’s generally more accurate it’s less precise about reporting character positions)

jirou · Apr 9, 2025

mikelove said:
Thanks. Does the message go away if you disable the option to hide recognized text? (We actually were planning to not offer that option with the new engine, since unfortunately while it’s generally more accurate it’s less precise about reporting character positions)

Yup, that fixed it, thanks!
Another thing I forgot to mention: when I enable the legacy OCR algorithm option, the green frame/window pops up for a second, only to then crash. As of right now the legacy option seems unusable for me.

I must admit that I thought the legacy version of the screen reader to be more suitable for my needs (but I recognize that I haven't had much time to fully test out the new version of course, so not a definitive statement!). Precision in reporting character positions does seem to get lost as you've mentioned, before I could even parse a whole manga page and it would recognize everything with the Google OCR most times. Something that doesn't work on my iPad or on the new algorithm it seems.

Thank you for your continued hard work on this great app!

mikelove · Apr 9, 2025

jirou said:
Another thing I forgot to mention: when I enable the legacy OCR algorithm option, the green frame/window pops up for a second, only to then crash. As of right now the legacy option seems unusable for me.

Thanks, just uploaded a fix for this.

jirou said:
Precision in reporting character positions does seem to get lost as you've mentioned

Yeah, not much we can do about that one unless we want to never update our OCR algorithm, and for anything other than clean straight on-screen text like with Screen OCR the need for a change was pretty clear. We are looking at some other options for Screen OCR specifically, though.

jirou said:
Something that doesn't work on my iPad or on the new algorithm it seems.

Sorry, could you go into a little more detail about how the behavior on the iPad differs from that in old Android system? iPad uses the exact same algorithm, so if it's not working on there then that suggests something else wrong besides the algorithm change.

Dkhr77 · Apr 9, 2025

Possibly found a bug that hasn't been in version 97. Sometimes the previous selected main page (history, continue test, organize) will stay blank after jumping to another word in the opened entry, mainly after searching. I often encountered that while doing searches in dictionary during a test.

I can reproduce it with the history screen by exactly following these steps: fresh open pleco, select history, select dictionary, search for any entry you like, open it. Then in the headword or description, click on another 汉字 (one without a direct link) to open it's description popup, then click the search glass icon to search for that entry, finally open that entry from the search result list. Then use the back icon and open the history screen again, it just shows "Pleco" as headline.

First found that in build April 2nd, currently recorded in the 8th of April version.

mikelove · Apr 9, 2025

Thanks - reproduced and fixed for the next build.

mikelove · Apr 9, 2025

The next beta - now in review - adds a new option in Settings / OCR for *another* new OCR algorithm, this one an open-source algorithm from Baidu, just for still + screen OCR. They have a whole bunch of them and we picked the biggest / slowest one to see if it provides any meaningful accuracy improvements over Google's algorithm in exchange for that extra size and performance cost. (our testing suggests it might)

jirou · Apr 10, 2025

mikelove said:
Sorry, could you go into a little more detail about how the behavior on the iPad differs from that in old Android system? iPad uses the exact same algorithm, so if it's not working on there then that suggests something else wrong besides the algorithm change.

What I meant was that, on my Google Pixel, the Still OCR (as well as Screen OCR) had previously allowed me to parse the whole screen very accurately even if there were images involved, like in a manga for example.

On the iPad on the other hand the OCR works beautifully for regular PDFs that are just text, but barely recognizes anything when scanning a whole manga page at once. I know there are options between using Apple's OCR, Pleco's OCR, and a combined method but in the case of a manga page it doesn't seem to matter and none of them really work. Tried on Legacy Pleco and the Beta.

I know this use case probably isn't the priority lol, it was just strange to me given Google's OCR seemed to work flawlessly vs. Apple's. Though I have to add the iPad I use is old - a 2017 iPad Pro. It's on iPad OS 17.7.5, perhaps the hardware is just to old to make use of the latest versions of Apple's OCR?

mikelove said:
Yeah, not much we can do about that one unless we want to never update our OCR algorithm, and for anything other than clean straight on-screen text like with Screen OCR the need for a change was pretty clear. We are looking at some other options for Screen OCR specifically, though.

Completely understandable!

mikelove said:
The next beta - now in review - adds a new option in Settings / OCR for *another* new OCR algorithm, this one an open-source algorithm from Baidu, just for still + screen OCR. They have a whole bunch of them and we picked the biggest / slowest one to see if it provides any meaningful accuracy improvements over Google's algorithm in exchange for that extra size and performance cost. (our testing suggests it might)

Excited to try this one out as well.

Thanks again for your continued development, fast responses and open communication!

mikelove · Apr 10, 2025

jirou said:
I know this use case probably isn't the priority lol, it was just strange to me given Google's OCR seemed to work flawlessly vs. Apple's. Though I have to add the iPad I use is old - a 2017 iPad Pro. It's on iPad OS 17.7.5, perhaps the hardware is just to old to make use of the latest versions of Apple's OCR?

What's confusing is that the old algorithm is literally identical - same exact ML model - between iOS and Android. It's ancient - originally developed for scanning business cards on Windows Mobile devices - so there should be no performance issue on a 2017 iPad, and no difference in general between how it performs on Android and iOS.

Perhaps it has something to do with the way the PDF is being rendered, resolution or some such - I don't suppose you have an example of a file that works well on Android and badly on iPad that you could email or PM me?

Both of the new algorithms are also available on iOS, so while we're currently feeling like Apple's algorithm (which we're likewise making the primary / default option in the next 4.0 beta, even for live OCR) is good enough - and has the advantage of being built into the OS so it's zero maintenance and doesn't take up any space - if people find that one of the Android ones is better for some use cases it wouldn't be hard to support it too.

SunAtEight · May 12, 2025

Great to see the new features listed and look forward to trying to them out! Just came across a bug using the handwriting recognition to look up the character 䞍. The new handwriting algorithm absolutely refused to recognize it (as did handwriting input in Gboard—is there a connection?), but after some frustration and using an online character database to confirm the character and confirming it was in Pleco, I then turned on "Legacy HWR algorithm" and Pleco immediately recognized it. On a sidenote, I also couldn't find it through radical input (before I moved onto the other steps).

mikelove · May 12, 2025

Thanks.

䞍 is in the rare character block CJK Unified Extension A, so I'm afraid this is actually the expected behavior with the new HWR - support for ExtA was the main feature of our 'enhanced handwriting' add-on, and after virtually nobody bought that we concluded it wasn't something people cared about enough to keep offering it, when doing so a) entailed an expensive fixed annual license fee which those sales weren't remotely covering (and which we could not plausibly imagine was convincing many people to buy bundles) and b) introduced an extra 6,582 potential false positives, since people who got that add-on in bundles rarely knew about the option to disable recognition of rare characters.

As a practical matter, the main purpose of offering handwriting input in Pleco nowadays - despite the availability of free Chinese handwriting keyboards built into iOS and Android - is "make it really easy for people to input characters with handwriting without having to figure out a bunch of settings screens," so that's kind of what we have to optimize around; offer something similar to those keyboards in accuracy, but which you can get at on any device with a single tap without having to figure out how to turn on Chinese input support on your phone. (back in 2009 our handwriting was also a lot more accurate than theirs, but it's hard to make that argument now)

But we have no plans to drop the 'legacy algorithm' option for people who want that extra character set support - we never want to take away anything people already had, we have dictionaries in our catalog we've stopped selling many years ago but continue supporting (and even updating to new data formats). We just didn't see enough benefit in it to continue selling handwriting with ExtA support to new customers.

Going forward, we'd eventually like to offer our own HWR models with even greater character set support, and if there's a lot of demand then we're happy to look into licensing a model with expanded support where the price is more reasonable or where we can pay per-copy royalties, but the nearer-term alternative is that 4.0 is shipping with a multi-component search feature as an alternative to radical input and that has much greater character set support (we've only deployed extension A in the 4.0 betas so far, but the data we have goes all the way through extension F - almost 89,000 characters).

TSeral · Jun 25, 2025

Hello, thank you for your work!
I'm using the OCR screen reader to read Chinese books. In the old version, I had the background and characters set to transparent. In the new version, that does not seem possible. I find that rather inconvenient, since pleco occasionally misidentifies characters. Before, I could still see the real character, and either read it myself or look it up by hand. But now I just see the misidentified one. Please, could you reintroduce the transparency, or, if it's still hidden somewhere, tell me how to get it back?

mikelove · Jun 25, 2025

You can get it back via Settings / OCR / 'use legacy interface.'

The change is basically because modern OCR algorithms aren't very good at pinpointing the locations of the characters they recognize; they're a lot more accurate, which is why we made the switch, but if we tried to place characters on a transparent background with the raw locations we get from the OCR algorithm they'd be so far off as to make the result unreadable.

benreynwar · Jul 6, 2025

I'm also having this same issue with the new OCR interface. I use the OCR just to add a tap-to-lookup for any webpages or books that I'm reading. With the old system I always set the overlay to transparent, so the only effect was that I could tap on a word to get the definition. It worked pretty well, although every now and then it would misidentify a character and I'd have to enter it into the dictionary manually. No big deal.
With new interface it seems like it's not possible to make the overlay invisible. Now when characters are misidentified they hide the original characters. This makes the OCR quite unusable for reading anything.
The "Use Legacy Interface" seems like the only solution.
P.S. Thanks for developing and maintaining Pleco! I've been using it for years and it's been a huge help learning Chinese.

mikelove · Jul 6, 2025

Thanks - at some point we'd like to go back and figure out a way to restore that sort of functionality (e.g. by running a second pass with an older recognizer to try to get locations more accurately), but basically we were in a situation where we had to do *something* to shore up OCR on Android - we were getting a LOT of complaints - but didn't have time for a more comprehensive rewrite because we're still working on finishing 4.0. So now new users have something that's reasonably competitive in terms of accuracy etc, but users who were happy with the old system can continue on with the same thing as before. (and indeed the complaints have now stopped and OCR sales are up a good bit)

As far as making the overlay disappear, for still OCR it should disappear as soon as you start scrolling; we don't currently have anything like that for screen OCR because we assumed that most people wouldn't keep it open very long since it blocks the rest of the UI, but we could look into adding a toggle option for people who want to use it the way you are where they pull it up for an entire page and leave it open.

alex_hk90 · Jul 12, 2025

On the topic of OCR (on Android), I hadn't used it for a while and was surprised by the new UI - thanks for keeping the option to use the "Legacy OCR interface" as I find it much easier to use (as you can see whether you have lined up all the characters correctly before pausing).

mikelove · Jul 12, 2025

No problem - again, happy to keep it for old users, we just needed to move on to something more current in order to stay competitive accuracy-wise and that necessitated getting rid of overlays because we can't position characters in them accurately enough.

alex_hk90 · Jul 14, 2025

mikelove said:
No problem - again, happy to keep it for old users, we just needed to move on to something more current in order to stay competitive accuracy-wise and that necessitated getting rid of overlays because we can't position characters in them accurately enough.

I wonder if there is some alternative UX that could work where it doesn't try to position the characters over the screen but you can still see all the characters that have been recognised in the box, like by having the recognised characters show in a box offset above or below the drawn box?

mikelove · Jul 14, 2025

Isn't this more-or-less what we're doing with the new live OCR? Show the characters above the box?

alex_hk90 · Jul 15, 2025

mikelove said:
Isn't this more-or-less what we're doing with the new live OCR? Show the characters above the box?

The difference is that the new UI only shows the current word/definition above the box, whereas the pervious UI showed all the recognised words/characters within the box. If the new UI similarly showed all the words/characters above the box then I would use it instead of the old UI.

New Beta 3.2.98

Member

皇帝

Member

皇帝

Member

Attachments

皇帝

皇帝

Member

皇帝

Member

皇帝

Member

皇帝

Member

皇帝

状元

皇帝

状元

皇帝

状元