FTS5 CJK Support (ICU or other options)? #3560
Replies: 2 comments 5 replies
-
|
Hi!
I'm also wondering where that came from. There are ICU extensions for SQLite, but they're meant to add locale-aware implementations of functions like So it's definitely possible to build SQLite with extensions linked into it (and I'm happy to help with more pointers if you need that, unfortunately it requires a custom build at the moment). But I'm not sure if the ICU extensions from the SQLite project would be helpful for better tokenization here. |
Beta Was this translation helpful? Give feedback.
-
|
Hello! I find it interesting that ICU support does not come out of the box. Is there any reason? I'm making an application that should support searching for a pattern in a TEXT column of a table. However, the ASCII only implementation of the LIKE operator means that I have to do the search in the application layer, which seems counter-intuitive, especially since a really common use-case for Drift would be in a Flutter application. Curiously, Is it typical to avoid string pattern matching in SQLite, and instead opt to do it in the application? For reference, I think ICU (or something of the like) is automatically enabled in CPython and since I had only used it there, I struggled until I found out about how it defaults to only ASCII in general. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Heya, thanks again for all your fantastic work with this library and speaking to us at Superlist!
I had a question: I'm looking at adding a full search index to our app. I was interested in trying the ICU tokenizer for FTS5. I saw in a comment that ICU support might be interesting, but I did not see it listed on the official extensions page.
If we wanted to try ICU tokenization, what would you recommendation be? Should we follow the examples in https://github.com/simolus3/sqlite3.dart/tree/e682c9792cc1ea094281a7d2edfc7db5ca802b05/sqlite3 to compile a version with ICU support?
Thanks again,
Brian
Beta Was this translation helpful? Give feedback.
All reactions