* Libmobi: C library for handling one specific vendor's eBook formats
If this let you at least ingest basically any eBook format, I'd be super interested. Unfortunately, it does not.
For example, literally the only (OSS) tool I've found that can parse .lrf files is calibre. There are zero other libraries or tools that let me extract the text content from them. MS's .lit files have similar issues; at least, there are no Python libraries for them.
Outside of epub, there's a similar story for most other file formats too.