SumatraPDF Reader

Helmut10001 · on Oct 24, 2023

SumatraPDF is among the OSS PDF tools I use.

Since Adobe is pushing a more aggressive stance for monetization of Acrobat, I am trying to replace selected PDF workflows with OSS. Here are some of the tools I use.

    qpdf
        removing passwords, unlocking PDFs, conversion
        install in WSL with apt-get install qpdf
        remove password with qpdf --decrypt --password="" input.pdf output.pdf
    PDF4QT - Open Source PDF Editing
        Deleting, Sorting, Extracting Pages
        Currently, no choco release available, must be installed manually from PDF4QT/releases
    Inkscape, LibreOffice Draw
        editing PDFs, adding text
    Mupdf
        Command line tool and Python package for parsing, filling forms, adding text
    SumatraPDF
        Viewing of PDFs
    pdfplumber
        Awesome python package to extract tables from PDFs into data pipelines. Use with Jupyter Lab

dustypotato · on Oct 24, 2023

FYI, you can use firefox for viewing,signing, and adding text to PDFs. You can also use it to remove password (just do print to PDF after unlocking it).

rstuart4133 · on Oct 24, 2023

> you can use firefox for viewing,signing

I got all excited - then realised "signing" just means inserting a picture. Notably absent are open source tools for digitally signing and verifying PDF's. Apparently pdftk does it in a paid version.

It's funny in a way - in this thread we have people wanting ways to modify a PDF. Yet to me, being any to prove it's not modified (eg, it's statement provably issued by some bank saying they transferred funds to my bank on behalf of person XYZ) is far more important. Instead we have companies offering paid "document signing services" which are built on sand - you can easily forge / modify any signed document they issue.

ajyotirmay · on Oct 26, 2023

Okular. Okular offers digital pdf signing

beagle3 · on Oct 24, 2023

At least as of Firefox 109, support for non Latin languages was broken to the point of being completely unusable.

xattt · on Oct 24, 2023

128 characters ought to be enough for everybody.

GoblinSlayer · on Oct 24, 2023

Yep, it's codepage 1276 https://en.wikipedia.org/wiki/Adobe_StandardEncoding

mkl · on Oct 24, 2023

PDFTK and pdfjam are two other useful command line tools. I use PDFTK for merging PDFs, extracting/deleting/duplicating pages, and decompressing so I can extract and manipulate text/data in raw PDF commands. I use pdfjam for n-up and adjusting page size and margins.

djbusby · on Oct 24, 2023

PDFTK can choke on some merges, some newish pdf features. In those cases one can use Ghostscript to merge and other manipulation.

PhilippGille · on Oct 24, 2023

You mention qpdf not available in Chocolatey, but it's available in Scoop, which is another Windows package manager: https://github.com/ScoopInstaller/Main/blob/master/bucket/qp...

SushiHippie · on Oct 24, 2023

For other distributions / OS / package managers see here: https://repology.org/project/qpdf/versions

systems · on Oct 24, 2023

seem to be available on winget too

   > winget search qpdf
   Name Id        Version Source
   ------------------------------
   QPDF QPDF.QPDF 11.6.3  winget

Steve44 · on Oct 24, 2023

For extracting to tables I've been using http://tabula.technology/ for a couple of years. It seems to do a pretty good job even with some fairly complex tables and I've not had any problems with it.

Helmut10001 · on Oct 24, 2023

Yes, tabula is the other table extraction tool. I used both and prefer pdfplumber because it is really robust and works well with Jupyter Lab.

silicon_laser · on Oct 25, 2023

you can also use okular for visually select a table from a pdf and paste it in sone excel kinda software and okulars table select tool is not too bad

quyleanh · on Oct 24, 2023

Actually SumatraPDF is using MuPDF now. But there is some limitation on rendering PDF and eBook files. For example, formatting PDF file or displaying Unicode characters in epub file.

sebras · on Oct 24, 2023

Do you mind reporting those issues either to SumatraPDF at https://github.com/sumatrapdfreader/sumatrapdf/issues or directly to MuPDF at https://bugs.ghostscript.com/ if it also has the same issue? Thank you!

There are many wonderfully weird PDFs and epubs out there, but we do our best to fix issues. :)

quyleanh · on Oct 25, 2023

Thank you for reaching out. Please refer to the following issue list from SumatraPDF repo [1] [2]. Epub rendering issue like [3] [4].

[1] https://github.com/sumatrapdfreader/sumatrapdf/issues?q=is%3...

[2] https://github.com/sumatrapdfreader/sumatrapdf/issues?q=is%3...

[3] https://github.com/sumatrapdfreader/sumatrapdf/issues/2752

[4] https://github.com/sumatrapdfreader/sumatrapdf/issues/3761

acqq · on Oct 24, 2023

I like k2pdfopt for reformatting pdfs for my e-reader.

I've also used poppler's pdfimages but I'd prefer like something less buggy for my use case; any version I've tried had problem with one pdf made by Adobe InDesign.

Also, tesseract allows creation of a pdf from the images with the embedded OCR text. It is also built in in the k2pdfopt.

qwerty456127 · on Oct 24, 2023

Also take a look at Okular. It's the only PDF reader I've seen to let you select a specific column in a table.

deepspace · on Oct 24, 2023

Okular is my go-to document reader across operating systems. In addition to PDF, it can open EPub, DjVU, JPEG, PNG, GIF, Tiff, WebP, CBR, CBZ, DVI, XPS, ODT and other formats.

nolok · on Oct 24, 2023

May I recommend NAPS2 ? https://www.naps2.com/

It's like PDF4QT but works and feel better to me.

xinayder · on Oct 24, 2023

You can also use Okular to open and edit PDFs, it's the document viewer from KDE.

nirav72 · on Oct 25, 2023

Adobe Acrobat reader installer is also almost a 1 gb download these days. One thing I do find that Acrobat does better is compression. I can usually reduce a PDF down to about 30%-40% of its original size without much loss in quality. I've tried other tools and they haven't worked nearly as well.

ed_balls · on Oct 24, 2023

META: What's the best way to convert pdfs to CSV/excel? Any new LLM tool?

Helmut10001 · on Oct 24, 2023

I think the best way to convert pdfs to tabular data is using pdfplumber together with a pandas (dataframes) workflow and writing results to CSV.

emeril · on Oct 24, 2023

abbyy finereader 8.0(!!!)

good luck finding it

justsomehnguy · on Oct 24, 2023

It's funny what you are downvoted, but FR8 was way better OCRing Office-printer-scanned documents even against the much later versions of FR, I saw the comparison on the same source documents.

> good luck finding it

It's still available on the trackers.

enthdegree · on Oct 24, 2023

This is a really obscure recommendation. Can anyone who knows about the specifics expound?

MilStdJunkie · on Oct 24, 2023

Tabula

georges_gomes · on Oct 25, 2023

To edit PDFs in Figma, it exists pdf.to.design now. https://pdf.to.design

tjrgergw · on Oct 24, 2023

I use xournal to add signatures (just pngs) and text to pdfs.

magical_spell · on Oct 24, 2023

Pdf-tools in Emacs is also great.

ParetoOptimal · on Oct 26, 2023

And it has a killer feature:

pdf-view-themed-minor-mode

It matches the pdf style/colors to your emacs theme! Sort of like a dark reader for pdfs, but it automatically adjusts to any theme based on some good but likely imperfect heuristics.

dustingetz · on Oct 24, 2023

how can i sign and date a PDF with a bitmap signature without 9999 janky clicks

commandersaki · on Oct 24, 2023

Great retrospective of SumatraPDF from the author: https://blog.kowalczyk.info/article/2f72237a4230410a888acbfc... .

mvonballmo · on Oct 24, 2023

From the retrospective.

> And yet I do know that you can write complex, relatively bug free code without tests, because I did it.

> I do know that you can write complex, relatively bug free code without anyone looking over your code, because I did it.

> If no one uses your app then who cares if it crashes.

> If many people use your app and it crashes, they’ll tell you and then you’ll fix it.

Those four statements are contradictory. What they're saying is not that you don't need testing or code reviews, but that you can get your users to test for you.

I figure the author probably does test their code (everybody tests, even if that just means running the app), but not rigorously or in a way that you could say gives one the security of regression tests.

No-one worth discussing the issue with claims that it's impossible to write complex code without automated testing. I'm a huge proponent of automated testing, and I wrote a relatively large, cross-platform renderer without a single automated test back in the late 90s/early 00s ... it just took a long time, and I became increasingly terrified of making changes.

Edited for formatting.

kjksf · on Oct 24, 2023

(I wrote that blog post).

What I was trying to say is: there's dogma about tests and code reviews.

At Google you would get fired for suggesting skipping code review.

Even at smaller Silicon Valley companies (smaller == less than 10 devs) it's unthinkable to not do code reviews. I haven't worked outside SV so it might be different.

That's the dogma.

My point is that maybe we should apply a bit of common sense on top of that.

I'm not saying Google should stop doing code reviews - the cost (to Google) of google search breaking is so high that you do 100x more than just code reviews.

But maybe those smaller companies don't need to dogmatically review the checkin for a documentation fix.

VHRanger · on Oct 24, 2023

The problem with tests is the TDD dogma, which wastes time and makes code harder to change (because even reasonable changes break a bunch of tests).

There's a good rule of testing top level behaviors described in this talk [1]

For code reviews, it's about knowledge handoff. No one disputes you can write great code alone. The problem is that singular geniuses writing functional but unmaintainable code only they understand and then getting hit by a bus or changing jobs is a real issue.

1. https://m.youtube.com/watch?v=EZ05e7EMOLM

RNAlfons · on Oct 24, 2023

I work in Health IT here in Germany and for the past 3 years we've been "testing" those "smaller companies" for different parts of our business.

It's a mess. We've been paying them serious money for a product. We've never been warned that their product isn't finished yet or that we're the beta testers for the product they'll sell to other clients. Or that we have to invest our own personal and their time to fix their problems and talk to their useless support.

This has become a pattern and I'm done with it. We are slowly moving back to older and larger companies who actually do their work properly before they roll out products and updates.

noAnswer · on Oct 25, 2023

What "older and larger" company do you have in mind? What ever you do, never choose CGM! They wouldn't even be able to spell "test" if their live depended on it! Nothing new ever works, like at all. Everything older needs at least one or two server restarts a day!

RNAlfons · on Oct 25, 2023

medavis for example.

I know CGM from medico...their KIS is a nightmare. We have to communicate with it in hospitals. What an ugly monster and somehow no hospital IT is able to admin it properly.

cstork12 · on Oct 25, 2023

Wow, didn't expect to see medavis mentioned here on HN. I'm currently writing Data Warehouse software (and more) interfacing with their RIS. However, I don't really know what their testing practices are.

Which of their products are you considering?

RNAlfons · on Oct 26, 2023

Right now we switched back from DoctoLib to booking4med but we're using all kinds of RIS modules here.

What Data Warehouse software do you write for RIS? Maybe I can use it :D

...and yeah Radiology communities are rare. I'm still looking for one...if I have time.

cstork12 · on Nov 2, 2023

Interesting! From where I'm sitting DoctoLib seems to win over the market.

The DWH software writes snapshots of db_direct into a temporal DB (implemented in Postgres using multiranges) and then uses dbt to transform the data into usable tables. Right now, I use Power BI for visualisation and reporting.

RNAlfons · on Nov 7, 2023

Yes they are since Corona unfortunately.

However it's good for usual Doctors offices. It's terrible for Radiology Planning. Also many institutions just put a link to their homepage or phone number in there. This way they're on DL but don't have to deal with the calendar.

wolverine876 · on Oct 24, 2023

> My point is that maybe we should apply a bit of common sense on top of that.

...

> But maybe those smaller companies don't need to dogmatically review the checkin for a documentation fix.

It's not dogma, it's just the necessities of large groups of people working together. A small organization can use common sense, a function that scales up to (20? 50?). 10,000 people can't operate on common sense; they need another function: rules.

wg0 · on Oct 24, 2023

Now I'm ready for massive downvotes here but hear me out.

Much of our professional habits are part of the corporate chains which is optimised to deliver and squeeze as much as possible.

Software developed in the wild does not have those corporate obligations and the sole purpose is to enjoy the process, the sheer joy of creating something. Of programming as a creative medium.

You don't get your paintings code reviewed. It's just that artistry. You like it, then you like it, end of the story, you're not playing for the gallery.

Corporate enslavement works differently. It has moved and distributed the part of the factory shift in charge to the dude sitting next to you, cleverly. Many are just complaining to make sure they're considered the quality sensitive cooperate loyals.

You two might like different pigments for the grass and he'll strike down your painting with a red ballpoint if not to his taste.

Happens to all of us and if not, wait for it.

Declaring a single boolean flag in a corporate environment might cost more then an hour to get to a consensus because one I-am-dffierent-I-care-too-much guy has some objection about some ambiguity in the flag name in some far future and has now swayed roughly half the team on his side.

That doesn't exist in open source. Open source is all about anti status quo. It is pure rebillion. It started that way, it is about hippies and naysayers. The very root of the GNU toolchain, Herd etc are probably there.

EDIT: Typos + corporate software development environment.

al_be_back · on Oct 24, 2023

I'm with the author (op) on this, unless it's critical code, launching a buggy project that gets some use & feedback is way better than holding-on for perfection and face certain project failure.

seek forgiveness rather than permission - gets you launched - gets you better

naasking · on Oct 24, 2023

> I became increasingly terrified of making changes.

That's the main value of automated tests.

TehShrike · on Oct 24, 2023

I rarely feel like tests make it easier for me to make changes later. Types do give me that feeling pretty reliably, though.

naasking · on Oct 24, 2023

Agreed that types are better, but good test suites contain plenty of reminders of corner cases that you probably forgot to consider during your refactor.

arp242 · on Oct 24, 2023

Lessons learned from 15 years of SumatraPDF, an open source Windows app - https://news.ycombinator.com/item?id=27968900 - Jul 2021 (133 comments)

Lessons learned from 15 years of SumatraPDF, an open source Windows app (2021) - https://news.ycombinator.com/item?id=35065785 - Mar 2023 (173 comments)

zerr · on Oct 24, 2023

> The problem is that Gtk is ugly, Qt is extremely bloated and WxWidgets barely works.

Seems like the author didn't look at cross-platform toolkits since 90s.

kjksf · on Oct 24, 2023

(I wrote the post).

That's fair in the sense that I did not look closely at latest Gtk or Qt or WxWidgets.

That being said they certainly did not get lighter.

Last I checked Qt was over 10 MB of libraries). Sumatra is 12 MB and I'm guessing over 8 MB is fonts needed to render PDF documents.

So just Gtk or Qt code would be more than the whole app.

Latest Gtk4 does seem to look nice so maybe calling it ugly was uncalled for.

signaru · on Oct 24, 2023

The reason I never had to install Adobe Reader for more than a decade now is that Sumatra was/is such a tiny install. Thanks!

Edit: I just checked and Acrobat Reader requires 450/900/380 MB for Win32/Win64/Mac respectively [1]. One might argue that it does more than just read PDFs. But in many cases, reading PDFs is all that I need.

[1] https://helpx.adobe.com/reader/system-requirements.html

a1o · on Oct 24, 2023

Thanks for caring about file size!

zerr · on Oct 24, 2023

If you are counting MBs at that order, then wxWidgets would be the best option. It can be statically linked easily. The size overhead is about 2.5-3 MB. It is a thin wrapper on top of native controls, so e.g. you can always get to the underlying HWND on Windows. Even for the Windows only app, I'd still pick it due to the sane API.

aidenn0 · on Oct 24, 2023

What about Tk?

pjerem · on Oct 24, 2023

What changed exactly ? Gtk is still ugly on anything that is not Linux, Qt is still bloated and wxWidgets, well here I don’t know.

OvbiousError · on Oct 24, 2023

What does "Qt is bloated" even mean. It's a big framework, split into separate libraries, you include only what you use. Qt is free (most parts are LGPL) and is developed by 100+ professional developers. The quality of the implementations, API and documentation is very very good.

Discarding it out of hand by "qt is bloated" just feels disingenuous to me. You can add to that the fact that qml on the desktop if finally maturing into a viable alternative for widgets, and UI development with qml is such a breath of fresh air.

dspillett · on Oct 24, 2023

> What does "Qt is bloated" even mean.

From a user perspective I remember that being a big thing some time ago when people didn't have anything already using Qt installed did an “apt install” or “yum install” on something that did and saw the small tool they were wanting was going to drag half a desktop environment in with it as dependencies. The same could likely be said for GTK in reverse, I'm not sure what their relative sizes for similar features are these days.

Some use bloated to mean the memory footprint. IIRC GTK has more of a reputation for eating RAM than Qt, but again maybe people notice a single Qt app using a lot of resource (that would be shared if running multiple apps against the same libs) when it is the only one they run.

As you suggest, just stating that “<whatever> is bloated” without reference to some details of what is meant by that, sounds a bit like someone parroting old information and/or group-think rather than having looked into it recently.

Having said that the author has a minimal dependency stance in order to try to maintain a small footprint for the app (“I avoid unnecessary abstractions.” in the section about keeping things small) so any framework that isn't little more than a cosmetic wrapper could legitimately be called more bloated than using nothing at all and talking more directly to the standard OS libs. Also in context (discussing why the product is not cross-platform and is never likely to be) this is not the only reason being given and probably not the most significant one (there may be significant selection of cross-platform issues beyond the UI framework).

The key to a lot of what is in that document is the “It’s my project and I act like it” part. All too often we forget this very important side of things, especially with one-man or small-team projects, and people comment on project decisions as if using the product gives some automatic expectation that the creator will mould it around the needs/wants of a given user or someone's idea of “the community”. For an open source project the community has the option of forking the project or offering to fund the changes they want that aren't otherwise on the creator's roadmap (though obviously the larger the project, the less practical these options may be)…

LinAGKar · on Oct 24, 2023

Sumatra is know for being small, with the portable version currently being 15.3 MiB. QtCore and QtGui together is some 11 MiB (checking the DLLs used by KDE programs on Windows), so those alone would increase the size significantly.

KRAKRISMOTT · on Oct 24, 2023

Flutter is decent, most of the browser/Skia based UI kits are more consistent and made by people who have an appreciation for design and aesthetics. But I think Sumatra predates most, if not all of them.

Sakos · on Oct 24, 2023

> appreciation for design and aesthetics

Sure, but not accessibility or UX design. There's more to a good UI than looking pleasant. Far more. I wish we hadn't unlearned that in the past decade, because Flutter and browser-based UI kits throw all of that out the window.

whizzter · on Oct 24, 2023

Honestly, this is a guy who uses GDI, so most of the above options will feel bloated already when looking at the source tarball sizes.

That said, has there been any fundamentally groundbreaking crossplatform classic UI toolkits released since the 90s? (IMGui is the only one that has seemed interesting but that's specialized and not a general one really)

rizky05 · on Oct 24, 2023

There are plethora of cross-platform UI toolkit, each have their own philosophy. IMO, current popular toolkit have declarative aspect in it. Believe or not the most mature UI toolkit is the one built for chromium. It just works everywhere.

EDIT: Another commenter suggest only office as example. IIRC, they are building it using chromium as front-end and .NET as backend.

whizzter · on Oct 24, 2023

I think that's the sticking point, declarative toolkits didn't mesh too well with C++(or C) projects so we have a bunch in Rust but that's not too interesting for those with existing codebases, I started on a C++ 20 prototype myself a few years back (because to make it even remotely elegant I felt that I wanted designated initializers from C++20) but it kinda turned into a huge hairball of templates to even approach something like react style jsx/tsx rendering.

I'm pretty sure that something based on IMGui could be more or less isomorphous to React style rendering, it'd be up to someone to implement it though (and it'd probably be worth it since building applications at scale you win back a lot of time by not fiddling with state manually all over the place).

Using Chromium however is explicitly not where we should want to go (it's basically a kitchen sink in itself), but we go there anyhow (me included often) because it's just so much more quick thanks to progress in dev experience in the webdev area.

_wf2l · on Oct 25, 2023

if by plethora you mean different web implementations, sure. but there has basically been less than a handful cross platform UI toolkits since GTK/QT

jcparkyn · on Oct 24, 2023

Interesting read. I was very surprised by the mention of adding editing features at the end. That sounds like a proverbial black hole of potential feature requests and "bloat".

ykonstant · on Oct 24, 2023

I implore all developers of PDF readers to implement sioyek's overview feature[0]. When you hover on a cross-referenced entry, it opens a little preview window with the contents of the reference. It is an absolute game-changer for reading textbooks and technical papers; I cannot overstate its utility.

[0] https://github.com/ahrm/sioyek#overview

kjksf · on Oct 24, 2023

(I wrote SumatraPDF).

I know about sioyek and been meaning to steal some of their ideas.

Maybe in the next version.

wolverine876 · on Oct 24, 2023

> When you hover on a cross-referenced entry, it opens a little preview window with the contents of the reference.

Wikipedia does the same thing, but I assume you would know that. What am I misunderstanding?

Also, if it makes your PDF reader execute arbitrary remote code, isn't that a serious risk?

drannex · on Oct 24, 2023

I don't believe it's a risk, since you're not injecting arbitrary remote code into a document, you're more overlaying on top of it. Even if the reader wants to execute code, then it will, that's how you have a reader to begin with. As long as it's not the PDF itself (source document) deciding to execute inline code, you should be fine.

wolverine876 · on Oct 24, 2023

What does it matter if it's displayed in the document or in an overlay window or if it's saved in the document or not? That code could try to do anything; many PDF readers have a security option to prevent opening any remote documents or links.

joveian · on Oct 25, 2023

Why do you think there is remote code execution? Without being familiar with that code (or evince code, which does the same thing) I can almost guarantee there is no such thing (what even would they be executing and why?).

wolverine876 · on Oct 25, 2023

How is the remote reference rendered, other than by executing a renderer for that format in a window? Renderers for web pages, for example, are very powerful platforms with a long history of security issues.

hju22_-3 · on Oct 25, 2023

I think you've misunderstood the feature. It shows an overlay window of another part of the same PDF you're already in. It doesn't open another PDF or a website or anything, it's just a small preview of another page essentially. You could just as easily scroll to where the reference is placed in the PDF, but this lets you quickly preview the section without actually leaving your position in the document.

wolverine876 · on Oct 25, 2023

I did misunderstand; you are correct. Thanks for explaining it.

And that is indeed very useful!

esquivalience · on Oct 25, 2023

The reference being discussed seems to be another part of the same pdf.

spit2wind · on Oct 24, 2023

Is this not Ted Nelson's transclusion? Namely, "...the same content knowably in more than one place".

https://en.m.wikipedia.org/wiki/Transclusion

cs3f16 · on Oct 24, 2023

Thank you so much for sharing this extremely amazing software! I hope it gains more popularity and traction.

majora2007 · on Oct 24, 2023

This is really cool, I'm going to steal some of these ideas for my app Kavita.

V1ndaar · on Oct 24, 2023

Evince does the same. It's a great feature, I agree.

javale · on Oct 24, 2023

Thanks for sharing!!

ivraatiems · on Oct 24, 2023

Wow, I just installed this for the first time and its performance blows both Adobe Reader (not an achievement) and Foxit (something of an achievement) out of the water! Nice work by these devs. And its install footprint is around 10% of those programs.

What the hell is Adobe doing, I wonder, that makes their software so unbearably slow and painful to use?

dagw · on Oct 24, 2023

What the hell is Adobe doing

Probably supporting 100% of the PDF spec. plus addressing all those obscure feature requests that 6 companies in this one very niche industry really really need. Sumatra is fantastic and basically the only PDF reader I use on Windows, but it does have maybe 10% of the features Adobe acrobat has. It is however the 10% that that basically everybody needs.

kjksf · on Oct 24, 2023

(I wrote SumatraPDF).

I agree that Adobe has more features than SumatraPDF.

Not necessarily the PDF spec itself - SumatraPDF displays pretty much any PDF you throw at it, just way more options and stuff.

And it's not exactly slow. I'm sure the core rendering of PDF pages is same or faster.

It's just slow to start. Like very, very, sluggish slow. And it's very visible to users.

I don't think it's the features that cause the slow startup. They just don't seem to care about optimizing it.

Chrome has more features that Adobe Reader. It has video calling, a capable PDF viewer and all the other stuff in ever growing web standards.

And yet it starts up fast. Not instant as SumatraPDF but way, way faster than Adobe Reader.

I think that it's more than fair benchmark regarding complexity of the app.

The difference is that Chrome team cares about performance, including startup speed, and they spend a lot of resources on it.

I remember Chrome was counting and removing C++ static initializers from their code (the code that runs before main()) because that contributes to startup speed.

That's the level of care you need to have and I think Adobe just doesn't have it.

dharmab · on Oct 24, 2023

> SumatraPDF displays pretty much any PDF you throw at it, just way more options and stuff

Sumatra includes a 3D renderer to display embedded CAD models? https://helpx.adobe.com/acrobat/using/displaying-3d-models-p...

legends2k · on Oct 24, 2023

I understood "pretty much" as "most" but not "all". I've been viewing PDFs for 20+ years and never encountered one with 3D content so ”pretty much” seems like a good adjective to use here for me.

dharmab · on Oct 25, 2023

In engineering and manufacturing it's an essential feature used daily.

_wf2l · on Oct 25, 2023

that doesn't explain you purposefully misunderstanding people

_wf2l · on Oct 25, 2023

what did you think this gotcha was? you just look annoying and pretentious. he didn't say literally every PDF for a reason

baz00 · on Oct 24, 2023

I actually have to pay for Adobe because I need the features (signing, annotation, redaction, OCR etc). Does SumatraPDF cover those at all?

As for startup speed, it's not really an issue as Adobe just lurks there all day in the taskbar for me at least ready to roll.

kjksf · on Oct 24, 2023

In latest version you can annotate (highlight, underline etc.) and create a few other annotation types and move them.

More / better annotation editing, signing, redaction is something I want to do.

OCR - I don't have a good handle on how important that is.

solarman5000 · on Oct 25, 2023

I have to pay for about 30 adobe acrobat licenses a year

I would much rather throw this money your way, but I need those features

baz00 · on Oct 25, 2023

Thanks will look into it. Reply appreciated.

wolverine876 · on Oct 24, 2023

> SumatraPDF displays pretty much any PDF you throw at it

I often use PDF applications for documents that I want to keep for decades, including annotations I make. How much can I count on SumatraPDF, or any PDF application, outputting future compatible documents from conversions, annotations, deleting/merging/etc, content editing, etc.? Is there a difference between applications?

My instinct is to play it safe and use Adobe, figuring whatever they do is the de facto standard. But I strongly dislike the applications and all the privacy invasions they impose. (Yes, I'm aware of PDF/A; I'm talking about applications' outputs and not the standards.)

ivraatiems · on Oct 25, 2023

Thank you for responding and congratulations on the success of SumatraPDF, it seems well-deserved.

I will say that on older systems, Reader/Acrobat are not just slow at startup. I am writing this from a machine that has an i7-2600 and 16GB of DDR3 RAM. Reader is almost unusably slow. It's absurd.

Now where's that Mac port? /s

Xerox9213 · on Oct 24, 2023

Acrobat Pro is pretty incredible. One of my favourite features is the following: on a scanned PDF after performing OCR you can edit the text and it will match the font. As in, it will create a new font based on the characters it found in OCR.

I’m a high school math teacher and scan dozens of textbooks every year. Adjusting a few words before printing to match what we did in class is a huge time saver for me.

Somehow my school division was able to buy me a one time fee perpetual license. I’m very happy with it.

soco · on Oct 24, 2023

For me even Sumatra or Foxit have too many features. I only ever open PDFs to read and print, maybe zoom, but all those other buttons there only distract me - if there'd only be a way to hide them... But yeah first world problems. I'm happy they exist.

sundarurfriend · on Oct 24, 2023

Have you tried Zathura: https://pwmt.org/projects/zathura/index.html ?

It looks like it's Linux only (I only have Linux so I hadn't checked before), but when I want to sit down and properly read something, the keyboard-centric UI and minimalism make it a really smooth and frictionless experience.

dizhn · on Oct 24, 2023

On Android mupdf mini gives me exactly that with a few implicit features that makes reading pdf books on mobile be 'ok'.

There's a mupdf windows build. Perhaps that will do what you want.

(Mupdf is also a library so there are a lot of different programs with the name in it by various programmers.)

SushiHippie · on Oct 24, 2023

Well there are always the stripped down/simplified gtk/gnome versions

https://wiki.gnome.org/Apps/Evince

yoyohello13 · on Oct 24, 2023

Even more impressive is I believe it’s a solo dev. SumatraPDF is one of my go to examples of a great software project. Something to aspire to.

tayo42 · on Oct 24, 2023

> What the hell is Adobe doing

Maybe they're handling every posts feature wish list?

Like every comment so far here is Sumatra is nice but...< Some random feature > is missing

seb1204 · on Oct 24, 2023

PDF-exchange - I have been using this for many years in a paid version. Fast and good editing features e.g. mark-ups. https://pdf-xchange.eu/pdf-xchange-editor/index.htm

CodeCompost · on Oct 24, 2023

I got burned by Foxit when they started shipping spyware with it, so I'm hesitant now to try anything else.

wolverine876 · on Oct 24, 2023

Where can I learn about that?

CodeCompost · on Oct 27, 2023

https://en.wikipedia.org/wiki/Foxit_PDF_Reader#Issues

supertrope · on Oct 24, 2023

https://old.reddit.com/r/geek/comments/ddh5p/comment/c0zfics...

MilStdJunkie · on Oct 24, 2023

Rolling around in money from all the licenses, I would imagine.

user3939382 · on Oct 24, 2023

Bless the heart of whoever looked at the PDF spec and said to themselves, "Nice, I'd like to writer a parser for this."

kjksf · on Oct 24, 2023

(I wrote SumatraPDF).

In fairness, I didn't write the PDF rendering. That is indeed quite a tall order.

I used to use poppler and switched to mupdf (was more active at the time, poppler seems to have picked up pace since).

The core PDF feature set isn't that bad to implement.

From what I've seen, the bad / complex parts are:

- stuff they added years later, like some XML stuff (of course you had to add XML in 2000!), JavaScript in forms

- some more complex vector graphics features like masking with vectors, support for bunch of color spaces, cmyk separation

- font handling, text rendering is surprisingly complex

- rendering fast even if PDF was badly created

- PDF is easy to screw up when you create it and boy, do people screw up in every imaginable way. You can't just say "it's bad PDF" when Adobe or Chrome opens it so a lot of effort by mupdf devs is adding heuristics to show even broken PDF docs

omginternets · on Oct 24, 2023

When I bought my first computer in 2005, I discovered SumatraPDF and good lord it was miles better than the bloated alternatives. In particular, it was lighter and faster than everything else, which for someone like me who had to buy used hardware, was a godsend. So thanks for that!

user3939382 · on Oct 24, 2023

That's an awesome insight that was clearly hard-earned. SumatraPDF is awesome, thank you very much for all your hard work on it.

viraptor · on Oct 24, 2023

Even once you have a parser itself, actually figuring out what to display and where is... interesting. Especially in generated rather than hand-created documents. What's the element's position? Grab your math library, we're multiplying matrices! What does this text say? Let's write another parser for the table of very custom codepoints!

VMG · on Oct 24, 2023

mapping coordinates via projection matrices pretty common tbh

viraptor · on Oct 24, 2023

If you change/transform then on the fly - sure. If they're on a final, not-designed-as-editable format... Not that common. I don't really see any reason (beyond PS origins and its historical usage) for PDFs to not flatten the final positions/sizes.

hu3 · on Oct 24, 2023

No joke. Yesterday I found out PDFs can have forms with JavaScript.

https://tcpdf.org/examples/example_014/

How does this even work?

viraptor · on Oct 24, 2023

I hope I can amuse you further: PDF embedded 3d models https://helpx.adobe.com/mt/acrobat/using/displaying-3d-model... and interaction https://www.astrobetter.com/blog/2012/03/07/tutorial-for-emb...

You can also do animated page transitions like PowerPoint, but I don't have the right link available...

hu3 · on Oct 24, 2023

You did amuse me! Thanks!

To me this is the kind of scope creep that causes a "simple" PDF renderer department to end up with 50 devs working in it full-time.

I wonder... can a PDF have an iframe that opens itself? Causing an infinie loop of it loading itself?

izacus · on Oct 24, 2023

In reality most PDF apps don't support those elements (even software like macOS Preview) and people get along just fine.

bluish29 · on Oct 24, 2023

That's a nice idea. Can we call that PDF bomb?

MilStdJunkie · on Oct 24, 2023

Please please don't mention 3d pdfs. Please. I had to implement this. 3d manipulation in a block inside of a print format that might make deliverables in the hundreds of gigs. It's . . it's one of the dumbest functional requirements I've ever seen, and the fact it exists. . I'm sorry, I have to go be by myself for a while.

somat · on Oct 24, 2023

The stupidest part of the whole thing is that pdf is basically a neutered postscript. The problem with postscript as a document format, is that there is no good way to do metadata, jump to a specific page, count pages, etc. So pdf uses the rendering engine of postscript with all that annoying turing complete behavior torn out. Then at some point they wanted some computational capability in the document[1], but instead of reintroducing postscript into the mix they went with a third language, a wierd poorly designed one invent for browser scripting.

1. Yes we all know how stupid this was. but they wanted fillable forms and validating those forms made sense at the time. Really it was because they were trying to compete with the web.

baal80spam · on Oct 24, 2023

I remember someone wrote a game in a PDF document...

temny · on Oct 24, 2023

I think this is it: https://github.com/osnr/horrifying-pdf-experiments

lost_tourist · on Oct 24, 2023

because some viewers have built in javascript interpreters. In the pdf it's just text labelled as a script

jimjimjim · on Oct 24, 2023

badly

izacus · on Oct 24, 2023

Having done that, PDF spec is in many ways much saner and better designed than plenty of "APIs" modern JS jockeys create.

The original spec is a bit ugly because they were saving bytes in the format (using things like single letters for dictionary keys), but some things are actually quite well thought out (Appearance Streams are great for forward and backward compatibility and are probably no. 1 reason why nothing managed to replace PDF.)

wolverine876 · on Oct 24, 2023

> Appearance Streams are great for forward and backward compatibility and are probably no. 1 reason why nothing managed to replace PDF.

I've been wondering why they took that approach. Do you know their original reasoning? And how does it help compatibility?

mkl · on Oct 24, 2023

I think writing a parser for in-spec PDF files probably isn't too hard (though writing a complete renderer and interface certainly is), but many PDF files don't match the spec, so your parser has to be tolerant of invalid PDF files, because Adobe's is.

grishka · on Oct 24, 2023

...in a memory-unsafe language!

isatty · on Oct 24, 2023

Why is this comment under every post? Low quality bait.

quyleanh · on Oct 24, 2023

The big update of SumatraPDF was out yesterday [1]. There are a lot of bugs fix and improvement in the backlog [2].

[1] https://github.com/sumatrapdfreader/sumatrapdf/releases/tag/...

[2] https://github.com/sumatrapdfreader/sumatrapdf/issues/3672

puika · on Oct 24, 2023

this is my "dark mode" via advanced options, e.g. warmer page and dark background:

  MainWindowBackground = #191919
  FixedPageUI [
   TextColor = #282828
   BackgroundColor = #ebdbb2
   SelectionColor = #2d938f
   ...
  ]

what I have been doing so far is switch between other modes with autohotkey by overwriting `SumatraPDF-settings.txt`. I'd share the little script but it suddenly broke a while back

lencastre · on Oct 24, 2023

Dark mode and tab switching with the keyboard are 2 great additions to the best lightweight PDF reader EVER!!!

nuxi · on Oct 24, 2023

Tab switching with the keyboard was already possible previously, IIRC you can use Alt+<N> (or maybe Ctrl+<N>) to switch to the N-th tab.

puika · on Oct 24, 2023

yes, CTRL+TAB and ALT+<N>, I use them all the time.

Sakos · on Oct 24, 2023

Did they ever fix the issues around printing? I always ended up using a different PDF viewer just because of that.

kjksf · on Oct 24, 2023

(I wrote SumatraPDF).

Not yet but I'm thinking about how to improve it. Maybe next version.

Sakos · on Oct 24, 2023

It's such a good PDF viewer otherwise. I hate having to use something else.

mattegan · on Oct 24, 2023

Like others have mentioned, Sumatra is one of a few Windows-only utilities that I routinely miss when on Mac or Linux, primarily due to two simple interactions which I miss every day viewing schematics, mechanical/technical drawings or datasheets -- Alt + Scroll == Zoom and Right Click + Drag == Pan.

Does anyone know of any viewers on Mac or Linux that provide these two features? Skim on Mac implements Option + Scroll and Left Click + Drag Pan, but it's not reconfigurable to any other keys or mouse buttons.

vladvasiliu · on Oct 24, 2023

Zathura does that under Linux, with the difference that zoom is achieved with Ctrl instead of Alt. Right-Click dragging = pan.

One feature I absolutely love is that Page Down goes to the top of the next page. It's very practical when you want to skim something quickly, with a zoom level that doesn't fit a page size perfectly.

plumeria · on Oct 24, 2023

I really like Okular (especially with the theme that allows me to read PDFs with dark red background and yellow text), but haven't been able to run it on my Mac Mini with Apple silicon. The brew formula appears to be broken for newer macs.

TylerE · on Oct 24, 2023

FoxIt does zoom on Ctrl+Scroll, and Pan on Left Click+Drag. Runs on Mac and I'd assume linux.

Close enough?

I assume if you REALLY want to go nuclear on it, there is some shareware app that will let you do per-app keyboard emulation and rebind inputs "in flight" or something.

I believe https://www.keyboardmaestro.com/main/ is the standard solution.

SV_BubbleTime · on Oct 24, 2023

The linux version of foxit is absolute trash. They stopped development on years ago. It looks 1990 bad, a lot of features are missing.

It actually runs better under wine, with all sorts of errors that pop up because its updater Service can’t be found.

andrepd · on Oct 24, 2023

I use qpdfview and I'm very happy. Loads of customisability.

eviks · on Oct 24, 2023

> Skim on Mac implements Option + Scroll and Left Click + Drag Pan, but it's not reconfigurable to any other keys or mouse buttons

You can use Karabiner Elements + BetterTouchTool to rebind that when Skim is in the foreground?

viraptor · on Oct 24, 2023

Every browser pdf reader I've seen handles the alt+scroll for zooming (since the browser itself does it) Not sure about panning shortcut.

keehun · on Oct 24, 2023

I use three-finger drag on Mac (with a magic trackpad) and find that better than any combination of click & drag. Have you tried it?

mattegan · on Oct 24, 2023

That's fair! I'm just not a big trackpad person since I find doing ECAD or MCAD with a trackpad to be not so enjoyable :)

Izkata · on Oct 24, 2023

evince (linux) does Zoom with Ctrl + Scroll, maybe yours does too? I don't think it has Pan, but I'm keyboard-heavy and use horizontal scroll with Shift + Scroll.

joveian · on Oct 24, 2023

Middle button and drag moves the text wherever you drag it.

The evince feature I can't live without is the find, which shows a side panel with all matches in the document along with a bit of context. I wish all document find everywhere did this.

barbs · on Oct 24, 2023

Evince uses middle-click to pan

cmplxconjugate · on Oct 24, 2023

I'm in the same boat. It's almost the only application I miss! I use Okular but I think Sumatra just has the right UI for me.

ww520 · on Oct 24, 2023

Can it run under Wine on Linux?

app4soft · on Oct 24, 2023

Yes, it runs under Wine or CrossOver on Linux and Mac.

pixelpoet · on Oct 24, 2023

What's super handy about SumatraPDF is that it will auto reload normal image files if they're modified, so it's an easy way to get some sort of windowed graphics output by saving image files.

Semaphor · on Oct 24, 2023

Not just images. Whole PDFs as well. I do some PDF generation for work, and running the update command and instantly seeing the changes when it’s done, is great.

globular-toast · on Oct 24, 2023

Yep. Pretty standard workflow when using LaTeX or something.

Culonavirus · on Oct 24, 2023

Also what's super handy is that it can send PDFs to select printers from the command line (which is a killer feature, at least in Windows).

Ghostt8117 · on Oct 24, 2023

I love SumatraPDF. I've used it for years and it is wonderful. I have been writing Latex in vim and I compile with the document open in sumatra side by side for instant updates. Very smooth workflow with the instant reload of the page.

fiforpg · on Oct 24, 2023

Yep. An obligatory hat tip to lervag — his vimtex plugin* is a beast.

*Which you are most likely using, since it integrates with Sumatra out of the box. And with MuPDF. And Skim, etc. It's cool like that.

bee_rider · on Oct 24, 2023

Evince does this as well. It is a completely necessary feature in my opinion.

I tried Sumatra a while ago, before switching over to Linux. It seemed pretty decent, nice and snappy.

HKH2 · on Oct 24, 2023

Atril automatically reloads too.

esalman · on Oct 24, 2023

Sumatra PDF was my go-to tool for viewing PDF changes in real time while writing manuscripts and thesis during PhD. Acrobat (paid) and foxit (free) have more features but locks the file undergoing changes.

Zuiii · on Oct 24, 2023

Good example of how the implementation of unnecessary restrictions, without proper consideration of how such restrictions would impede users, can harm your product. I hope all software doing the same on android (safetynet, prevent screenshots, etc) follow Acrobat reader into irrelevance.

lxgr · on Oct 24, 2023

Exclusive locking is simply the default on Windows.

pbmonster · on Oct 24, 2023

Sure, but for a viewer, deviating from that default makes a lot of sense.

I started using Sumatra years ago exclusively because of that feature. Exports/compiles failing, just because the PDF is still open somewhere is just unbelievably annoying.

lxgr · on Oct 24, 2023

Definitely! I'm just providing some background on why "lock by default" is so common for Windows applications, and so rare on Unix: It's the defaults.

aragonite · on Oct 24, 2023

Sumatra is great, though occasionally I wish it were possible to have something like Firefox's wrapped scrolling [1] where more than 2 pages can be shown side by side. On a reasonably large monitor, being able to quickly zoom out to see e.g. 30 pages (sometimes all the pages of a journal article) at the same time can be very useful. You can be on p. 20, then go and quickly look up a definition on p.2, then frictionlessly switch back to p. 20. Having to remember a page number to go back to, or a key word to search for, or to worry about where the "back" button might take you (as with most PDF readers) is just too much friction.

[1] https://superuser.com/questions/1365482/how-to-view-a-pdf-wi...

urlwolf · on Oct 24, 2023

Love this view. I can't get pdf display on FF to invert colors. Is there a way? Then it would replace okular for me (which also does this overview mode)

SushiHippie · on Oct 24, 2023

https://support.mozilla.org/en-US/questions/1281116

wolverine876 · on Oct 24, 2023

As you probably know, you could use bookmarks. Or you could open two copies of the same PDF. (But yes, I love that feature in viewers.)

throwaway2037 · on Oct 24, 2023

Submit a patch?

teleforce · on Oct 24, 2023

Is there any open source seamless PDF editor available given how pervasive is the PDF document nowadays? I really think that we need one and it has been long overdue.

What I meant by seamless is that all of the open source software that currently able to edit PDF document is dong it in a clunky way at best for example Krita and LibreOffice Draw. The resulting edited output document also is also looks distinct, in a bad way, from the original document unlike the output from Adobe PDF editor.

pelalmqvist · on Oct 24, 2023

Maybe not exactly what you are looking for, but take a look at Stirling-PDF. https://github.com/Frooodle/Stirling-PDF

teleforce · on Oct 24, 2023

Thanks for the link will try it out, apparently this app was initially made by ChatGPT, whatever that means

>This locally hosted web application started as a 100% ChatGPT-made application and has evolved to include a wide range of features to handle all your PDF needs

aunterste · on Oct 24, 2023

Great tool, easily self hosted for the family, their rare beyond viewing PDF needs don't warrant to install anything.

Zufriedenheit · on Oct 24, 2023

I have done extensive research in past when i was considering switching from mac to linux. Came to the conclusion that there is no viable open source alternative for the mac preview app unfortunately. My requirements where: - Fill forms. - Add text and markup. - Reorganize/add/remove pages. - Redact parts of the document. (should also safely delete underlying text data, without rasterization of the whole document) - Add image of signature without rasterization of the whole document. This [1] is a long discussion i found about the topic.

[1] https://unix.stackexchange.com/questions/85873/how-can-i-add...

fithisux · on Oct 24, 2023

The spec is very complex. there are some, here and there but nothing coherent.

Even worse PDF is not "open source", it should have been accompanied by LaTeX source.

teleforce · on Oct 24, 2023

The PDF is now standardized as ISO 32000, it's open for everyone to create compatible tools.

https://en.wikipedia.org/wiki/PDF

looperhacks · on Oct 24, 2023

"Open" for the cheap price of ~$230

SV_BubbleTime · on Oct 24, 2023

I deal with ISO all the time, they’re cutting you a “deal” there.

mesebrec · on Oct 24, 2023

Have you tried the latest version of OnlyOffice?

davikr · on Oct 24, 2023

I really love SumatraPDF but I wish it'd fare better on the search story: there's no widget to show all search matches, afaik, and searching can be slow on really big files with thousands of pages.

To be fair, most PDF readers struggle with this, and I suspect only Acrobat attempts to cache or index files (as a Pro feature).

mahdi7d1 · on Oct 24, 2023

I remember reading a blog post from the author of sioyek boasting about search performance.

Haven't personally compared the options but for me sumatra zathura and sioyek all feel fast enough to not notice any problems.

https://ahrm.github.io/jekyll/update/2022/09/11/pdf-viewer-t...

"Now I must admit, the reason sioyek is so fast is because it creates a search index when you open the document."

davikr · on Oct 24, 2023

Ah, I had heard about Sioyek but I didn't know there was an option I could enable to build a search index. That is really nice.

sammy2255 · on Oct 24, 2023

Love SumatraPdf, completely no-bs software unlike Adobe. If I ever need to sign a PDF Microsoft Edge works wonderfully

Mengkudulangsat · on Oct 24, 2023

I still need Adobe to sign and fill forms. Wish Sumatra can do those too :(

autoexec · on Oct 24, 2023

I'm really glad that they don't. I'd much rather have a safer PDF viewer that only supports a limited subset of what adobe's shitware does to handle 99% of my PDF file needs so that I only need to risk opening Adobe software 1% of the time. That's so much better than turning Sumatra into the same bloated mess of risky features that is Adobe Acrobat.

nip · on Oct 24, 2023

No need for Adobe: https://simplePDF.eu

Disclosure: I’m the developer behind it

Steve44 · on Oct 24, 2023

I'm not sure, one thing I like about Sumatra is that it's very light and clean.

When we need to sign or view some complex documents then I'll use Acrobat.

leeman2016 · on Oct 24, 2023

Xournal++ is nice too

urlwolf · on Oct 24, 2023

If you are on linux and miss the snappiness of sumatra, try llpp. They use the same renderer. In fact, llpp has more features, like a better overview mode where you can get 3 pages on screen. Search does suck. llpp is written in Ocaml and a fantastic piece of software.

mmh0000 · on Oct 24, 2023

I’ve been using Sumatra through WINE (a tool that emulates the Windows api) for years. Works great on Linux that way.

sinuhe69 · on Oct 24, 2023

I have the bad habit of keeping many PDF files in the reader open (yeah, I keep reading long books). The thing is when I open Sumatra for a new file (maybe only for a quick glance) Sumatra will reload all my files and slowed the startup significantly. I wish I can set the option which will load only if I’ve actually switched to the tab, just like Firefox.

hans_castorp · on Oct 24, 2023

This can be controlled when use the "Advanced Options". It opens a config file where you can set:

RememberOpenedFiles = false

https://www.sumatrapdfreader.org/settings/settings3-4-6