Hacker Newsnew | past | comments | ask | show | jobs | submit | mikabasketball's commentslogin

France?


What do people use to perform those pdf tasks without uploading sensitive files to a website?


For PDF to markdown, I recently released V2 of my tool marker - https://github.com/vikparuchuri/marker


This is very effective - it consistently yields great results.


The files are deleted immediately after processing I'm considering implementing WebAssembly (Wasm) to do most of the work on the client's device and enable offline use


You say that here, but its not in the privacy policy as far as I can see.

When you get to the stage of monetizing the site, I expect the most obvious starting point is monetizing the information inside the pdfs.

Then there's the obscurity of how much (if any) is passed on to other services (like google). You may have one policy about the PDFs, they may gave another.

So yeah, I'm in the "not for general use" camp myself. (Although there are edge cases where it may be useful.)

Don't get me wrong, I can see the upsides, and your web site looks professional, but alas the downsides are too significant to overcome the inconvenience of searching out something local.


Stirling-pdf. You can self-host it.


using this myself. its pretty good.


Only last week there was a HN thread about how the author said they just used chatgpt to make the entire thing and as a result the code is beyond bad. I don't think I'd trust it.


They didn't say that. They said they wrote the first version in a few days, using ChatGPT. Then worked on it almost another year since then. Something of that nature. Pretty big difference.


>I don't think I'd trust it.

You can audit the code yourself then. What's stopping you?


Nothing is stopping me using something else that isn’t ChatGPT hope and pray code.


Am not a Coder :-P


That's a fair point lol, my bad.


Pandoc, ocrmypdf, libreoffice, pdftk, pypdf2


There is a site listed elsewhere in these comments that does the work entirely in-browser. That's what I would use.


I use Preview on mac or even Spotlight for a good portion of these functions


A good alternative I use is https://www.goldencheetah.org/, open source, and you don't need Strava/Garmin/etc...


Outside of the basics of analysis, Golden Cheetah is nothing like Strava.

Strava is a social network, map-based ride reviewer, heatmap tool, and basic ride analysis tool. GC is a not-great map viewer and VERY deep ride data dive tool for training.

A venn diagram for the two would only overlap by about 15%.


Personally I find the "social" aspect of Strava off-putting.

Running for me has always been a solitary pursuit. Making it "social" and letting you know that BigDave69 ran that segment faster than you or whatever totally jars with the whole ethos for me.


That's actually my least favorite part as well. I like being able to see what friends did and use that info for routing, but I have to be careful. If I start flipping through the feed on a day (or week) when I'm not riding I'll start to feel like I'm not living up to some expectation. Mentally that's very, very unhealthy.


Why should I switch to SQLMesh if I am already on DBT? DBT works fine for my use case today.


Even if you don't have much data, SQLMesh will save you time over dbt. Iteration is an important part to developing data pipelines, and fully refreshing your warehouse every time is wasteful and can slow you down.

Trying out SQLMesh is easy, you can import your existing project and leverage SQLMesh's runtime. At the very least, you can add unit tests to your project.

You can see a more detailed comparison here https://sqlmesh.readthedocs.io/en/stable/comparisons/.


there is also theothermail.com to generate throwaway email addresses that forwards emails to your original inbox, you don't need to use your personal email address to sign up for a newsletter


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: