https://news.ycombinator.com/item?id=37800951 (183 comments)
I tested it out then and am considering migrating from my current system (Google Drive) to using a self-hosted approach. Paperless seems to have a good approach for minimizing the mental overhead of ingesting and categorizing new documents - which is what ultimately leads me to stacking documents up for months before processing them. My initial pilot run was promising, but I haven't gotten around to switching yet.
From the changelog, it's not really clear to me what's notable about this release, especially as a new/potential user.
This page is a better introduction to the product, although it doesn't mention the v2 release yet:
One feature which seems to be quite a nice improvement (speculating as I haven't upgraded yet) is consumption templates [0]. My workflow involves an ADF scanner with an Android application, sharing the scanned PDF with Paperless Share [1] and then it's uploaded to the server via API. It seems that consumption templates will enable adjusting tags/sharing settings/permissions of a document at ingestion time based on where it's ingested from.
[0] https://github.com/paperless-ngx/paperless-ngx/pull/4196
It's a one-way sync. Paperless is the authoritative location. The only reason I back up to Google drive is so that my phone has easy access to the documents I may need on the go.
https://github.com/paperless-ngx/paperless-ngx/releases/tag/...
* Feature: Implement custom fields for documents @stumpylog (#4502)I just updated my install to v2.0.0 with a simple podman pull and a systemctl restart of my paperless pod and everything looks great. Hats off to the contributors of the project. Every update, even major ones like this have been really smooth.
I've been thinking of moving from docker-compose to podman, specifically using the [podman-play-kube](https://docs.podman.io/en/v4.2/markdown/podman-play-kube.1.h...) but haven't gotten around to it.
I like Podman has a lot to offer for self-hosters but it isn't popular (yet?)
If you check out the bash script on my ppngx project you can get an idea of how you could write your own script for your workloads. I can run ./start.sh over and over again and it will replace the running containers with my changes which is a very fast DX.
The README on ppngx talks about using the podman generate systemd command to create units from the pod so you can run them via systemd, but this command is being deprecated in favor of using Quadlet [1] (systemd generator) to crate the units on the fly. I haven't gotten around to using it since I like to have more control over my systemd units. I could see Quadlet being very good for users that don't know the inner workings of systemd and podman.
1: https://docs.podman.io/en/latest/markdown/podman-systemd.uni...
I do PDF editing offline, on the desktop, then re-upload to paperless. Not the most integrated flow, but much more bulletproof. I want the PDFs themselves to be immutable once on paperless. Only metadata should be editable.
Last I checked it doesn't and had to run a separate service to advertise to the printer the paperless endpoint.
Maybe it's not something standard and every company has their own implementation :shrug:
Does anyone have a good scanner recommendation though? I am eyeing the Brother ADS-1700W since it seems to be recommended often, but I would really like to use the "scan to webhook" feature (it's 2023 after all) instead of SMTP or whatever else are the options I would have with the Brother.
I am using iPhone as a scanner and it automatically scans, OCRs, uploads and ingests to the paperless-ngx instance, even remotely using tailscale.
The iPhone camera is more than good enough for scanning documents.
I love that it integrates with Paperless so well!
all works perfectly.
I'll clean all the rollers and stuff next week and test it :P
In that case I scanned to a USB drive attached to the scanner (since each photo was a separate file). For Paperless I use the Epson Smart app, scan the document with whatever settings, remove/rotate pages as needed, and then share it to Paperless with Paperless Share [0].
Many network attached scanners can scan to SMB, no device needed, but I kind of like the human-in-the-loop aspect. Since my Paperless server runs on an HDD next to the scanner I can actually hear once the file lands which is quite satisfying.
They're the Brother laser printer of scanners.
But yes, the scanner is pricey. It was definitely an investment.
The only thing that comes to mind is either do a convoluted SnapScan Online -> Google Drive -> rclone -> Paperless or bite the bullet and figure out how to directly scan into the local box via USB.
Source (German): https://www.synology-forum.de/threads/dokumentenscanner-dire...
I installed Paperless on my home server & spent a night digitizing everything. After being comfortable with it for a few months I went back & shredded all my paper copies. Today my process is similar - when I get a document I would normally toss in that filing cabinet I just scan, upload to Paperless, and shred it. It's also really nice for storing large purchase receipts - I've previously had the writing on thermal paper receipts go invisible after a period of time, no longer an issue.
Searching for something specific is so easy now! Huge QOL improvement. Just make sure you have a solid backup strategy, losing my Paperless database & filestore would be devastating.
To me it means Angular (the web framework). So, I was surprised to learn this wasn't an Angular plugin. Angular is often referred to as ng for short and as such their plugins tend to have ngx as a prefix. For example, the angular wrapper for ChartJS is ngx-chartjs.
The paperless-ngx's core team focused on gathering a group of people to support it to avoid any burnout problems and keep the project sustainable.
paperless (https://github.com/the-paperless-project/paperless) -> paperless-ng (https://github.com/jonaswinkler/paperless-ng/) -> paperless-ngx (https://github.com/paperless-ngx/paperless-ngx/)
https://github.com/jonaswinkler/paperless-ng/tree/master/src...
At least that is my understanding following the Paperless project over the years.
For me, as someone who wants my docs on my own server, but well, doesn't care enough to want to constantly keep up with forks/changes/migration/updates, I've been looking for just something stable I can use for years (or maybe decades?, eg part of the appeal of something like Obisidian is that it just falls back to .md text files).
Curious if there are any long-term active users of this (or other systems) for handling all their paper and what they think about maintainability/longevity?
So far I’ve probably updated the software ~5 times across various releases, each time I’ve updated it been because there was a new feature I wanted rather than needing to pull in fixes (the software has been bug free for me). The update process is well documented and very straight forward if you are using their docker compose setup to run the application
Their entrypoint script makes a lot of assumptions and in their docker-compose example they use a single container running supervisord instead of multiple containers, each with a dedicated purpose (ingestion, consuming, web server). The setup is almost insistent on logging to a file instead of stdout. It also checks and tries to modify permissions of some folders(!!). This requires quite a bit of unpicking.
This is doable, but not frictionless to get it to do what I consider “best practices” but I understand that it’s probably a mix of “easy for someone who’s day job is not to be an infrastructure engineer” and “we were using supervisord for baremetal anyway”. Maybe a lot of it is personal preference but I do feel like the project is not taking containerisation fully to heart. Maybe being more user-friendly in their eyes is more important than being a containerisation purist.
Either way, I’ve got it nearly working with my Brother ADS-1700W, which has shortcuts for me, my wife, and “joint”, which uploads documents to different directories via SFTP which then automatically have their paperless-ngx owner set appropriately.
I have zero regrets so far. Paperless ngx is so much more user friendly, the automatic date extraction from OCR, the auto tagging and document type classification, and the ease to backup and restore sold me. I highly recommend it.
For years I was eyeing Mayan as one the variants I could use. Not anymore.
For home I’d go with paperless-ngx no contest, especially if you can run it in a docker container.
https://github.com/the-paperless-project/paperless/issues/20
I don't know if it made it's way into this fork.