- cross-posted to:
- [email protected]
- [email protected]
Has anyone used ArchiveBox for self hosted web archiving? If so, what are your thoughts on it compared to Internet Archive or other publicly available services?
I have been experimenting with it. For what it is, it works pretty well … for now. My concern is that it’s a ton of moving parts basically duct-taped together by an abuse of the Django admin (Django is the web app platform it’s based on, and I was a developer for it long ago). Also, the search function is primitive at best. I don’t think it’s going to be my long-term solution for this need, but maybe I’m wrong.
The archived pages are available as files on disk. I also added a script that generates an index.html so I can browse the archive without starting the program. Basically, the only time I run ArchiveBox code is when adding a new site. And I never look at the GUI; it brings nothing to the table.
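For anyone who wants to try the same approach, here is a rough sketch of what such an index script could look like. This is not the actual script. It assumes each snapshot lives in its own subfolder containing an index.html, optionally with an index.json that has "title" and "url" keys. The function name `build_index` and that layout are assumptions, so check them against your own data directory first.

```python
import html
import json
from pathlib import Path


def build_index(archive_dir: Path) -> int:
    """Write archive_dir/index.html listing every snapshot folder.

    Returns the number of snapshots listed.
    """
    rows = []
    for snap in sorted(p for p in archive_dir.iterdir() if p.is_dir()):
        title, url = snap.name, ""
        meta = snap / "index.json"  # assumed per-snapshot metadata file
        if meta.is_file():
            try:
                data = json.loads(meta.read_text(encoding="utf-8"))
                if isinstance(data, dict):
                    title = data.get("title") or title
                    url = data.get("url") or ""
            except (OSError, ValueError):
                pass  # unreadable metadata: fall back to the folder name
        # Link is relative, so the page works when opened straight from disk.
        href = html.escape(f"{snap.name}/index.html")
        rows.append(
            f'<li><a href="{href}">{html.escape(str(title))}</a> '
            f"{html.escape(str(url))}</li>"
        )
    page = (
        '<!doctype html>\n<meta charset="utf-8">\n'
        "<title>Archive index</title>\n<ul>\n" + "\n".join(rows) + "\n</ul>\n"
    )
    (archive_dir / "index.html").write_text(page, encoding="utf-8")
    return len(rows)
```

Calling `build_index(Path("archive"))` after each new addition would refresh the listing, and the resulting page can be opened directly in a browser with nothing running.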