Den fre 27 nov. 2020 kl 19:26 skrev Daniel Shahaf <d.s_at_daniel.shahaf.name>:
> Sounds good. Nathan, Daniel Sahlberg — could you work with Infra on
> getting the data over to ASF hardware?
>
I have been given access to svn-qavm and uploaded a tarball of the website
(including mboxes). I'm a bit reluctant to unpack it since it takes almost
7GB, and there is only 14 GB disk space remaining. Is it ok to unpack or
should we ask Infra for more disk space?
Note that svn-org@ doesn't have an equivalent @s.a.o list, and that, as
> mentioned upthread, the post-migration (from tigris.org to apache.org)
> mboxes may be in a different order than the official ones, and shouldn't
> be "deduplicated".
>
The mboxes will be preserved but I don't plan to make them available for
download (since they are not available from lists.a.o or mail-archives.a.o).
> You indicate a desire to maintain URLs. Do you have some ideas on that?
>
> Each individual message .shtml file contains the message-id in
> a comment. We can extract the comments and build a redirector around
> them. (By the way, this is basically the same exercise that Infra must
> have solved back when Sebb received that CSV file from the lists.a.o
> vendor, so there may be an opportunity for code reuse.) Of course, the
> full rsync likely has the same info available less scrapily.
>
> Or, as mentioned above, the .shtml files could just be preserved
> statically (plus or minus an appropriate message in the list of years on
> the /${listname}/ page). In fact, I'm having trouble coming up with
> a reason _not_ to serve a static snapshot of the pages, even if we do
> build a redirector.
>
No redirector as of now, only the static [s]html pages.
I will need some help from root to:
1. Install a web server. nginx? (just kidding)
2. Setup httpd.conf
3. Configure a DocumentRoot where I can put the files. Doesn't seem right
to store them in /home
Kind regards
Daniel Sahlberg
Received on 2020-12-21 09:00:06 CET