Post As2fIkWuNp1P3pffs0 by [email protected] | |
More posts by [email protected] | |
Post #As1qRKDMUrZqOCrOXA by [email protected] | |
0 likes, 1 repeats | |
@johncarlosbaez here's more info on the arxiv mirror shutdown, for those cu… | |
Post #As1rSlGBWlksVtQgTI by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez piracy is data preservation | |
Post #As1skmMmFjlf6xfV0S by [email protected] | |
0 likes, 1 repeats | |
@johncarlosbaez I'm currently doing a separate archiving project. Anybody g… | |
Post #As1uKIrJXQcbyUSG00 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez cc @SafeguardingResearch | |
Post #As1vzjvzsPrNm6y396 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez TIme to bring back the arXiv mirrors, no? | |
Post #As1y9QB8NoYLJ7nAP2 by [email protected] | |
0 likes, 1 repeats | |
@johntimaeus @johncarlosbaez Of this site or something else entirely. | |
Post #As2080WAVHAJLC5HKi by [email protected] | |
0 likes, 1 repeats | |
@mwguy @johncarlosbaez Other stuff. FB is deleting all old live videos. There&#… | |
Post #As25CgtTOOJ7hF2NKC by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez plugging this repo https://github.com/Phylliida/ExtractArxivTex… | |
Post #As2B09EHKUtvXQAfpI by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez 100 Gb a month? This probably counts generated data, no? I cann… | |
Post #As2DzhmlfFxsHkmejw by [email protected] | |
0 likes, 0 repeats | |
@dimpase - The arXiv keeps the PDFs, not just the LaTeX. My original post was … | |
Post #As2EDhjwd9SOCM8eqe by [email protected] | |
0 likes, 0 repeats | |
@weekend_editor - yes. I don't know who ran those, but some of them should… | |
Post #As2FfqBouO2wT09p6e by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez @dimpase So by this estimate, by now the size would be roughly … | |
Post #As2JffM1asp6g66Qi0 by [email protected] | |
0 likes, 0 repeats | |
@moritz_negwer - sounds right! Truly reliable storage requires multiple backu… | |
Post #As2JvoTPSOVVasFxh2 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez poke @Lydie ! | |
Post #As2LAtvJZlQWnL4EBE by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez on it. Just need to do a little space management first. | |
Post #As2NKYgdaWg1jaw1KK by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez I've been similarly wondering about NCBI databases.. | |
Post #As2PnMqos86fKgYvWC by [email protected] | |
0 likes, 0 repeats | |
@fluffykittycat it's worth educating yourself before making such comments. … | |
Post #As2QKJTOS4HaPeJZ8y by [email protected] | |
0 likes, 0 repeats | |
@moritz_negwer @johncarlosbaez @dimpase S3 is an option, but seeing how events … | |
Post #As2RSsV6CiCBJrlChk by [email protected] | |
0 likes, 0 repeats | |
@moritz_negwer @johncarlosbaez one would typically use compression for such dow… | |
Post #As2dM7xDY5AwfdSa24 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez Yes, I agree - proper live-service mirroring requires a lot mor… | |
Post #As2fIkWuNp1P3pffs0 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez I think I'd have 8T to spare but little time to spend on se… | |
Post #As2kv2Mx0LpjAD5dLc by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez @weekend_editor I was thinking to suggest that to my university… | |
Post #As2mWHYNc1yqvSl8ue by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez and it's either 8T single disk or 4T RAID1. Useless for an … | |
Post #As2mWHf7CzMtGLuWPY by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez ah the archive.org torrents could be a good solution https://ar… | |
Post #As2mYD8J1EQRnLU0vo by [email protected] | |
0 likes, 0 repeats | |
@castarco @johncarlosbaez @dimpase I agree! As I understand the S3 is where arx… | |
Post #As2mYDFkZYNeAQxxXE by [email protected] | |
0 likes, 0 repeats | |
@moritz_negwer 👋 Yes, we're doing exactly that :)Also: 10TB are not an i… | |
Post #As2mYWggJeDERSiEeO by [email protected] | |
0 likes, 0 repeats | |
@castarco @moritz_negwer @johncarlosbaez @dimpase https://archive.org/details/a… | |
Post #As2mYWnltHsqnS1thY by [email protected] | |
0 likes, 0 repeats | |
@engarneering This is cool, but also inside US-based and US-controlled machines… | |
Post #As2mbO6KhCtwgH4ihs by [email protected] | |
0 likes, 0 repeats | |
@moritz_negwer @castarco @johncarlosbaez @dimpase @SafeguardingResearch FediArx… | |
Post #As2mg8Uc9f0viihGN6 by [email protected] | |
0 likes, 0 repeats | |
@eqe @johncarlosbaez wow that was an optimistic evaluation of the future 😬�… | |
Post #As2n0rmRUDnby7v93w by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez @dimpase the bulk of the size is probably figures, and the diff… | |
Post #As2n382a0BtQyiY0eW by [email protected] | |
0 likes, 0 repeats | |
@johntimaeus 👋 Do you want to chat about this?(also in private, if you like)… | |
Post #As2n3zcCoSsl8Yci2a by [email protected] | |
0 likes, 0 repeats | |
@foobarry @fluffykittycat well, probably the shadow libraries are the most resi… | |
Post #As2yGHHg2rdouI58KG by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez A critical system should, in general, not have single points of… | |
Post #As32HCIKTtBJXS54To by [email protected] | |
0 likes, 0 repeats | |
@eqe - thanks for the blog link! | |
Post #As32TWRdOCa1VYJI3M by [email protected] | |
0 likes, 0 repeats | |
@Mehrad @weekend_editor - what we could use now are mirrors not under US jurisd… | |
Post #As32igK0wt14U7yTgG by [email protected] | |
0 likes, 0 repeats | |
@Mattcraig - cool! If you do it (or more optimistically: when you're done)… | |
Post #As39rARhX1y3uOYzSK by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez Looks like a work for anna's archivehttps://annas-archive.o… | |
Post #As39vLuMrBn8cIy2b2 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez Has @SafeguardingResearch been mentioned anywhere in this threa… | |
Post #As3A0CJvvmUqKTEh60 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez @neuralreckoning Why were non-US arXiv mirrors shut down? what… | |
Post #As3A76XXoxh6QctbjU by [email protected] | |
0 likes, 0 repeats | |
@SafeguardingResearch @moritz_negwer @castarco @johncarlosbaez @dimpase Whew, I… | |
Post #As3LgoKK773FrUwMzo by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez 5.6 TB? In all honesty, that's not much. At that rate of gr… | |
Post #As3Lo6PFIFYjxeigQC by [email protected] | |
0 likes, 0 repeats | |
@mkj - The issue is getting from "one could" to "I will". … | |
Post #As3M3daYCjnfDaPoEy by [email protected] | |
0 likes, 1 repeats | |
@[email protected] hmm 5.6tb growing at 100gb is not much, totally … | |
Post #As3WZq5WGYw4nDtVR2 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez OK, have 8TB spinning platter ready but am confused by instruct… | |
Post #As3WxsoNn89cBqLNE8 by [email protected] | |
0 likes, 0 repeats | |
@mkj - right. There's a popular setup for solving the distribution problem… | |
Post #As3ZVsMQAsOoXsbMfY by [email protected] | |
0 likes, 0 repeats | |
@adredish - they were no longer needed, or so people thought at the time: https… | |
Post #As3ZbyRoSLJTA420YK by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez I'd like to help with at least a portion of the data, but i… | |
Post #As3Zqo4DKj5HJI49Uu by [email protected] | |
0 likes, 0 repeats | |
@penryn - probably good to read all the comments on my post, like this:https://… | |
Post #As3b2oFsYkWZK4oFea by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez 5.6 terabytes sounds like small potatoes: even a single NVMe di… | |
Post #As3dqSsQW8KNPHe3Qu by [email protected] | |
0 likes, 0 repeats | |
@albertcardona - we'll see if that happens. Preferably someone outside the… | |
Post #As3faujNOAN54N7lqq by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez Any particular contact to request a copy, upon us sending them … | |
Post #As3hAuoQfAfLhYDbHs by [email protected] | |
0 likes, 0 repeats | |
@albertcardona - Let me find out someone to contact about doing it by mail! Th… | |
Post #As3jrdsPhmuxXKaF8K by [email protected] | |
0 likes, 0 repeats | |
@penryn - no problemo! | |
Post #As3m3cFFnU97rM6W6y by [email protected] | |
0 likes, 0 repeats | |
@Mattcraig - I can't figure this out any better than you: I'm not a t… | |
Post #As3mDvtla5Edi2yOTQ by [email protected] | |
0 likes, 0 repeats | |
@penryn - hey, this might be free:https://archive.org/details/arxiv-bulk | |
Post #As3nphlUg2EMpkg8PY by [email protected] | |
0 likes, 0 repeats | |
@albertcardona - by the way, this looks like a free way to download the whole a… | |
Post #As4VU6rH61tcGstZz6 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaezMy question is if it is still possible to have an official mirro… | |
Post #As4zxNMVoNr8ToNTwu by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez be aware that the internet archive version is out of data by al… | |
Post #As53sC9yQwFC6bIY40 by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez @avsm your copy does not have this cut date, right? | |
Post #As5Aa1j9BNzyDR2Agi by [email protected] | |
0 likes, 0 repeats | |
@[email protected] 5.6tb growing at 100gb a month sounds incorrect … | |
Post #As5Aa1pAoyoqW7qz56 by [email protected] | |
0 likes, 0 repeats | |
@m - I imagine they know what they're talking about; the arXiv is becoming … | |
Post #As5s01KvYO3qYvLY1I by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez Just FYI, if we want to get a serious solution in place, we'… | |
Post #As5saVF7q7Sdjy1r1M by [email protected] | |
0 likes, 0 repeats | |
@dginev - there was an international network of arXiv mirrors until September 1… | |
Post #As66TSzBESAU4T6rAW by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez Right, and they were a pre-cloud setup that would need to be re… | |
Post #As9Qm9MWzqdrGn68cC by [email protected] | |
0 likes, 0 repeats | |
@moritz_negwer @johncarlosbaez @SafeguardingResearch https://annas-archive.org/… | |
Post #As9Qm9V2UDRnhB4vsO by [email protected] | |
0 likes, 0 repeats | |
@patrick - that's an excellent article, thanks! | |
Post #AsB5ffNTrAPDobWrQG by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez My company could host a copy (in Europe). Do you know whom shou… | |
Post #AsVyZA2Wq1nPWcq2AC by [email protected] | |
0 likes, 0 repeats | |
@dimpase @moritz_negwer @johncarlosbaez unless there is something weird going o… | |
Post #Aspn3Lhui5hq6u1T9c by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez Thank you. This seems remarkably short-sighted by the arXiv adm… | |
Post #AsqSZ20xYjSEQC9oQq by [email protected] | |
0 likes, 0 repeats | |
@SafeguardingResearch The archive.org torrents serm to mirror the tarballs from… | |
Post #AsqSqvP4sSh988yEUa by [email protected] | |
0 likes, 0 repeats | |
@johncarlosbaez brilliant!No red carpet at all would suggest you had forgotten… | |
Post #AsqSsC74EmEFFA9St6 by [email protected] | |
0 likes, 0 repeats | |
@ulfr well, we're working on exactly that :)You can look at the alpha versi… | |
Post #AsqSsq5hbGu7EVowJU by [email protected] | |
0 likes, 0 repeats | |
Some of us have proper internet uplinks. In Swizerland or the Netherlands, for … | |
Post #AsqT1HRlm8JUOPARou by [email protected] | |
0 likes, 0 repeats | |
@moritz_negwer @castarco @johncarlosbaez @dimpase @SafeguardingResearch arxiv h… | |
Post #AsqT3ivmYr2gDDN344 by [email protected] | |
0 likes, 0 repeats | |
@dimpase @johncarlosbaez a lot of arxiv content has a lot of high resolution f… | |
Post #AsqT6B3qwdoW2GJdkO by [email protected] | |
0 likes, 0 repeats | |
@foobarry @fluffykittycat Very patronising reply for someone who clearly has no… |