Hacker Newsnew | past | comments | ask | show | jobs | submit | dginev's commentslogin

You will be delighted. Feel free to inspect some sources.


ar5iv tracks the arXiv collection with a one month lag. Exactly as to signal that this is not the "official" arXiv rendering. It is also a showcase predating the arXiv /html/ route, but largely using the same technology. Nowadays maintained by the same people (hi!)

There used to be another showcase, called arxiv-vanity. They captured what happened pretty well with their farewell post on their homepage:

https://www.arxiv-vanity.com/


You can help make LaTeXML better, or you can simply report issues when you spot them during reading. Some we have collected automatically (any errors and missing packages), but others we can't - wrong colors, broken aspect ratios of figures, weirdly layed out author lists, etc.


Hi, an arXiv HTML Papers developer here.

As a very brief update - we are pending a larger update.

You will spot many (many) issues with our current coverage and fidelity of the paper rendering. When they jump at you, please report them to us. All reports from the last 2 years have landed on github. We have made a bit of progress since, but there are (a lot of) more low-hanging fruit to pick.

Project issues:

https://github.com/arXiv/html_feedback/issues/

The main bottleneck at the moment is developer time. And the main vehicle for improvements on the LaTeX side of things continues to be LaTeXML. Happy to field any questions.


I would like to write code for latexml to translate a package but I found the documentation to be hard to understand. That might be what is holding developers back. I looked at this a year ago and gave up.


Tell us what you would need described in a tutorial to be productive, as well as your background with the technologies involved (TeX/LaTeX, perl, XML, XSLT, HTML). Probably best as a new issue:

https://github.com/brucemiller/LaTeXML/issues

It's a pretty deep rabbit hole, but I wholeheartedly agree most standard package support incantations should be easy and few to use.


Just passing by to mention that if you get excited about seeing your upgrades in arXiv itself, we can talk about contributing them to the arXiv HTML pages.

But seeing your plans for Science Stack, all the best with the endeavour!

And I am curious to know if arXiv:2105.10386 works well.


It works! After the initial data load (big paper), the scrolling and performance works nicely.

Can visit at sciencestack.ai/arxiv/2105.10386

Note: no support for nomenclature/index yet. I'm also working on refactoring the data/json to a streaming model (right now it's one big json dump on load)


One can also grab a random arXiv paper in HTML (via ar5iv), if that was desired.

Just visit:

https://ar5iv.labs.arxiv.org/feeling_lucky


Is the code for the "feeling lucky" selection mechanism open? Or, do you know how they select papers at random?


Sure, the repository is open source (linked from the front page).

The selection did not get much thought at all, just a Rust rand shuffle over all ids performed at first page visit and then cached: https://docs.rs/rand/latest/rand/seq/trait.SliceRandom.html#...

I had all IDs already computed for the previous/next article navigation feature, so it seemed fun to reuse them.


Thanks for this! If I knew this existed, I wouldn't have built the page myself.


If you are already using LaTeXML, it is also easy to get the ar5iv rendering (example article: https://ar5iv.labs.arxiv.org/html/1910.06709)

You can either copy the CSS asset from: https://github.com/dginev/ar5iv-css

and add "--css=ar5iv.css --nodefaultresources" to the command line options for latexmlpost (or latexmlc).

Or alternatively, for people not interested in learning how to install and configure latexml, they can try the self-contained Dockerfile at:

https://github.com/dginev/ar5ivist/


To me one of the exciting aspects of HTML is that we can theme the same article in different ways, tailored to individual preferences - just swap in a different CSS file.

Having a two-column theme, or left-aligned vs justified themes, could be workable in the long run. I hope that we get to see some browser extensions modding the pages before too long.

The reason for the current justified text is that it is the default aesthetic for a LaTeX-based article, and a lot of authors expect it.


That's it in spirit, but in practice it's refreshed:

https://github.com/arXiv/arxiv-view-as-html


For the image widths, there is some CSS fine-tuning that is still needed on the arXiv HTML side. I think that will get fixed soon, just needs the right height directive set.

Getting subfigures emulated via flexbox is one of our more recent LaTeXML enhancements, and still has some ongoing work (working on it today actually). It can be a bit finicky to test - there are easily 20 different ways people can write LaTeX for subfigures in arXiv.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: