Incidentally, HN is up at the top of painful websites to scrape. All comments are at the same level of hierarchy, and the appearance of threading is accomplished by a spacer <img> with a width set according to the nesting depth. There are also no absolute times, and the times that are there slowly lose resolution. In order to get a comment time that's accurate to the minute, you have to load the page twice an hour and store the post id to time mapping.
Same with the comment time. Looked into the Sort by Recently Added request, and there's no good way to do it without polling.
Regex Filter added
I wrote it in about a day, so there's a lot of jenky code. Just wanted to get it out there and see if it's worth spending more time on.
Everything's full-screen and stretched so it seems like a ton of information thrown at you. Each posting has a top line, so you might be able to put that in a header bar of some sort, like an orange div that stretches across with an side and down arrow indicating nested or not - and maybe start out with all of them nested, that way users can really pick the ones they want easily instead of scrolling all the way through. A little more padding between jobs would be nice as well.
I like it though! Keep up the good work.
Bumped the padding a little.
As an example, do a search for "Python" and you'll see a desc with this sentence in it: "We use Python/Django, JavaScript (jQuery, AngularJS), ActionScript, MySQL (and some MongoDB), but you’d be free to choose your own tools and libraries."
[I realize this could be read sarcastically, but I mean it sincerely.]
This isn't possible at the moment on the regular thread :/