HN Theater @HNTheaterMonth

The best talks and videos of Hacker News.

Hacker News Comments on
That feel when you can't captcha

Taran Van Hemert · Youtube · 99 HN points · 2 HN comments
HN Theater has aggregated all Hacker News stories and comments that mention Taran Van Hemert's video "That feel when you can't captcha".
Youtube Summary
This (BY FAR) wins the award of "stupidest comments" compared to all my other videos. I've got half a mind to disable them, to stop them from clogging my feed.
Sort by "new." Look upon them and despair.

(You guys wanna know the REAL reason why this captcha didn't work? I realized soon after that it was because of my VPN.)
HN Theater Rankings

Hacker News Stories and Comments

All the comments and stories posted to Hacker News that reference this video.
You’re thinking of reCaptcha “v2”. That was when it was about digitizing books. One distorted photo of a word had a known spelling, and the other didn’t. If you spelled the known one right, it let you through. If enough people spelled the unknown one the same way, it would “learn” how to OCR that word (of sorts). If you had time to spare/waste, you could spell the first one right, and put garbage for the second one. About half the time, it would let you through.

Not sure how reCaptcha “v3” (the photos of streets one) works. It could work the same way, but just with more known and unknown (unclassified) pictures? It’s also a lot more harsh if you get it wrong: artificial delays, switching to the “click until no more remain” mode (with more delays), and more. Sometimes, it just refuses to let you through no matter what: https://youtu.be/zGW7TRtcDeQ It’s extremely frustrating.

No, sorry but that's nonsense. reCAPTCHAv2 doesn't do the tile fade in unless your v3 score is low. The fade in I'm talking about can take one to five seconds per tile, plainly designed to annoy the user.

Here is a video of the fade-in in action: https://youtu.be/zGW7TRtcDeQ?t=89

(Note that you do not need to be on a shared IP to experience this. Merely using firefox with resist fingerprinting and a adblocker is enough to trigger this behavior on an IP google normally 'trusts' when using chrome.)

SquareWheel
>Here is a video of the fade-in in action

Okay, I agree that's punitive. I've never seen a fade that slow, so it must be at the lowest levels of trust.

chipperyman573
If you use a VPN (a popular one like PIA or ExpressVPN) you will almost always get a fade in that slow. And it is really annoying.
stordoff
I submit all the time before realising one has started to change. It's terribly frustrating.
snuxoll
This happens to me all the time at home, I use Safari with uBlock Origin and reCaptcha never fails to decide to fuck with me.

I have a static IP address, and Google will happily give me a not-dick captcha if I use another browser without anti-tracking features enabled (even with a clean profile).

Mar 30, 2018 · 99 points, 58 comments · submitted by akkartik
Spivak
I use the Tor browser pretty frequently so I've gotten pretty good at these. It's much easier once you realize that there are three sources of data that determine correctness: basic computer vision, hurried/lazy humans, and a lot of bots. It's not about truth it's about following the crowd.

- Forget the slivers. If it's not more that 75% of the square forget about it.

- Anything not in the foreground might as well not exist.

- Poles != Signs If you saw pole sticking out of the ground you wouldn't call it a sign would you?

- It's not about whether it's a storefront, it's about whether it obviously looks like one. If you spend literally any time thinking about it it's a no.

- Mark the most prominent vehicle and skip everything else. Only mark a second vehicle if they're equally sized.

None
None
Izkata
Aside from that, I'm pretty sure his very first sentence, "Apparently I am incapable of identifying street signs", is correct. He keeps clicking on traffic signs, not street signs.
birksherty
So that means we have to think like bots not humans. Otherwise it wouldn't have these rules and people would easily solve these. What's the point of captcha?
candiodari
I remember 15 years ago making a neural net captcha OCR cracker (for DNS registrations). And after about 3 hours I have my accuracy, recall and confusion numbers and I'm not really getting them any better. This was a captcha like from those old PHP libraries. Many colors (sometimes bright yellow on white GRMBL), varying background color, a few things on the background that were clearly not letters (like large circles).

Now in order to train this network I had to download 1000 or so captchas and solve them manually, so I had written a quick program that allows a person to solve captchas. So I figured "let's see how good it is", and used the program to test myself (I figured, worst case I have some more raw data).

Turns out the neural net was 2% better than I was. It actually scored 2% better on captcha solving than I did, and this was when I was trying really hard, pausing when there's doubt and giving it 5 seconds thought. I was a bit surprised that trying really hard or just going with my first guess didn't make more than a percent difference.

Then I got someone who had nothing to do with IT to solve 50 captchas and calculated ... precision was 8% below the neural net (also known as she'd do about 1 in 6 wrong, whereas I only had one in 10-12 or so. The neural only had one in 14-15 wrong. It was a pretty sizable difference).

And lastly I figured, this is not possible. I'm somehow messing this up. After all, I should have made that same 8%+ errors on the original set of 1000 captchas I downloaded, did the network really learn to read better than I can, while being fed close to 10% falsehoods* ? So I generated the set where it consistently errors. And ... Yep. It did. About 40% of those was just me doing the ground truth wrong. I fixed them, but it only improved the CNNs performance slightly.

* of course of those 10% wrong would still almost always have 4 out of 5 letters correct

EtDybNuvCu
I feel like users should be compensated for giving Google their time and mental energy since this is literally making users manually tag their machine-learning corpus for free.
mfgs
If there wasn't a cost associated with completing captchas then wouldn't it defeat the purpose?
candiodari
Except machine learning algorithms are already better at every last captcha task it comes up with.

I doubt they get much value out of it.

Buge
Isn't the compensation being able to view and use the website?

If you're not getting anything valuable by completing the captcha, then don't do it.

IIAOPSW
your being paid with website content.

Take your pick. Do you want to pay with:

1) a few seconds of your mental energy

2) your privacy

3) background cryptocurrency mining

Thats just how the internet work. Content must be paid for somehow.

EtDybNuvCu
Let us analyze this further. Suppose that Alice browses 4chan. 4chan uses a Google image-classifying captcha. Alice is viewing user-generated content posted for free, including 4chan advertisements. When Alice chooses to post, she completes a captcha for Google and her post is sent to 4chan.

Who gets paid for 4chan's content? Alice does not profit. 4chan's users do not profit. 4chan 'profits' via advertising revenue but they actually operate at a loss.

Who is left to get paid? Google. Google gets a human-completed captcha for its machine-learning projects.

I look forward to the demise of the Web and its replacement with a content-addressed alternative where your views will be technologically obsolete.

yorwba
Alice profits by not having her post be drowned out by a flood of automated spam. 4chan's users benefit by not having to look for interesting content among a flood of automated spam. 4chan's operators benefit by not having to moderate a flood of automated spam.

If captchas weren't useful to 4chan, they wouldn't make use of it. When you see a system that's obviously not beneficial to anyone, it's useful to take a step back and think about the incentives that keep things as they are. Because if it's so obviously wrong, why hasn't it been changed yet?

I'm not sure how a content-addressed web would handle these problems, but there'll need to be some hoops to jump through before a significant number of other people become willing to look at your content.

stordoff
The PlayStation store, where literally the point is to pay for content, puts a reCaptcha on log in, and (for me) it quite often triggers the "Please flag all images containing cars/signs/shop fronts" version.

Edit: fixed typo/missing word

kerkeslager
> Thats just how the internet work. Content must be paid for somehow.

Not true. Some (most?) of the best content on the internet is there completely for free, because people believe in it and put it out there out of their own pocket.

The internet is worse, because of the content that is paid for by the methods you describe, than the internet of the 90s when content was there by the methods I just described.

I do think there's some value in paying artists and writers for their work, with, you know, money, because those people have to eat too. But I refuse to buy into this narrative where "I pay you for content and you give me content" (a consensual transaction) is the same as "I request content from you and you pretend to give it to me for free while secretly violating my privacy, stealing my attention, and otherwise screwing me without my consent". These are not the same thing.

This is not "just how the internet works", it's how a few people have decided they want the internet to work because it's profitable to them, and we don't have to put up with it.

brownbat
> The internet is worse, because of the content that is paid for by the methods you describe, than the internet of the 90s...

Yeah, when people comment that online content must be subsidized, I always find it a little ironic. They're contributing information and analysis to a discussion. Who's paying the subscription fee?

How could such a lifestyle, where they give away their mental energy for free, possibly be sustainable?

The frequent rebuttal of "but my comments are low quality and barely worth anyone's time!" is fun too.

gsich
this
hyperdimension
I always feel dirty when completing them, for this exact reason. Most places I'll just leave, unless I absolutely must reach the website.

The way I see it, you have two options:

- Allow Google to extensively track all your activity online, or

- Do free work for Google

And Google's convinced many webmasters to embed their recaptcha, so no matter where I go I have to give something up to Google. At least I felt better about it when I was helping OCR books.

drewg123
I miss the days when you just had to help them digitally encode writing. Now I'm in the same "is the pole part of the sign" situation as this guy all the time.

Does anybody know if captcha's are triggered or are made harder by ublock? It seems to me like captchas are easier on my work chrome profile, which does not have any extensions since I only use it for internal sites.

rushsteve1
Not only that but I am almost certain that they are intentionally more difficult on "shady" or lower traffic websites.
rcthompson
I think they are made more difficult by when Google observes "bot-like" activity coming from you. When they started the "I'm not a robot" checkbox, I used to be able to just check the box and it would let me through, every time. Then, one day I had to download several dozen files, and each download had one of these checkboxes. The first 10 or so files, it let me through like it always did. Then it started giving me one CAPCHA screen for the next 10 files. Then 2 screens. Then 3, and so on. That was several years ago, and ever since then it always gives me a long series of difficult CAPCHAs every single time. I guess Google still thinks I'm a bot even after several years of solving these things pretty consistently (I'd say I have about 90% success rate).

For the record, I do tag boxes containing only sign posts, or only tiny slivers of signs.

xirdstl
I find myself intentionally slowing down when completing forms in order to not appear too bot-like and be forced to complete a captcha.
beardog
Part of the reason these are so difficult is that it will intentionally fail you sometimes even when you're right to prevent malicious bots from learning off of it, also a decent number of them are not fully solved by Google (they show you both ones they know and ones they don't).

Basically its meant to be somewhat confusing.

bradknowles
Many sites do a canvas fingerprint, and send you to the captcha if the fingerprint fails.

And then it’s captcha hell.

droidist2
A fingerprint? Like this thing?

https://panopticlick.eff.org

akkat
Wait until you go abroad and instead of saying "find the street signs" it says something in Japanese with no way of changing the language.
birksherty
I get these all the time when using VPN/proxy. These are basically harassment for users.

They replaced the text captcha with these. Google knows that these stops humans from going in. But, they don't fix it because they need to harvest data. So, Google is evil. There is no doubt on that.

extralego
These Google captchas were much easier about 1 year ago. These days, I stop and ask myself “Do I really need to get past this guard?”.
mcochrane
I think it depends a lot on your IP too. I tried to do some google captchas in TOR browser once and it took forever! Just like this video, even when you got it right it still kept asking for more..
Baeocystin
Happens relatively frequently when using a VPN. If, for whatever reason, the IP I'm using is currently getting the captcha-hell treatment, shifting to a different exit address clears it up 95%+ of the time.
tziki
Someone should create a neural net that does this for you.
crispinb
High-tech isn’t always the best approach. Someone should just bash the dozy fuckers who force these idiotic things on us.
YaxelPerez
And train it by making people identify the correct solutions to get access to a website
digi_owl
Heh, i recall claims that hackers where using Amazon's Mechanical Turk service as a means of defeating captchas their botnets encountered.
AndrewOMartin
And what do you think Amazon were doing with their ever growing corpus of tasks and verified-correct answers? :)
amelius
I recall that hackers were sending captchas to porn sites. "Want to see this picture, then solve this captcha." And then the answers were redirected back to the original, legitimate site.
droidist2
True, sexual drive can be a great motivator. Porn users are tolerant of a lot (popup ads, etc.)
gustavmarwin
The Chrome plugin Privacy Badger (https://chrome.google.com/webstore/detail/privacy-pass/ajhmf...) is supposed to be useful to avoid those.

I haven't been able to reach a conclusion on whether it is effective, anyone uses this?

21
So, are you supposed to click on the sign support sticks?

What about squares which have just a sliver of the sign?

ggm
Yes. This. And, is it a storefront when it's a lemonade stand outside a house? Is it a road sign if it's a for sale sign? Is it a road when it's the road into a carpark?

Epistemology robot!!!

lstamour
Technically the sign support sticks aren’t signs. And some of the signs clicked on weren’t street signs, if you identify street signs as those put up by a government agency to regulate or assist with driving on a street. Also, he clicked a hotel, which isn’t a storefront. When Google says storefront they mean a multi-tenant store facing a street with a big sign indicating the store’s name or purpose. Sometimes these locations are closed, but they rarely look like hotels, houses or apartments. I have also had to pass, twice, by saying nothing was relevant and not clicking anything. I’ve also had it show me streets with cars and had it ask me to select storefronts... (Trick questions, maybe to see if I click wrong then change my mind?) And I’m much more likely to get it wrong on my iPhone than on a computer, perhaps Google doesn’t have as much mouse movement data that way, and can’t fingerprint an iPhone as uniquely?
caspervonb
Hard on humans, easy on bots.
hippich
We are working on hashcash.io - replacement for captcha. no more frustration to guess what google meant, just computer proof of work and you are good to go
fractallyte
Looks interesting, and fairly attractive presentation... but how does it work? This is essential to describe, but are no details about that on your front page!

Also, your text needs some work; there are grammatical errors and some phrases could be improved. (I can help with this, if you want.)

jraph
Please, do not drain my batteries, waste resources of my planet and make me (or my employer, or my coffee shop, or my train company, or my library) pay electricity to prove I'm not a robot.

Please, do not make Internet an energy hog more than it already is.

hippich
Eventually you will have full control over it (we haven't build it out,) and if you dislike it for any reason - you just do not participate.
lomereiter
Your last statement theoretically could be addressed by tweaking the algorithm settings such that it consumes at most [the time it takes for a human to solve reCaptcha] x [average power consumption of a smartphone]. Not sure if it would still hold its purpose, though.
sbov
I used to never get these back when I used Chrome. I started getting them once I switched to Firefox. Is this a thing? or was it coincidence?
fractallyte
So at the end, when he says despondently, "I'm not a robot!"...

Paranoid fantasy: it's a PhilipKDick/BladeRunner thing. You see, actually he is a robot (as, apparently, are most of us), and this test is designed to prove it. It's a precursor to the Voight-Kampff machine...

ggm
I suspect the bad guys using captcha won't go to the good low barrier captcha because there's no microcent transaction in it. It's a Gresham's law case: bad captcha with revenue drives out good captcha with low overhead
None
None
rajeshpant
Hopefully more sites will upgrade to the new Invisible reCaptcha by Google. I am using them on my sites and it is certainly less annoying and the user experience is better.
philo23
Interesting side note, when Google isn't completely convinced you're not a robot, it'll actually swap the invisible captcha out for these photo picker ones.
nmridul
Whenever I see this, I intentionally select incorrect boxes. Let the AI eat those incorrect data. If sufficient people do this, these AI free loading stuff will stop.
DINKDINK
My hope is that in the future lightning network micro payments will be a better alternative than captchas for rate limiting page access.
None
None
None
None
milankragujevic
cool seeing taran from ltt on hn...
baxtr
Oh I hate these so much.
None
None
jstanley
It used to be some benign OCR task, now it's "please identify street signs from photographs we've taken". Once they can correctly classify street signs, what else will they want? "please click on the faces you recognise?"? "please click on the faces of your friends that are labelled with the correct names?"?

I've started opting out of Google Captcha's wherever doing so isn't too burdensome. If I see a Google Captcha and I don't really want to get to the other side, I just close the tab and forget about it, rather than subject myself to their capricious gatekeeping.

userbinator
Some people I know online have basically started buying "captcha tokens" --- the same ones that spammers use when they want to bypass them in bulk. Depending on your morals (keeping in mind these services are mainly third-world-country sweatshop labour), it might be a better choice than spending your own brainpower on such things, and a way of giving Google a virtual middle finger.
mikeash
Doesn’t Google get what they want either way?
userbinator
They get solved captchas, but they're not using the user's brainpower and time to do it.
andrewchambers
All google cares about is getting the captchas solved to train their AI.
NullPrefix
Cut the middle man out, they could hire the sweatshop directly.
iakov
The sweatshop wants money, millions of users solve captchas daily, for free. Clever.
HN Theater is an independent project and is not operated by Y Combinator or any of the video hosting platforms linked to on this site.
~ yaj@
;laksdfhjdhksalkfj more things
yahnd.com ~ Privacy Policy ~
Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum.