He answers that specifically at 8:15. You're given 2 words; 1 is the OCR mistake/unknown, the other is legit/known. Then it takes the results of 10 or so users who correctly guessed the known word and takes their answer for the unknown word for granted, seeing as they all entered the same result.
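For anyone who wants that spelled out, here is a rough sketch of the idea in Python. It is not the actual reCAPTCHA code: the function names, the exact-match check on the known word, and the rule "accept once 10 verified users agree" are all my own assumptions, with the threshold taken from the "10 or so" figure above.

```python
from collections import Counter


def passes_check(known_word, typed_known):
    # Hypothetical check: the user passes if the known (control) word matches.
    return typed_known.strip().lower() == known_word.lower()


def resolve_unknown(submissions, known_word, min_votes=10):
    """Return the agreed reading of the unknown word, or None if undecided.

    submissions: list of (typed_known, typed_unknown) pairs.
    min_votes:   assumed threshold, based on "10 or so users".
    """
    # Only count answers from users who got the known word right.
    votes = Counter(
        typed_unknown.strip().lower()
        for typed_known, typed_unknown in submissions
        if passes_check(known_word, typed_known)
    )
    if not votes:
        return None
    word, count = votes.most_common(1)[0]
    # Trust the most common answer only once enough verified users agree on it.
    return word if count >= min_votes else None
```

With ten submissions of ("morning", "upon") and a few junk entries from people who got "morning" wrong, resolve_unknown(submissions, "morning") gives "upon"; with only three matching submissions it gives None.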
This new #CAPTCHA / #ReCAPTCHA is a fantastic symbiosis of technologies! Now people in developing countries are digitising books and newspapers! Oh the rest of us are helping, but they're doing it for a living!
Really amazing: started with a great idea, identified the issues and made it better in a 2nd, unrelated field! Security and digitizing books, who would have thought of that!
@josealb2 Nobody is being forced to do anything. Web admins are free to not use CAPTCHA if they so choose, and users can just opt not to use a site with a CAPTCHA.
@jimmayl1 He said in another interview that they found out that it takes the same time to type two English words as a set of 6-8 random characters, because obviously you concentrate longer to get the random characters right, and you can type the two words faster. So you are spending the same average of 10 seconds while contributing to the digitization.
Not clear how to know which words can't be read by the OCR? He mentions something about a 'view' that tells you that? Can someone explain better? Thanks. Excellent presentation...
I've noticed that the new recaptcha is actually a little bit smarter about entering bullshit on the to-be-digitized word o.o There are some points where I will enter garbage on the digitized word, and the security word correctly, and it denies the answer. I'm still annoyed by these. But I am glad they exist! :D
... basically all the work done is with the English language. I wonder about expanding the possibilities of this project, i.e. more languages. For example, the problem with the random system, according to Mr. Luis, is that the word pairs are sometimes embarrassing (he gave "bad Christians" as an example) - but not if you put together two words coming from different languages. This would help the diversity of cultures survive as well (hope he is reading this).
Brilliant! Deserves an award. How about other applications, like a voluntary CAPTCHA site? Make it an online GAME! If a million people did ten captchas, that would be the same as ten million doing one. Kind of like the SETI@home thing, or pooled computing.
Can you imagine in the future where androids might be walking around among us, we might have to supply CAPTCHAs IRL, e.g. if we walk into a bank or even on a first date lol.
except the internet is pretty much globally expected to be in English with very few exceptions. The global language is now becoming English, and expecting people to know other alphabets or even be able to type them on their keyboard is going too far in my opinion. Even languages that use the same alphabet have strange nuances not found in English, things like üíóâàè, how do you expect people who are willing to wait when it says "wait" to be able to type such characters?
@GFAethna As long as people get educated on this matter then they will know to all substitute the right swear word where possible, use less common ones to pass the filters and it's all good.
@TheCanadianLemur No, I'm actually using my common sense. Computers can operate on you (robotic hysterectomies at the local hospital), but they can't read wavy letters? Please. They really do think we're idiots. They wouldn't go to the trouble of thinking up this splendid lie if it weren't for a reason.
This is a great idea in theory, but in reality, people will just fuck up the system by substituting one of the words (usually the shorter one) with things such as swears, racial slurs, etc.
@1robinsong2 No, you just don't understand how computers work. It's simple really, computers can't understand human language; that's why programming languages were invented in the first place.
Wasn't very smart of him to put this information out to the public. Now every time I am asked to fill a re-captcha I always put in "penis", "cock" or "vagina" to hopefully one day read one of these books that have replaced the word "and" with one of the above.
I do like the fact that you are using captchas to digitize text. However, saying that computers cannot do it, or cannot do it well, is patently false. Morphological regularization neural networks make short work of captchas. Using a feature set of around 20 will make any captcha I have ever seen its bitch. However, I doubt that many spammers have graduate degrees in engineering. They are usually script kiddies.
So basically you're forcing people to spend their time digitizing a word for you with no positive impact on security. Clever, but I don't think that's such a great feat. It would be interesting if the results were also used to improve OCR algorithms.
So instead of wasting 10s on some random characters, you put in 10s on one word to digitize and 10s for a known word? Doesn't sound like he's solved anything as soon as he had to add a second word.
Yeah, changed the world, and made it fucking worse. Jesus fuck, now I have to spend more time writing captchas than filling in information. It is bad and should not have been there; it should only show up for bots or the wrong people, not the good guy. Shit is bad.
Pure genius. So incredibly smart yet so simple at the same time. Love it!
This is why nerds RULE. This was so informative and I'll pass it on.
And barely anyone will know they are being used, so who cares.
This guy is awesome.
very good. everything he presented fit together very nicely
@cshcrazy As I heard it, you can't know which one of them is the 'first' (known) and which is the other (to be digitized).
are you alive
It's not a scheme if you're making the world a better place.
Where are these digitized books posted, and do they somehow have adverts next to them? That would mean we make somebody richer :D
I was wondering why I had one that said, "$10,000-$9,000 hike"
Who knows, maybe that was in a book.
Gosh, that's neat.
Funny starting point. "I invented that" then having to explain why it's essential.
I *LOVE* smart ideas!
Love his voice.
Really wonderful.
very cool, charming guy too
Wow! This was fascinating! Amazing use of the technology.
@nibbo1954 Yes!
How does Google do this for Google Books?
What a brilliant man
I met him. I was chosen at school to go and interview him.
That's pretty cool.
Now if only they could fix the audio recaptchas so it's possible to understand what to type
1. But now it takes them twice as long to do the captcha.
2. This has been around for years now.
Good work though ;)
now ... i love (re)captchas !!!
It's not always "check word" "book word". Secondly they take an average of 10 people (or whatever number)
@oipaloi if that were true, programs to do OCR would never fail.
Great idea!
right on dude
I heard someone could decipher the reCaptchas,
this man is a genius!!
@ReggaeChapin76 You're right. :D
Brilliant. Thank you.
@gregor00005 If they already knew how to tell a "bot" from a "good guy" then they wouldn't need this technology.
Proud that you are Guatemalan :) and very smart btw.
Very cool....
Intriguing. Thanks reddit.
@ComplexStudios he's digitizing old books and giving them out for free, how is he making money?
What's the intro song? I have heard it freakin everywhere but nobody seems to know what it is
that is amazing
lately i've been hearing/reading people saying that the NY times sucks, and it's not what it used to be
i think i just listened to someone talk about captchas,
How do you know which is which?
superb
that was a good idea
does anyone know the name of the song playing at the beginning of this??
Interesting
wow that's cool
What a great way to get a banana.
so does youtube use reCaptcha now or is it still on the old one?
For free??? NOOOO!
who knew signing up to porn websites contributed to society!
there's 666 likes.
ANTICAPTCHAHAHAAHAAA
He made a joke. Lighten up.
Bad Christians. LOL
i hate captcha!