Skip to main content

CAPTCHA : A story of old books, traffic lights and self driving cars




Before we talk more about CAPTCHA, we need to talk quickly about Turing Tests. Long before Artifical Intelligence was the cool thing to work on, Alan Turing the great british mathematician deviced a test to tell a computer apart from a human. As part of the Turing Test, a human interogator asks the same question to another human and a computer. The interogator does not know which one is the human and which is the computer. Based on the responses from the human vs the computer, the interogator has to guess which response is from the human. If the computer is able to fool the interogator into beleiving that it is the human for more than 50% of the responses then it is considered that the machine has acheived AI. So, considering that we are in a world where we have not acheived AI which can match human intelligence, a machine or computer would fail at a task that humans can do easily. CAPTCHA uses this philosophy at it's core. 



CAPTCHA is a software test that validates if the subject executing it is a human being or a machine. A test pass corresponds to a successful authentication for a human being. A test fail is an authentication failure and prevents the machine from exploiting a protected resource such as account sign ups, server time etc. 

In the early 2000s when I was a college student, I remember the websites used to be filled with spam comments of all kinds. These were the days when blogging was the cool thing to do and Orkut was the social media of choice. There was no facebook yet. Rogue programs could be written to spam blogs with comments, perform numerous sign ups and in the worst case engage in unsophisticated Denial of Service attacks bringing down entire websites. Website or any front end interface is meant for human use. But when machines can mimic humans actions, it can be exploited for nefarious purposes. Using a free automation tool like Selenium Webdriver one can easily build a script in 5 mins to create actions on a web client. 


CAPTCHA prevented spam attacks in a novel way. Present a squiggly image of a word or a phrase to the subject and challenge them to type it out. The task is simple for a human but near impossible for a machine. Thats sublime and brilliant! Completely Automated Public Turing Test To Tell Computers and Humans Apart. CAPTCHA. Luis Von Ahn and his team coined this term in 2000 while at Carnegie Mellon University. Von Ahn turned out to be a pioneer of crowdsourcing software and went on to found the language learning platform Duolingo. 
The Captcha technology was used to translate and digitize thousands of old books in a very smart way. A standard CAPTCHA challenge was accompanied with a picture of a word or phrase from an old book and presented to users. This way thousands of old books were digitized by Google. And we helped do that unknowingly. That's a great use of hundreds of thousands of hours of human effort that was wasted in solving CAPTCHA challenges. This technology is called re-CAPTCHA and it is owned by Google. 

Traffic lights and Self driving Cars 
The latest CAPTCHA challenges that we see these days ask us to identify traffic lights and cars in an urban image. Now why always traffic lights and cars? 

Turns out Waymo, google's self driving car company is training it's AI models to better identify traffic lights and cars on the road. And like digitizing books in the past, we are helping Google make self driving cars a reality in the future. 

No CAPTCHA re-CAPTCHA 
Another type of CAPTCHA that Google has been using is the no captcha. Google knows our online behavior and based on that it generates a score between 0.0 to 1.0 that tells the likelihood of the user being a bot. As a developer, depending on the score returned, you can decide whether to present a challenge or not. 

The "I am not a robot" checkbox is a variation of the no-captcha as well in which Google asks you to click on a checkbox as a challenge. Google uses your move movement attributes to determine if you are a machine or a human.  

Breaking CAPTCHA
There are shops with humans sitting in front of computers to perform fake sign ups, likes, reviews etc. It can be a challenge for technologies to catch these because essentially these are not computers but humans interacting. The artificial intelligence and image recognition technology have seen amazing advancements recently which has made some of the traditional CAPTCHA ineffective or vulnerable. The challenge presented would keep changing as modern machine become more and more efficient at imitating humans. 

Popular posts from this blog

9 tips to build a resume that will get you calls from Google, Facebook and Amazon

I have a sweet tooth. Whenever I am at a grocery store, I love to surf the candy aisle. Specifically the shelves with the chocolates. And the selection of chocolates is vast. There are literally hundreds of options. So many brands, flavors, sizes to choose from. But some chocolates would stand out in the crowd. And what makes some stand out amongst the others is the packaging. Out of the hundreds of selections on offer, I pick up only a few to review and amongst them only one gets bought.  (Side note: The Endangered Species Chocolate company has a great product packaging and story. Try it out if you haven’t.) If not with chocolates, you would have experienced a similar situation while purchasing something else. Maybe a book, a bottle of wine or a bag of chips. And every time you are looking to discover something new, the first thing that draws you to the product is its packaging. Now, why am I talking about product packaging in a post about building great resumes? Well, if you rea...

Why Product Quality will be the Ultimate Competitive edge in the AI Era

  We live in the zeitgeist of artificial intelligence — where software can be built faster and cheaper than ever before. AI has leveled the playing field, turning anyone with an idea into a potential creator. But in a world where speed and cost are no longer the constraints,  what will truly separate the winners from the rest? The classic development trade off — good, fast, or cheap — has long forced teams to pick two at the expense of the third. AI is changing this by enabling development that is both fast and cheap. The pressing question becomes: can AI help us achieve software that is also  good ? From idea to prototype, AI empowers teams to deliver solutions quickly, affordably, and with surprisingly high quality. Moving from prototype to Minimum Viable Product (MVP), it’s still possible to maintain the trifecta of Good, Fast and Cheap. However, when aiming for production-ready applications, the stakes rise. Reliability, security, maintainability, and scalability beco...

The $460 Million Mistake That Crashed a Wall Street Giant—and What we can learn from it

  On the morning of August 1, 2012, Knight Capital - then one of the biggest market maker on Wall Street - deployed new code to its high-speed trading system—but one of eight production servers never got the update. That lone machine started running an old, dormant module called “Power Peg,” flooding the market with errant trades. In just 45 minutes, Knight amassed nearly $7 billion in accidental positions and lost $460 million. It was one of the most expensive software failures in Wall Street history—driven by a rushed deployment, missing checks, legacy code left behind, and no clear plan to roll back. This is the story of how a routine release turned into a company-ending event—and what leaders today can learn from it. Founded in 1995, the Knight Capital Group was the largest market maker in US equities. Knight’s electronic trading group covered more than 19000 securities and it’s high frequency trading algorithms processed a daily trading volume of $20 billions which was 15% of ...

Chick-Fil-A One : A crispy mobile app ordering experience

Chick-Fil-A or Popeyes or Shake Shack. The jury is still out on who serves the best chicken sandwich but I am declaring a winner on who has the best mobile app experience. And the champion is Chick-Fil-A.  Our phones are inundated with numerous apps. But you hardly come across one that enhances your experience with the product and impresses you so much that you stop for a minute and appreciate the excellent work done the team who built it. Chick-Fil-A One, the fast food restaurant chain's mobile ordering app did it for me. Chick-Fil-A is an intriguing company. Started 75 years ago it  is still a family owned business, one of the largest restaurant chains in the US, operates more than 2600 locations, generating revenue of more than $10 billion. They have the highest sale per restaurant unit easily beating giants such as McDonalds, Taco Bell and Subway. Oh and they do this while being mandatorily closed on Sundays. Quite a remarkable feat! How do they do it? The answer...

Ep 9 : Who will win the Streaming Wars? The fight for content, cost and convenience

 

From Code to Customer: Measuring Software Quality Before Release

  “When a metric becomes a target, it ceases to be a good metric.” -  economist Charles Goodhart I feel that every discussion about metrics should mandatorily start with the above word of caution from Goodhart’s law. Metrics should inform decisions, not drive behavior in isolation. Without context, they can easily be gamed or give a false sense of confidence. Alright! With that disclaimer out of the way, lets talk about Quality Metrics for production readiness. What you’ll find here comes from the trenches — lessons from things that worked, things that didn’t, ideas that sounded smart but fell apart at scale, and blind spots I didn’t see until they hit production. I’ve owned software quality across industries like e-commerce, fintech, streaming, and SaaS — in startups, scaleups, and big tech. Your context may vary, but these insights should hit home for most teams shipping software. Why Should We Even Measure Quality? I believe there are three reasons to measure software quali...