Skip to main content

CAPTCHA : A story of old books, traffic lights and self driving cars




Before we talk more about CAPTCHA, we need to talk quickly about Turing Tests. Long before Artifical Intelligence was the cool thing to work on, Alan Turing the great british mathematician deviced a test to tell a computer apart from a human. As part of the Turing Test, a human interogator asks the same question to another human and a computer. The interogator does not know which one is the human and which is the computer. Based on the responses from the human vs the computer, the interogator has to guess which response is from the human. If the computer is able to fool the interogator into beleiving that it is the human for more than 50% of the responses then it is considered that the machine has acheived AI. So, considering that we are in a world where we have not acheived AI which can match human intelligence, a machine or computer would fail at a task that humans can do easily. CAPTCHA uses this philosophy at it's core. 



CAPTCHA is a software test that validates if the subject executing it is a human being or a machine. A test pass corresponds to a successful authentication for a human being. A test fail is an authentication failure and prevents the machine from exploiting a protected resource such as account sign ups, server time etc. 

In the early 2000s when I was a college student, I remember the websites used to be filled with spam comments of all kinds. These were the days when blogging was the cool thing to do and Orkut was the social media of choice. There was no facebook yet. Rogue programs could be written to spam blogs with comments, perform numerous sign ups and in the worst case engage in unsophisticated Denial of Service attacks bringing down entire websites. Website or any front end interface is meant for human use. But when machines can mimic humans actions, it can be exploited for nefarious purposes. Using a free automation tool like Selenium Webdriver one can easily build a script in 5 mins to create actions on a web client. 


CAPTCHA prevented spam attacks in a novel way. Present a squiggly image of a word or a phrase to the subject and challenge them to type it out. The task is simple for a human but near impossible for a machine. Thats sublime and brilliant! Completely Automated Public Turing Test To Tell Computers and Humans Apart. CAPTCHA. Luis Von Ahn and his team coined this term in 2000 while at Carnegie Mellon University. Von Ahn turned out to be a pioneer of crowdsourcing software and went on to found the language learning platform Duolingo. 
The Captcha technology was used to translate and digitize thousands of old books in a very smart way. A standard CAPTCHA challenge was accompanied with a picture of a word or phrase from an old book and presented to users. This way thousands of old books were digitized by Google. And we helped do that unknowingly. That's a great use of hundreds of thousands of hours of human effort that was wasted in solving CAPTCHA challenges. This technology is called re-CAPTCHA and it is owned by Google. 

Traffic lights and Self driving Cars 
The latest CAPTCHA challenges that we see these days ask us to identify traffic lights and cars in an urban image. Now why always traffic lights and cars? 

Turns out Waymo, google's self driving car company is training it's AI models to better identify traffic lights and cars on the road. And like digitizing books in the past, we are helping Google make self driving cars a reality in the future. 

No CAPTCHA re-CAPTCHA 
Another type of CAPTCHA that Google has been using is the no captcha. Google knows our online behavior and based on that it generates a score between 0.0 to 1.0 that tells the likelihood of the user being a bot. As a developer, depending on the score returned, you can decide whether to present a challenge or not. 

The "I am not a robot" checkbox is a variation of the no-captcha as well in which Google asks you to click on a checkbox as a challenge. Google uses your move movement attributes to determine if you are a machine or a human.  

Breaking CAPTCHA
There are shops with humans sitting in front of computers to perform fake sign ups, likes, reviews etc. It can be a challenge for technologies to catch these because essentially these are not computers but humans interacting. The artificial intelligence and image recognition technology have seen amazing advancements recently which has made some of the traditional CAPTCHA ineffective or vulnerable. The challenge presented would keep changing as modern machine become more and more efficient at imitating humans. 

Popular posts from this blog

The $460 Million Mistake That Crashed a Wall Street Giant—and What we can learn from it

  On the morning of August 1, 2012, Knight Capital - then one of the biggest market maker on Wall Street - deployed new code to its high-speed trading system—but one of eight production servers never got the update. That lone machine started running an old, dormant module called “Power Peg,” flooding the market with errant trades. In just 45 minutes, Knight amassed nearly $7 billion in accidental positions and lost $460 million. It was one of the most expensive software failures in Wall Street history—driven by a rushed deployment, missing checks, legacy code left behind, and no clear plan to roll back. This is the story of how a routine release turned into a company-ending event—and what leaders today can learn from it. Founded in 1995, the Knight Capital Group was the largest market maker in US equities. Knight’s electronic trading group covered more than 19000 securities and it’s high frequency trading algorithms processed a daily trading volume of $20 billions which was 15% of ...

Why Product Quality will be the Ultimate Competitive edge in the AI Era

  We live in the zeitgeist of artificial intelligence — where software can be built faster and cheaper than ever before. AI has leveled the playing field, turning anyone with an idea into a potential creator. But in a world where speed and cost are no longer the constraints,  what will truly separate the winners from the rest? The classic development trade off — good, fast, or cheap — has long forced teams to pick two at the expense of the third. AI is changing this by enabling development that is both fast and cheap. The pressing question becomes: can AI help us achieve software that is also  good ? From idea to prototype, AI empowers teams to deliver solutions quickly, affordably, and with surprisingly high quality. Moving from prototype to Minimum Viable Product (MVP), it’s still possible to maintain the trifecta of Good, Fast and Cheap. However, when aiming for production-ready applications, the stakes rise. Reliability, security, maintainability, and scalability beco...

From Code to Customer: Measuring Software Quality Before Release

  “When a metric becomes a target, it ceases to be a good metric.” -  economist Charles Goodhart I feel that every discussion about metrics should mandatorily start with the above word of caution from Goodhart’s law. Metrics should inform decisions, not drive behavior in isolation. Without context, they can easily be gamed or give a false sense of confidence. Alright! With that disclaimer out of the way, lets talk about Quality Metrics for production readiness. What you’ll find here comes from the trenches — lessons from things that worked, things that didn’t, ideas that sounded smart but fell apart at scale, and blind spots I didn’t see until they hit production. I’ve owned software quality across industries like e-commerce, fintech, streaming, and SaaS — in startups, scaleups, and big tech. Your context may vary, but these insights should hit home for most teams shipping software. Why Should We Even Measure Quality? I believe there are three reasons to measure software quali...

Time Machine : Pens

Time machines exist. In fact, we all have a few in our lives. Some tucked away in old boxes, some lying in plain sight but ignored and some in forgotten memories. Most of the times, we bump across them while looking for something else. But when we find them, they take us back years in the past and invoke a deep sense of nostalgia. You either smile, laugh or feel sad. And sometimes, all of them at the same time. Pens. I had seen my father always carry one in his pocket without fail. After all, you never knew when you had to fill in one of those numerous paper forms that you get handed at in most Indian establishments — the railways reservation form, the cash withdrawal bank slip or gas cylinder delivery form. With the digitization of most things around us, this ancient writing instrument lost its meaning in our lives pretty fast. Today, we don’t care much about the pen we use. As long as it writes, we are good. We don’t really have a persona...

Chick-Fil-A One : A crispy mobile app ordering experience

Chick-Fil-A or Popeyes or Shake Shack. The jury is still out on who serves the best chicken sandwich but I am declaring a winner on who has the best mobile app experience. And the champion is Chick-Fil-A.  Our phones are inundated with numerous apps. But you hardly come across one that enhances your experience with the product and impresses you so much that you stop for a minute and appreciate the excellent work done the team who built it. Chick-Fil-A One, the fast food restaurant chain's mobile ordering app did it for me. Chick-Fil-A is an intriguing company. Started 75 years ago it  is still a family owned business, one of the largest restaurant chains in the US, operates more than 2600 locations, generating revenue of more than $10 billion. They have the highest sale per restaurant unit easily beating giants such as McDonalds, Taco Bell and Subway. Oh and they do this while being mandatorily closed on Sundays. Quite a remarkable feat! How do they do it? The answer...