CrowdStrike Sends Out Flawed Software Update That Causes Worldwide Outage

Author

Reads 480

Group of professionals wearing headsets working at computers in an office setting, promoting collaboration.
Credit: pexels.com, Group of professionals wearing headsets working at computers in an office setting, promoting collaboration.

CrowdStrike, a leading cybersecurity firm, sent out a flawed software update that caused a worldwide outage. The update was rolled out on a Tuesday morning, affecting customers across the globe.

The issue was first reported by users on social media, with many experiencing difficulties accessing their systems and data. The outages were widespread, with reports coming in from multiple continents.

The software update was intended to fix a critical vulnerability, but it ended up introducing a new bug that caused the outage. The company's systems were unable to handle the sudden change, leading to a cascade of errors.

The outage lasted for several hours, causing significant disruptions to businesses and individuals who rely on CrowdStrike's services.

Worth a look: Patelco Is down

What Happened

A perfect storm of tech failures caused global chaos, disrupting essential services worldwide. The incident began with an unrelated Microsoft Azure outage on Thursday night.

A faulty software update was released by CrowdStrike, a cybersecurity company, which caused the global IT outage. The update was meant for CrowdStrike’s Falcon antivirus platform.

See what others are reading: Bofa System down

Credit: youtube.com, CrowdStrike Software Update Causes Global Windows Outage | Impact on Businesses & Airports

The update backfired spectacularly, sending Windows computers into an endless reboot cycle. This was due to a flawed configuration file pushed to Falcon.

The root cause of the issue was a flawed configuration file that altered the driver's functionality. Importantly, this wasn’t a kernel driver update as initially suspected.

The impact was unprecedented and far-reaching, affecting airports, media, banks, hospitals, and other organizations. Airports saw massive queues form as major airlines temporarily grounded flights.

The update was designed to improve detection of a new hacking method but instead triggered a logic error. This led to the crash of operating systems globally.

CrowdStrike CEO George Kurtz apologized for the disruption, emphasizing that the issue wasn’t due to a cyberattack. He explained that while they’ve identified and fixed the problem, recovery might take time as affected machines need manual intervention to reboot.

The issue was caused by a bug in CrowdStrike's automated content validator tool, which cleared a template containing "problematic" content data. This led to an out-of-bound memory condition that triggered Microsoft systems to crash.

A boot loop occurred, causing Windows devices to restart without warning during their startup process. This meant the machines couldn’t finish a complete and stable boot cycle.

Hundreds of Microsoft engineers and software experts were deployed to work directly with customers on service restoration.

Consider reading: Windows Checkbook Software

Impact and Affected Parties

Credit: youtube.com, CrowdStrike Outage: Unraveling the Causes and Impact

The impact of the CrowdStrike outage was felt far and wide, affecting millions of people and businesses around the world.

Approximately 8.5 million Windows devices worldwide were impacted, making up less than 1% of the entire Windows estate.

The outage caused widespread disruptions, with shoppers in Australia reportedly unable to pay for groceries due to electronic payment systems being down.

Many businesses, including supermarkets, banks, airlines, telecommunications companies, and TV broadcasters, were affected by the outage.

A look at DownDetector shows a spike in outages across nearly all industries, with businesses such as Visa, Starbucks, Chase, TD Bank, Bank of America, Walmart, and UPS experiencing issues.

The outage was so widespread that it's difficult to understand its full scope, but it appears to be one of the largest outages in history.

Here are some of the notable businesses and organizations affected by the outage:

  • Airlines: American Airlines, Delta, KLM, Lufthansa, Ryanair, SAS, and United
  • Airports: Gatwick, Luton, Stansted, and Schiphol
  • Financial organizations: London Stock Exchange, Lloyds Bank, and Visa
  • Healthcare: Most GP surgeries and many independent pharmacies
  • Media organizations: MTV, VH1, Sky, and some BBC channels
  • Retailers, leisure, and hospitality organizations: Gail's Bakery, Ladbrokes, Morrisons, Tesco, and Sainsbury's
  • Sporting bodies: F1 teams Aston Martin Aramco, Mercedes AMG Petronas, and Williams Racing, and the Paris 2024 Organising Committee for the Olympic and Paralympic Games
  • Train operating companies: Avanti West Coast, Merseyrail, Southern, and Transport for Wales

Causes and Analysis

CrowdStrike's faulty channel file, rolled out to its Falcon sensor product, was the root cause of the worldwide outage.

Credit: youtube.com, CrowdStrike update causes global IT outage | NewsWhip in action

The issue was triggered by a cloud-delivered, rapid response update that contained "problematic" content data, which was cleared for delivery due to a bug in CrowdStrike's automated content validator tool.

A boot loop occurred, causing Windows devices to restart without warning, resulting in a situation where the machine couldn't finish a complete and stable boot cycle.

This was due to inadequate testing across various desktop and server environments, or a lack of proper sandboxing and rollback mechanisms for updates that involve a kernel-level interaction.

CrowdStrike's CEO, George Kurtz, acknowledged the issue, stating that it was not a security incident or cyber attack, but rather a defect found in a single content update for Windows hosts.

The fix has been deployed, and CrowdStrike is providing updates on its website and through official channels.

Bad updates can have a significant impact, as seen in this case, where a single flawed update caused a global outage affecting companies worldwide.

The fact that CrowdStrike's shares plummeted by over 20% in unofficial pre-market trading, translating to a staggering $16 billion loss in value, highlights the severity of the situation.

The outage has also raised concerns about the risks of deeply integrated cybersecurity software, which can have far-reaching consequences when a flaw is introduced.

Take a look at this: Is Citicards down

Response and Resolution

Credit: youtube.com, Risk & Repeat: Faulty CrowdStrike update causes global outage

CrowdStrike's CEO George Kurtz quickly acknowledged the issue, stating that it was not a security incident or cyber attack, but rather a defect in a single content update for Windows hosts. He assured customers that a fix had been deployed and that the company was working closely with Microsoft to resolve the issue.

The company's initial response was to roll back the changes to the affected product, but some hosts may still experience issues. System administrators can try to fix the problem themselves by following a few steps, including booting Windows into safe mode and deleting a specific file.

CrowdStrike customers can access more information by logging into the support portal. The company has also promised to provide full transparency on how the issue occurred and the steps being taken to prevent it from happening again.

Microsoft has been working extensively with CrowdStrike to automate the development and deployment of a fix. Hundreds of its engineers and software experts were deployed to work directly with customers on service restoration.

Credit: youtube.com, CrowdStrike- major player in cybersecurity released automatic update that caused major global outage

The fallout from the outage continues to be felt, with flights delayed or canceled, hospital systems affected, and businesses struggling to recover. Some of the affected companies include financial and payment services, cloud-based and online businesses, communications providers, and cloud services providers.

Here are the steps to fix the CrowdStrike Windows BSOD issue:

  1. Boot Windows into Safe Mode or the Windows Recovery Environment
  2. Navigate to the C:\Windows\System32\drivers\CrowdStrike directory
  3. Locate the file matching 'C-0000029*.sys', and delete it.
  4. Boot the host normally.

Aftermath and Prevention

The Crowdstrike outage was a stark reminder that even the most well-intentioned updates can go horribly wrong. Software makers need to step up their testing procedures to account for the increasing use of the cloud and diverse desktop and server environments.

Dror Kashti, co-founder and CEO of Sweet Security, emphasizes the importance of relying on solutions made for the cloud, such as sandboxing, safe-by-design languages, and non-destructive technologies. Kashti's words echo the need for adaptability in the face of rapidly changing technologies.

To avoid similar problems in the future, organizations should consider rolling out updates, especially those involving security software, in phases. This allows for testing in a sandbox environment or on a limited subset of machines representative of all operational configurations before full deployment.

Avoiding Similar Problems in the Future

Close-up of a computer screen displaying an authentication failed message.
Credit: pexels.com, Close-up of a computer screen displaying an authentication failed message.

The recent outage caused by a bad software update highlights the importance of thorough testing procedures. A phased approach to software updates, particularly for security solutions, can help prevent similar problems.

IT teams should test updates in a sandbox environment or on a limited subset of machines representative of all operational configurations before full deployment. This can help identify and fix issues before they affect critical systems.

System redundancy is also crucial for isolating and managing fault domains, especially in critical infrastructure. This can help prevent widespread outages and minimize downtime.

Security software like Falcon requires deep system access, which can be a double-edged sword. While it provides better protection, it also increases the risk of a bug taking down the entire operating system.

Organizations should consider rolling out updates in phases and employing a level of system redundancy to manage fault domains. This can help prevent similar problems in the future.

A fresh viewpoint: Patelco Security Update

A civil engineer working on a weir design using CAD software on a computer screen in an office setting.
Credit: pexels.com, A civil engineer working on a weir design using CAD software on a computer screen in an office setting.

The cloud requires solutions that are made for the cloud, and relying on sandboxing, safe-by-design languages, and non-destructive technologies is a must. This can help contain bugs and prevent them from propagating worldwide.

IT teams should attend to IT asset management and software asset management, and establish strong disaster recovery and business continuity planning as a priority. These measures can help prevent and mitigate the impact of future outages.

A unique perspective: Cloud Billing Software

Stock Plunging

CrowdStrike's stock took a hit, plummeting more than 21% in pre-market trading on Friday.

This significant drop in stock value resulted in a $16 billion reduction in the company's market cap.

The stock market can be unpredictable, and it's essential to stay informed about the companies you're invested in.

A 21% drop is a substantial decrease, and investors should be aware of the potential risks involved.

It's crucial to monitor market trends and adjust your investment strategy accordingly.

Curious to learn more? Check out: Equity Market Update

Carolyn VonRueden

Junior Writer

Carolyn VonRueden is a versatile writer with a passion for crafting engaging content on a wide range of topics. With a keen eye for detail and a knack for research, Carolyn has established herself as a reliable voice in the world of finance and travel writing. Her portfolio boasts a diverse array of article categories, from exploring the benefits of cash cards to delving into the intricacies of Delta SkyMiles payment options.

Love What You Read? Stay Updated!

Join our community for insights, tips, and more.