Chenxi Wang, Ph.D.’s Post

View profile for Chenxi Wang, Ph.D., graphic

Investor, Cyber expert, Fortune 500 board member, Venturebeat Women-in-AI award winner. I talk about #cybersecurity #venturecapital #diversity #womenintech #boardgovernance

This #crowdstrike #microsoft incident is causing a global outage. Here are some of the photos from various places. Yes, that is the giant sphere in Vegas. The good news is that it is not an attack. The software industry seems to be doing a better job of disrupting ourselves than hackers 😂 What lessons can we take away from this? Software updates are hard, in that ensuring it does not cause any negative impact is hard -- it is as hard as solving the "halting problem". However, we can do things better. Most companies don't adequately test their apps, let alone an update. But we must treat every change in software with the same rigor as the original dev & release. If we don't, this is what we are looking at. In fact, we are lucky that this is not another Solarwinds style attack. My thanks go to security teams globally working overtime to implement the workarounds and fixes. You are the heros!

  • No alternative text description for this image
  • No alternative text description for this image
Chenxi Wang, Ph.D.

Investor, Cyber expert, Fortune 500 board member, Venturebeat Women-in-AI award winner. I talk about #cybersecurity #venturecapital #diversity #womenintech #boardgovernance

2mo

People, whether the Sphere blue screened or not is NOT the point here. We are discussing the scale and implications of this disruption caused by a software update. For the purpose of this discussion, the Sphere might as well be blue screened. So there.

Chenxi Wang, Ph.D.

Investor, Cyber expert, Fortune 500 board member, Venturebeat Women-in-AI award winner. I talk about #cybersecurity #venturecapital #diversity #womenintech #boardgovernance

2mo

I am at my bank trying to open up an account for my son. After 40 minutes of trying, we were told it wasn't working because of the Crowdstrike outage. The banker asked me if I saw the news this morning. Hahahahaha

Rupa Dachere

CEO, President & Founder @Thrive-WiSE

2mo

At one point in my career, I was on the install/update dev team for a flagship product that was deployed globally and was the industry leader. I know first hand about how difficult it is to test/roll out updates, especially, if the code changes are at a kernel/low level. There is ALWAYS pressure to release quickly (especially if one is reacting to a vulnerability/exploit). Digging one's heels in and pushing back at internal forces so that one can do adequate testing/verification before releasing is a key skill for all devs, let alone managers/directors. Hopefully, this incident can be used as an example to advocate for adequate testing/verification.

Jonathan Chan

Technology Enablement, Trusted Advisor, Visionary Leader

2mo

Critical business infrastructure, being the backbone of essential services and operations, necessitates a higher level of security and resilience compared to other systems. This incident involving Crowdstrike and Microsoft Azure underscores this very point.

Philip Koopman

Autonomous Vehicle Safety, Embedded Software, UL 4600, Consulting, (He/him.) Personal account; likes/shares are interest and not endorsements; silence does not imply agreement.

2mo

Chenxi Wang, Ph.D. -- do we know if this was a defect missed by testing vs. releasing the wrong version? Both are a problem, but a significantly different type of problem.

Like
Reply
Rahul Bagal

Software Development Engineer | Python, React, SQL | I've developed 20+ websites used by 100K+ people worldwide.

2mo

Looks like the software industry just updated itself into a global coffee break!

I remeber the infamous SEP update, Javed Hasan! At least, auto update was not on by default in these on-prem days. Now. That was not even a software update, it seems! CRWD used the words "content update" so that is user space level (signatures, IoCs and such) but unfortunately, the new "content" must have triggered a latent bug in the executing kernel driver. That is bad luck as I assume content update may not trigger the same level of QA and deployment caution...

Evan Powell

Many time founder & 5 exits - lots of open source - now working to reimagine cyber security with deep learning

2mo

Data platforms are hard Every time you contract w a traditional security vendor you invite another data platform into your environment Maybe we sd make the data platform itself a first class citizen run by dedicated teams W/ security as a customer? Much like finance, marketing, sales and product rely on such common data platforms? There are many chicken or the egg problems to be solved, but today’s outage highlights the importance of security becoming a more normal consumer of data platforms over time

Atul S.

Technology Enabler

2mo

In a hurry to release functionality and features updates, usually many development/testing/qa teams don't always consider performance, impact, reliability of application as much as they should. These teams need some people on their team who objectively review all that and give a go-ahead. But in my over 3 decades of work in many fortune 100 companies, I have seen that these objetive /pushing back voices are many times faint, non existent or overridden. Secondly global updates of this magnitude have to be rolling in nature and contained by geographic area to reduce impact.

See more comments

To view or add a comment, sign in

Explore topics