Trading Service Failure Report: Incident Analysis and Preventative Measures


Impact and Timelines

From 8:39:00 AM to 9:28:15 AM UTC on March 17, 2023, OKX's trading systems were partially or fully unavailable. Below is the detailed incident timeline:


Root Cause Analysis

The downtime resulted from resource exhaustion in a core infrastructure component due to:

Proactive measure: Suspending trading prevented disorderly market conditions while the issue was being resolved.


Preventative Actions

To minimize future disruptions, OKX is implementing:

  1. Technical Optimizations

    • Scaling the logging process (e.g., enforcing log file size limits).
    • Enhanced monitoring on both the server and client ends.
  2. Procedural Improvements

    • Detailed incident documentation for root-cause analysis.
    • Streamlined alert protocols for faster response.
  3. System Redundancies

    • Upgraded infrastructure resilience.
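
As an illustration of the "file size limits" item under Technical Optimizations, here is a minimal sketch of size-capped log rotation using Python's standard library. This report does not state which language or logging tooling OKX uses, so everything in it is an assumption made for illustration: the logger name, the file name service.log, and the MAX_BYTES and BACKUP_COUNT values are all hypothetical.

```python
import logging
import os
import tempfile
from logging.handlers import RotatingFileHandler

# Hypothetical limits chosen for the demo; a production service would
# typically use much larger values.
MAX_BYTES = 1024     # cap on each individual log file, in bytes
BACKUP_COUNT = 3     # number of rotated files kept alongside the active one


def build_capped_logger(log_dir: str) -> logging.Logger:
    """Return a logger whose on-disk footprint is bounded by rotation."""
    logger = logging.getLogger("trading-demo")  # hypothetical logger name
    logger.setLevel(logging.INFO)
    logger.propagate = False  # keep demo output out of the root logger
    handler = RotatingFileHandler(
        os.path.join(log_dir, "service.log"),
        maxBytes=MAX_BYTES,
        backupCount=BACKUP_COUNT,
    )
    handler.setFormatter(
        logging.Formatter("%(asctime)s %(levelname)s %(message)s")
    )
    logger.addHandler(handler)
    return logger


def total_log_bytes(log_dir: str) -> int:
    """Sum the sizes of all files in the log directory."""
    return sum(
        os.path.getsize(os.path.join(log_dir, name))
        for name in os.listdir(log_dir)
    )


# Write far more data than the cap allows and let rotation discard the oldest.
log_dir = tempfile.mkdtemp()
logger = build_capped_logger(log_dir)
for i in range(2000):
    logger.info("order event %d", i)
for handler in logger.handlers:
    handler.flush()

print("files kept:", sorted(os.listdir(log_dir)))
print("bytes on disk:", total_log_bytes(log_dir))
```

With these settings the logs can occupy at most (BACKUP_COUNT + 1) × MAX_BYTES on disk, provided no single record is larger than MAX_BYTES. The oldest records are discarded rather than being allowed to grow without bound.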

Commitment to Reliability

OKX prioritizes:


FAQs

Q: How long did the outage last?
A: About 49 minutes, from 8:39:00 AM to 9:28:15 AM UTC on March 17, 2023.

Q: Were user funds affected?
A: No. Fund safety protocols remained intact.

Q: What’s being done to prevent recurrence?
A: Log process scaling, enhanced server- and client-end monitoring, streamlined alert protocols, and upgraded infrastructure resilience.

Q: Where can I check real-time system status?
A: Visit the OKX Status Page.


Note: This report replaces all prior communications dated March 20, 2023.