<?xml version="1.0" encoding="UTF-8"?>
<feed xml:lang="en-US" xmlns="http://www.w3.org/2005/Atom">
  <id>tag:status.maestra.io,2005:/history</id>
  <link rel="alternate" type="text/html" href="https://status.maestra.io"/>
  <link rel="self" type="application/atom+xml" href="https://status.maestra.io/history.atom"/>
  <title>maestra Status - Incident history</title>
  <updated>2026-04-08T15:37:00.000+00:00</updated>
  <author>
    <name>maestra</name>
  </author>
  
<entry>
  <id>tag:status.maestra.io,2005:Incident/cmnrocpx300qrnif58pczltok</id>
  <published>2026-04-08T15:37:00.000+00:00</published>
  <updated>2026-04-08T15:37:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/incident/cmnrocpx300qrnif58pczltok"/>
  <title>Degraded access to admin panel</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 16 minutes</p>
    <p><strong>Affected Components:</strong> Admin panel</p>
    <p><small>Apr <var data-var='date'> 8</var>, <var data-var='time'>15:37:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  We are currently investigating elevated error rates affecting approximately 1.5% of traffic to some internal web services. API endpoints are operating normally and are not affected by this incident.</p>
<p><small>Apr <var data-var='date'> 9</var>, <var data-var='time'>16:28:00</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved. We&#039;ll follow up with details at a later date.</p>
<p><small>Apr <var data-var='date'> 17</var>, <var data-var='time'>13:28:06</var> GMT+0</small><br /><strong>Postmortem</strong> -
  **What happened**  
On April 8, about 1.5% of admin panel traffic started failing. API endpoints were not affected. The issue was fully resolved on April 9.

**Why it happened**  
We recently rolled out a new version of HAProxy, the software that routes user traffic to our services, across our fleet of six load balancers. The rollout had been tested in a test deployment and initially looked healthy in production as well. However, about 7 hours after the upgrade, one of the six load balancers started failing to reach backend services, which caused the errors users experienced.  
  
The root cause is a bug in the new HAProxy version that only shows up under a very specific, still-unidentified set of conditions. Our test deployment and the other five production balancers kept working fine, which is why the issue slipped past our pre-rollout checks.

**Why it took us a while to notice**  
When a request failed, our system automatically tried it again, and the retry usually succeeded. So from the outside things looked a bit slow or occasionally flaky rather than clearly broken. That automatic retry behavior is normally a good thing — it hides small, transient glitches from users — but in this case it also hid the growing problem from our monitoring long enough to delay detection.
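
To illustrate the masking effect described above, here is a minimal sketch, not the code running in our production systems. It assumes a hypothetical `call_with_retry` wrapper and a hypothetical `RetryStats` counter; both names are invented for this example. The idea is to count failed attempts separately from failed requests, so that errors hidden by a successful retry still show up in a metric that monitoring can alert on.

```python
from dataclasses import dataclass
from typing import Callable, TypeVar

T = TypeVar("T")


@dataclass
class RetryStats:
    """Hypothetical counters; a real setup would export these as metrics."""
    requests: int = 0         # logical requests issued by callers
    attempts: int = 0         # every attempt, including retries
    failed_attempts: int = 0  # attempts that raised, even if a retry later succeeded
    failed_requests: int = 0  # requests that still failed after the last attempt

    def attempt_error_rate(self) -> float:
        # Error rate before retries are applied.
        return self.failed_attempts / self.attempts if self.attempts else 0.0

    def request_error_rate(self) -> float:
        # Error rate that callers actually observe after retries.
        return self.failed_requests / self.requests if self.requests else 0.0


def call_with_retry(fn: Callable[[], T], stats: RetryStats, max_attempts: int = 2) -> T:
    """Call fn, retrying on any exception, and record every failed attempt."""
    last_attempt = max(1, max_attempts)  # always try at least once
    stats.requests += 1
    for attempt in range(1, last_attempt + 1):
        stats.attempts += 1
        try:
            return fn()
        except Exception:
            stats.failed_attempts += 1
            if attempt == last_attempt:
                stats.failed_requests += 1
                raise
    raise AssertionError("unreachable")
```

In a sketch like this, `request_error_rate()` can stay close to zero while `attempt_error_rate()` climbs. An alert on the second number would flag a failing balancer even when retries keep most requests succeeding.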

**What we did**  
\- Rolled back the affected balancer to the previous HAProxy version, which immediately restored normal traffic.  
\- Paused all further HAProxy upgrades across the fleet until we understand the trigger.

**What&#039;s next**  
We&#039;re working to reproduce the bug in a controlled test environment so we can pinpoint the trigger, confirm a fix (either a patched version or a config change), and safely resume the rollout.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Maintenance/cmmsirjc10695m0kq0psrohsd</id>
  <published>2026-03-16T05:00:00.000+00:00</published>
  <updated>2026-03-16T05:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/maintenance/cmmsirjc10695m0kq0psrohsd"/>
  <title>Database maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 1 hour</p>
    <p><strong>Affected Components:</strong> Loyalty, Admin panel, Site personalization</p>
    <p><small>Mar <var data-var='date'> 16</var>, <var data-var='time'>05:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  We have scheduled maintenance for this time window.  
  
We expect admin panel, site personalization and loyalty to be unavailable for no more than 35 minutes.</p>
<p><small>Mar <var data-var='date'> 16</var>, <var data-var='time'>05:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Mar <var data-var='date'> 16</var>, <var data-var='time'>06:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Maintenance/cmmsirk0e0617wv24wkpazf0q</id>
  <published>2026-03-16T05:00:00.000+00:00</published>
  <updated>2026-03-16T05:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/maintenance/cmmsirk0e0617wv24wkpazf0q"/>
  <title>Database maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 1 hour</p>
    <p><strong>Affected Components:</strong> Loyalty, Admin panel, Site personalization</p>
    <p><small>Mar <var data-var='date'> 16</var>, <var data-var='time'>05:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  We have scheduled maintenance for this time window.  
  
We expect admin panel, site personalization and loyalty to be unavailable for no more than 35 minutes.</p>
<p><small>Mar <var data-var='date'> 16</var>, <var data-var='time'>05:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Mar <var data-var='date'> 16</var>, <var data-var='time'>06:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Maintenance/cmmiifw8r01b3x6d003rfhtst</id>
  <published>2026-03-09T05:00:00.000+00:00</published>
  <updated>2026-03-09T05:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/maintenance/cmmiifw8r01b3x6d003rfhtst"/>
  <title>Database servers maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 1 hour</p>
    <p><strong>Affected Components:</strong> Loyalty, Admin panel, Site personalization</p>
    <p><small>Mar <var data-var='date'> 9</var>, <var data-var='time'>05:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  We have scheduled maintenance for this time window.  
  
We expect admin panel, site personalization and loyalty to be unavailable for no more than 45 minutes.</p>
<p><small>Mar <var data-var='date'> 9</var>, <var data-var='time'>05:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Mar <var data-var='date'> 9</var>, <var data-var='time'>06:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Maintenance/cmkqsc2st061x8iu91ylxanl7</id>
  <published>2026-01-26T06:00:00.000+00:00</published>
  <updated>2026-01-26T06:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/maintenance/cmkqsc2st061x8iu91ylxanl7"/>
  <title>Cluster maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    
    <p><strong>Affected Components:</strong> Transactional messages, Loyalty, Mass campaigns, Admin panel, Site personalization</p>
    <p><small>Jan <var data-var='date'> 26</var>, <var data-var='time'>06:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>
<p><small>Jan <var data-var='date'> 26</var>, <var data-var='time'>06:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  We have scheduled maintenance for this time window.  
  
We expect loyalty, mass campaigns, admin panel, site personalization and transactional messages to be unavailable for no more than 20 minutes.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Maintenance/cmjch5mya03wx13jw51rzv2hs</id>
  <published>2025-12-22T05:00:00.000+00:00</published>
  <updated>2025-12-22T05:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/maintenance/cmjch5mya03wx13jw51rzv2hs"/>
  <title>Network maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 1 hour</p>
    <p><strong>Affected Components:</strong> Transactional messages, Loyalty, Mass campaigns, Admin panel, Site personalization</p>
    <p><small>Dec <var data-var='date'> 22</var>, <var data-var='time'>05:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  We have scheduled maintenance for this time window.  
  
We expect admin panel, site personalization, loyalty, mass campaigns and transactional messages to be unavailable for no more than 10 minutes.</p>
<p><small>Dec <var data-var='date'> 22</var>, <var data-var='time'>05:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Dec <var data-var='date'> 22</var>, <var data-var='time'>06:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Incident/cmja43ook021rafzwk2n9bird</id>
  <published>2025-12-17T11:25:00.000+00:00</published>
  <updated>2025-12-17T11:25:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/incident/cmja43ook021rafzwk2n9bird"/>
  <title>Service degradation</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 30 minutes</p>
    <p><strong>Affected Components:</strong> Loyalty, Transactional messages, Site personalization</p>
    <p><small>Dec <var data-var='date'> 17</var>, <var data-var='time'>11:25:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  We are currently investigating this incident.</p>
<p><small>Dec <var data-var='date'> 17</var>, <var data-var='time'>11:55:00</var> GMT+0</small><br /><strong>Resolved</strong> -
  **Summary:**

On **2025-12-17**, our API Gateway experienced a temporary degradation caused by an unsuccessful update of an ingress component.

**Impact:**

* Elevated **5xx error rate** on the API Gateway.
* **Average 5xx rate over the incident interval:** **\~1.76% of total RPS**.
* **Peak 5xx rate:** **\~9–10% of total RPS**.
* Partial request failures for API clients during the incident window.

**Timeline (UTC):**

* **11:25** — 5xx rate begins to increase.
* **\~11:35** — Peak degradation observed.
* **\~11:55** — Errors drop back to baseline and service stabilizes.

**Incident Duration:**

From **11:25 to 11:55 (UTC)** — **\~30 minutes**.

**Root Cause:**

A failed deployment of an ingress component introduced regressions that affected API Gateway request handling, resulting in increased 5xx responses.

**Resolution:**

We rolled back the ingress component update and confirmed that the API Gateway error rate returned to normal levels.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Maintenance/cmheibnxb00i62fcbd85hnhal</id>
  <published>2025-11-03T05:00:00.000+00:00</published>
  <updated>2025-11-03T05:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/maintenance/cmheibnxb00i62fcbd85hnhal"/>
  <title>Network maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 30 minutes</p>
    <p><strong>Affected Components:</strong> Transactional messages, Loyalty, Mass campaigns, Admin panel, Site personalization</p>
    <p><small>Nov <var data-var='date'> 3</var>, <var data-var='time'>05:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  We have scheduled maintenance for this time window.  
  
We expect admin panel, site personalization, loyalty, mass campaigns and transactional messages to be unavailable for no more than 10 minutes.</p>
<p><small>Nov <var data-var='date'> 3</var>, <var data-var='time'>05:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Nov <var data-var='date'> 3</var>, <var data-var='time'>05:30:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Incident/cmgz955jf02sxvmu4wnsuwduw</id>
  <published>2025-10-20T14:50:00.429+00:00</published>
  <updated>2025-10-20T14:50:00.429+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/incident/cmgz955jf02sxvmu4wnsuwduw"/>
  <title>Transactional communications are sent with delay</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 35 minutes</p>
    <p><strong>Affected Components:</strong> Transactional messages</p>
    <p><small>Oct <var data-var='date'> 20</var>, <var data-var='time'>14:50:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  We are currently investigating this incident.</p>
<p><small>Oct <var data-var='date'> 20</var>, <var data-var='time'>15:24:54</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Incident/cmgtbornz010gl371d7fg3zc0</id>
  <published>2025-10-16T09:19:00.000+00:00</published>
  <updated>2025-10-16T09:19:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/incident/cmgtbornz010gl371d7fg3zc0"/>
  <title>Delays processing asynchronous operation calls</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 1 hour and 22 minutes</p>
    <p><strong>Affected Components:</strong> Transactional messages</p>
    <p><small>Oct <var data-var='date'> 16</var>, <var data-var='time'>09:19:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  Some requests to the /v3/operations/async endpoint are experiencing delays. We’re investigating the issue.</p>
<p><small>Oct <var data-var='date'> 16</var>, <var data-var='time'>10:41:07</var> GMT+0</small><br /><strong>Resolved</strong> -
  The incident has been resolved. All calls received during the outage will be processed once the current queue is cleared.  
  
An additional four-minute planned downtime of the async API, from 11:39 to 11:43 UTC (07:39–07:43 EDT, 04:39–04:43 PDT), was required to prevent further issues.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Maintenance/cmgh5t7ry008ipv1i2d8h86k2</id>
  <published>2025-10-07T22:00:00.000+00:00</published>
  <updated>2025-10-07T22:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/maintenance/cmgh5t7ry008ipv1i2d8h86k2"/>
  <title>Network equipment upgrade</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 5 hours</p>
    <p><strong>Affected Components:</strong> Admin panel, Site personalization</p>
    <p><small>Oct <var data-var='date'> 7</var>, <var data-var='time'>22:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Our hosting provider is upgrading the network equipment, which may cause brief, occasional service interruptions.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Incident/cmgcf7rmk0010qgeazox30cdq</id>
  <published>2025-10-04T14:39:00.000+00:00</published>
  <updated>2025-10-04T15:22:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/incident/cmgcf7rmk0010qgeazox30cdq"/>
  <title>Platform outage</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 11 minutes</p>
    <p><strong>Affected Components:</strong> Loyalty, Admin panel, Site personalization, Mass campaigns</p>
    <p><small>Oct <var data-var='date'> 4</var>, <var data-var='time'>15:22:00</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved. We will provide a comprehensive report next week.</p>
<p><small>Oct <var data-var='date'> 22</var>, <var data-var='time'>14:03:16</var> GMT+0</small><br /><strong>Postmortem</strong> -
  Due to a configuration error, offsite backup uploads fully saturated the network link, causing the gateway to become unavailable. To prevent this from happening again, we introduced rate limiting and moved backup transfers from the production link to a private Direct Connect link to S3.</p>
<p><small>Oct <var data-var='date'> 4</var>, <var data-var='time'>14:39:00</var> GMT+0</small><br /><strong>Identified</strong> -
  The platform is experiencing outages. The admin panel and some functionality are affected. We are actively investigating the issue.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Maintenance/cmg4p5j0b012jx2dzo15urka1</id>
  <published>2025-09-29T07:00:00.000+00:00</published>
  <updated>2025-09-29T07:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/maintenance/cmg4p5j0b012jx2dzo15urka1"/>
  <title>Network maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 1 hour</p>
    <p><strong>Affected Components:</strong> Mass campaigns, Site personalization, Loyalty</p>
    <p><small>Sep <var data-var='date'> 29</var>, <var data-var='time'>07:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  We have scheduled maintenance for this time window.  
  
We expect site personalization, loyalty and mass campaigns to be unavailable for no more than 30 minutes.</p>
<p><small>Sep <var data-var='date'> 29</var>, <var data-var='time'>07:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Sep <var data-var='date'> 29</var>, <var data-var='time'>08:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Maintenance/cme6ov3g800b94qsp4d9q09hl</id>
  <published>2025-08-11T06:00:00.000+00:00</published>
  <updated>2025-08-11T06:00:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/maintenance/cme6ov3g800b94qsp4d9q09hl"/>
  <title>Server maintenance</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Maintenance</p>
    <p><strong>Duration:</strong> 1 hour</p>
    <p><strong>Affected Components:</strong> Transactional messages, Mass campaigns, Loyalty</p>
    <p><small>Aug <var data-var='date'> 11</var>, <var data-var='time'>06:00:00</var> GMT+0</small><br /><strong>Identified</strong> -
  We will be rebooting the production server on August 11 at 7:00 AM UTC to apply updates and ensure stability.

Expected downtime: \~45 minutes.

Thank you for your understanding.</p>
<p><small>Aug <var data-var='date'> 11</var>, <var data-var='time'>06:00:01</var> GMT+0</small><br /><strong>Identified</strong> -
  Maintenance is now in progress.</p>
<p><small>Aug <var data-var='date'> 11</var>, <var data-var='time'>07:00:00</var> GMT+0</small><br /><strong>Completed</strong> -
  Maintenance has completed successfully.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Incident/cme004t08002lpaio1xqz4vs3</id>
  <published>2025-07-26T13:47:00.000+00:00</published>
  <updated>2025-07-26T13:47:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/incident/cme004t08002lpaio1xqz4vs3"/>
  <title>Delays in scenarios execution</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 19 minutes</p>
    <p><strong>Affected Components:</strong> Transactional messages</p>
    <p><small>Jul <var data-var='date'> 26</var>, <var data-var='time'>13:47:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  We are currently investigating this incident.</p>
<p><small>Jul <var data-var='date'> 26</var>, <var data-var='time'>14:06:00</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved.</p>

        ]]>
  </content>
</entry>

<entry>
  <id>tag:status.maestra.io,2005:Incident/cme00nkzr0041ls3uyo7t5lyy</id>
  <published>2025-06-26T15:24:00.000+00:00</published>
  <updated>2025-06-26T15:24:00.000+00:00</updated>
  <link rel="alternate" type="text/html" href="https://status.maestra.io/incident/cme00nkzr0041ls3uyo7t5lyy"/>
  <title>Partial outage of the platform</title>

  <content type="html">
  <![CDATA[
    <p><strong>Type:</strong> Incident</p>
    <p><strong>Duration:</strong> 1 hour and 42 minutes</p>
    <p><strong>Affected Components:</strong> Loyalty, Transactional messages, Mass campaigns, Admin panel, Site personalization</p>
    <p><small>Jun <var data-var='date'> 26</var>, <var data-var='time'>15:24:00</var> GMT+0</small><br /><strong>Investigating</strong> -
  We are currently investigating this incident.</p>
<p><small>Jun <var data-var='date'> 26</var>, <var data-var='time'>17:06:00</var> GMT+0</small><br /><strong>Resolved</strong> -
  This incident has been resolved.</p>

        ]]>
  </content>
</entry>

</feed>