Skip to main content
The Anomaly Detection workflow detects outliers, correlates anomalies across instruments, classifies root causes, and forecasts future values. It runs in batch and streaming modes, producing anomaly scores, classifications, and a growth quality prediction alongside the raw timeseries. This workflow runs on the outputs of Characterization and Tool State workflows.

Detection Process

For each timeseries property:
  1. Embedding: Raw timeseries windows are encoded into high-dimensional vectors using a timeseries foundation model fine-tuned on process data.
  2. Outlier detection: Each embedding is scored against learned baselines. Outliers in the latent space correspond to unusual temporal patterns in the original signal.
  3. Cross-source correlation: After per-parameter detection completes, the system identifies temporal overlaps and embedding-space similarities across instruments to group related anomalies.
  4. Classification: Each anomaly is classified by probable root cause (e.g., source depletion, temperature excursion, substrate defect, chamber contamination) using the embedding vector and contextual features.
  5. Forecasting: The model projects the timeseries forward with confidence intervals, flagging predicted anomalies before they develop.
  6. Growth quality scoring: Signals from all active data sources are combined into a single quality score representing the probability of meeting target specifications.

Output Metrics

For each property in a timeseries, the workflow produces:
MetricDescriptionRange
anomaly_scoreLatent-space outlier score. Higher values indicate greater deviation from baselines.0 (normal) to 1 (anomalous)
anomaly_classClassification label, assigned when anomaly score exceeds the detection thresholdCategorical
forecastPredicted value N steps ahead based on current trajectorySame unit as property
forecast_lower / forecast_upperForecast confidence interval boundsSame unit as property
z_scoreStandard deviations from running mean (legacy)Unbounded
ema_z_scoreResidual z-score relative to exponential moving average (legacy)Unbounded
At the growth session level:
MetricDescriptionRange
growth_quality_scoreProbability of meeting target specifications, combining signals across all data sources0 (poor) to 1 (excellent)

Execution Modes

Batch: Runs automatically after the analysis pipeline completes on uploaded data. The full timeseries is evaluated in a single pass.Streaming: Runs incrementally with each incoming data chunk during a live run. The system maintains a rolling detection context across chunks to detect both sudden anomalies and emerging trends. Forecasts and the growth quality score update with each chunk. Error-level anomalies trigger real-time alerts.Scores and forecasts are not emitted until enough observations have been processed to fill the embedding window. Before that threshold, outputs are NaN.

Severity and Thresholds

Severity is assigned based on anomaly score and impact on the growth quality prediction:
SeverityMeaningAction
WarningScore elevated but within tolerance, or minor quality score impact. May self-correct.Review after the run or if warnings accumulate.
ErrorScore exceeds the critical threshold, or quality score has dropped significantly.Investigate immediately. Triggers real-time alerts in streaming runs.
Default baselines are calibrated from your organization’s historical data. Per-project overrides let you tighten thresholds for critical parameters or relax them for known-noisy signals.