Client

The Client class is your entry point to the Atomscale API. It provides methods for searching the catalogue, fetching analysis results, uploading files, and downloading processed videos.

View source on GitHub

Constructor

from atomscale import Client

client = Client(
    api_key="YOUR_API_KEY",
    endpoint="https://api.atomscale.ai/",
    mute_bars=False,
)

api_key

str | None

default:"None"

API key for authentication. If not provided, reads from the AS_API_KEY environment variable.

endpoint

str

default:"https://api.atomscale.ai/"

Root API endpoint. Can be overridden with the AS_API_ENDPOINT environment variable.

mute_bars

bool

default:"False"

Whether to suppress progress bars. Set to True for non-interactive environments like CI pipelines.

search()

Search and filter items in the data catalogue.

results = client.search(
    keywords="GaN",
    include_organization_data=True,
    data_type="rheed_stationary",
    status="success",
)

Parameters

keywords

str | list[str] | None

default:"None"

Keyword or list of keywords to search all catalogue fields. Applied after other explicit filters.

include_organization_data

bool

default:"True"

Whether to include catalogue entries from other users in your organization.

data_ids

str | list[str] | None

default:"None"

Filter to specific data IDs.

physical_sample_ids

str | list[str] | None

default:"None"

Filter by physical sample IDs.

project_ids

str | list[str] | None

default:"None"

Filter by project IDs.

data_type

str

default:"all"

Filter by data type.

Value	Description
`rheed_image`	Single RHEED image
`rheed_stationary`	Stationary RHEED video
`rheed_rotating`	Rotating RHEED video
`xps`	XPS spectrum
`photoluminescence`	Photoluminescence spectrum
`pl`	Alias for photoluminescence
`raman`	Raman spectrum
`all`	All types

status

str

default:"all"

Filter by pipeline status.

Value	Description
`success`	Analysis completed
`pending`	Queued for processing
`running`	Currently processing
`error`	Analysis failed
`stream_active`	Streaming in progress
`stream_interrupted`	Stream was interrupted
`stream_finalizing`	Stream finishing
`stream_error`	Streaming failed
`all`	All statuses

growth_length

tuple[int | None, int | None]

default:"(None, None)"

Filter by growth length in seconds. Tuple of (min, max) with None for open bounds.

upload_datetime

tuple[datetime | None, datetime | None]

default:"(None, None)"

Filter by upload timestamp. Tuple of (min, max) with None for open bounds.

last_accessed_datetime

tuple[datetime | None, datetime | None]

default:"(None, None)"

Filter by last accessed timestamp. Tuple of (min, max) with None for open bounds.

Returns

DataFrame

pandas.DataFrame

DataFrame with columns:

Column	Description
`Data ID`	Unique identifier
`Upload Datetime`	When the file was uploaded
`Last Accessed Datetime`	Last access time
`Type`	Data type (rheed_image, rheed_stationary, etc.)
`File Name`	Original filename
`Status`	Pipeline status
`File Type`	File extension/format
`Instrument Source`	Source instrument
`Sample Name`	Associated sample name
`Growth Length`	Duration in seconds (for videos)
`Physical Sample ID`	Linked physical sample
`Physical Sample Name`	Physical sample name
`Sample Notes`	Associated notes
`Owner`	Uploader’s name
`Workspaces`	Associated workspaces

Example

from datetime import datetime

results = client.search(
    keywords="GaN",
    data_type="rheed_stationary",
    status="success",
    upload_datetime=(datetime(2025, 1, 1), None),
    growth_length=(3000, None),
)
print(results[["Data ID", "Status", "Sample Name"]])

get()

Fetch analysis results for one or more data IDs.

analysed = client.get(data_ids=["44fa63b0-74da-4d25-a362-2276c80a670a"])

Parameters

data_ids

str | list[str]

required

Data ID or list of data IDs from the catalogue.

Returns

results

List of result objects. The type depends on the source data:RHEEDVideoResult (for rheed_stationary, rheed_rotating):

timeseries_data - DataFrame with per-frame metrics
snapshot_image_data - List of extracted frame snapshots

RHEEDImageResult (for rheed_image):

processed_image - Processed image as a PIL Image
mask - Binary segmentation mask (numpy array)
pattern_graph - NetworkX graph of the diffraction pattern
get_pattern_dataframe() - Tidy table of spot positions
get_plot() - Matplotlib figure (params: show_mask, show_spot_nodes, symmetrize, alpha)

XPSResult (for xps):

binding_energies - Array of binding energy values
intensities - Array of intensity values
predicted_composition - Dict mapping element symbols to fractional composition
detected_peaks - Detected peak positions
get_plot() - Matplotlib figure

PhotoluminescenceResult (for photoluminescence):

energies - Energy axis values
intensities - Intensity values
detected_peaks - Peak labels and positions
get_plot() - Matplotlib figure

RamanResult (for raman):

raman_shift - Raman shift axis values
intensities - Intensity values
detected_peaks - Peak labels and positions
get_plot() - Matplotlib figure

MetrologyResult (for metrology/instrument data):

timeseries_data - DataFrame with instrument readings (pyrometer, pressure, etc.)

OpticalResult (for optical imaging):

timeseries_data - DataFrame with per-frame metrics (edge perimeter, circularity, etc.)
snapshot_image_data - List of extracted frame snapshots

UnknownResult (fallback for unsupported types):

data_type - Type string
catalogue_entry - Raw catalogue metadata

Example

# Search and get results
search_results = client.search(keywords="demo", status="success")
analysed = client.get(search_results["Data ID"].to_list())

# Access time series data
video = analysed[0]
print(video.timeseries_data.columns)
print(video.timeseries_data.tail())

upload()

Upload files for analysis.

client.upload(files=[
    "/path/to/rheed.mp4",
    "/path/to/xps.vms",
])

Parameters

files

list[str | BinaryIO]

required

List of file paths (strings) or open file handles (BinaryIO objects).

Returns

None. Files are uploaded and queued for analysis. Check the web UI or use search() to monitor progress.

Example

# Upload by path
client.upload(files=[
    "/data/growths/2025-02-10/RHEED-stationary.mp4",
    "/data/growths/2025-02-10/RHEED-rotating.imm",
])

# Upload from file handles
with open("/data/rheed.mp4", "rb") as f:
    client.upload(files=[f])

download_videos()

Download processed or raw videos to disk.

client.download_videos(
    data_ids=["44fa63b0-74da-4d25-a362-2276c80a670a"],
    dest_dir="processed/",
    data_type="processed",
)

Parameters

data_ids

str | list[str]

required

One or more data IDs from the catalogue.

dest_dir

str | Path | None

default:"None"

Directory to write files to. Defaults to current working directory.

data_type

str

default:"processed"

Whether to download raw or processed data.

Returns

None. Files are saved to the destination directory.

Example

search_results = client.search(data_type="rheed_stationary", status="success")

# Download processed videos
client.download_videos(
    data_ids=search_results["Data ID"].to_list(),
    dest_dir="processed/",
)

# Download raw files
client.download_videos(
    data_ids=search_results["Data ID"].to_list(),
    dest_dir="raw/",
    data_type="raw",
)

list_physical_samples()

List all physical samples accessible to your account.

samples = client.list_physical_samples()

Returns

samples

pandas.DataFrame

DataFrame of physical sample records with columns including Physical Sample ID, Physical Sample Name, Project ID, Project Name, Target Material, and Owner.

list_projects()

List all projects accessible to your account.

projects = client.list_projects()

Returns

projects

pandas.DataFrame

DataFrame of project records with columns including Project ID, Project Name, Physical Sample Count, Project Notes, and Owner.

get_physical_sample()

Get all data associated with a physical sample.

result = client.get_physical_sample(
    physical_sample_id="sample-uuid",
    align=True,
)

Parameters

physical_sample_id

str

required

The physical sample ID.

include_organization_data

bool

default:"True"

Whether to include data from other users in your organization.

align

bool | str

default:"False"

Whether to time-align data from multiple sources. Pass True for outer join, or a string like "inner" to control the join strategy.

Returns

result

PhysicalSampleResult

Result object with attributes:

physical_sample_id - Sample identifier
physical_sample_name - Sample name
data_results - List of all result objects for this sample
aligned_timeseries - Aligned DataFrame if align was set, otherwise None

get_project()

Get all data associated with a project.

result = client.get_project(
    project_id="project-uuid",
    align=True,
)

Parameters

project_id

str

required

The project ID.

include_organization_data

bool

default:"True"

Whether to include data from other users in your organization.

align

bool | str

default:"False"

Whether to time-align data from multiple sources. Pass True for outer join, or a string like "inner" to control the join strategy.

Returns

result

ProjectResult

Result object with attributes:

project_id - Project identifier
project_name - Project name
samples - List of PhysicalSampleResult objects for each sample in the project
aligned_timeseries - Project-level aligned DataFrame if align was set, otherwise None

list_growth_instruments()

List all growth instruments accessible to your account.

instruments = client.list_growth_instruments()

Returns

instruments

list[dict]

List of instrument records. Each dict contains:

synth_source_id - Unique instrument ID (int)
source_name - Display name
synth_source_type - Instrument type (mbe, cvd, etc.)
source_manufacturer - Manufacturer name
source_model - Model name

create_growth_instrument()

instrument_id = client.create_growth_instrument(
    label="Main MBE",
    name="Veeco GEN10",
    instrument_type="mbe",
    serial_id="SN-12345",
)

Parameters

label

str

required

Display name for the instrument (e.g., “Main MBE”).

name

str

required

Manufacturer and model (e.g., “Veeco GEN10”).

instrument_type

str

required

Type of growth instrument.

Value	Description
`mbe`	Molecular beam epitaxy
`cvd`	Chemical vapor deposition
`pvd`	Physical vapor deposition
`sputter`	Sputtering
`ald`	Atomic layer deposition
`pld`	Pulsed laser deposition

serial_id

str | None

default:"None"

Optional serial number or identifier.

Returns

synth_source_id

int

The ID of the newly created instrument.

delete_growth_instrument()

Delete a growth instrument.

client.delete_growth_instrument(synth_source_id=42)

Parameters

synth_source_id

int

required

ID of the instrument to delete.

Returns

None.

Introduction

Guides

References

Constructor

search()

Parameters

Returns

Example

get()

Parameters

Returns

Example

upload()

Parameters

Returns

Example

download_videos()

Parameters

Returns

Example

list_physical_samples()

Returns

list_projects()

Returns

get_physical_sample()

Parameters

Returns

get_project()

Parameters

Returns

list_growth_instruments()

Returns

create_growth_instrument()

Parameters

Returns

delete_growth_instrument()

Parameters

Returns

Introduction

Guides

References

​Constructor

​search()

​Parameters

​Returns

​Example

​get()

​Parameters

​Returns

​Example

​upload()

​Parameters

​Returns

​Example

​download_videos()

​Parameters

​Returns

​Example

​list_physical_samples()

​Returns

​list_projects()

​Returns

​get_physical_sample()

​Parameters

​Returns

​get_project()

​Parameters

​Returns

​list_growth_instruments()

​Returns

​create_growth_instrument()

​Parameters

​Returns

​delete_growth_instrument()

​Parameters

​Returns

Constructor

search()

Parameters

Returns

Example

get()

Parameters

Returns

Example

upload()

Parameters

Returns

Example

download_videos()

Parameters

Returns

Example

list_physical_samples()

Returns

list_projects()

Returns

get_physical_sample()

Parameters

Returns

get_project()

Parameters

Returns

list_growth_instruments()

Returns

create_growth_instrument()

Parameters

Returns

delete_growth_instrument()

Parameters

Returns