Glossary

Operational definitions for AI data, alignment, agents, and physical AI.

17 resources

Agentic AI

Agentic AI refers to AI systems that pursue goals through multi-step interaction with tools, software, environments, memory, or other agents, rather than producing only a single isolated response.

Read glossary

Glossary

Data Curation

Data curation is the governed process of selecting, filtering, organizing, enriching, balancing, documenting, and versioning data so it is fit for a defined training, post-training, retrieval, or evaluation purpose.

Read glossary

Glossary

DPO

Direct Preference Optimization (DPO) is a preference-based post-training method that optimizes a policy directly from preferred and dispreferred responses relative to a reference model, without first fitting a separate explicit reward model.

Read glossary

Glossary

Golden Trajectory

A golden trajectory is a reviewed reference path—or set of acceptable reference paths—showing how an agent can complete a defined task correctly under specified tools, permissions, policy, and environment state.

Read glossary

Glossary

Inter-Annotator Agreement

Inter-annotator agreement (IAA) measures how consistently two or more annotators assign labels, scores, spans, rankings, or other judgments to the same items under a defined annotation protocol.

Read glossary

Glossary

LiDAR Annotation

LiDAR annotation is the creation or validation of labels on laser-scanned 3D point clouds or range data for detection, tracking, segmentation, mapping, localization, and scene understanding.

Read glossary

Glossary

MCAP

MCAP is an open, modular container file format for recording multiple channels of timestamped, pre-serialized data, commonly used for robotics and multimodal logs.

Read glossary

Glossary

Model Integrity

Model integrity is an operational umbrella term for evidence that an AI model or system behaves consistently with its specified purpose, constraints, provenance, security assumptions, and release requirements across its lifecycle.

Read glossary

Glossary

Multimodal Data

Multimodal data combines two or more information modalities—such as text, image, video, audio, document layout, screen state, depth, point cloud, or sensor streams—in a record whose relationships matter to the target task.

Read glossary

Glossary

Red Teaming

AI red teaming is structured adversarial testing that probes a model or complete AI system for harmful, insecure, unreliable, policy-violating, or otherwise unacceptable behavior before and after deployment.

Read glossary

Glossary

RLHF

Reinforcement learning from human feedback (RLHF) is a family of post-training methods that uses human judgments to construct a learning signal for improving model behavior.

Read glossary

Glossary

ROS Bag

A ROS bag is a recorded collection of ROS messages and associated metadata used to capture, replay, inspect, and process communication from a Robot Operating System application.

Read glossary

Glossary

Sensor Fusion

Sensor fusion is the process of combining observations from multiple sensors or streams to estimate a state, detect an object, or make a decision more reliably than a single source alone.

Read glossary

Glossary

SFT

Supervised fine-tuning (SFT) adapts a pretrained model by training it to reproduce curated target outputs for specified inputs or conversational contexts.

Read glossary

Glossary

Tool-Use Trajectory

A tool-use trajectory is the ordered record of an agent’s interaction with tools and an environment from an initial task state to completion, failure, escalation, or termination.

Read glossary

Glossary

VLA

A vision-language-action (VLA) model maps visual observations and language instructions to actions or action distributions for an embodied system.

Read glossary

Glossary

VLM

A vision-language model (VLM) is a model that jointly processes visual inputs and language to represent, retrieve, generate, or reason about content across the two modalities.

Read glossary