Welcome to Lume! 🚀 We’ve recently introduced Projects as our new way to organize data transformations. If you’re new to Lume or evaluating our platform, we recommend starting with our Project Guides for the latest and greatest experience. Existing customers can continue using Flows while we help you transition.
Understanding these foundational concepts will help you make the most of Lume’s capabilities. This guide introduces the key components and how they work together.
While Lume only requires a single record to generate mapping logic, providing larger data samples improves mapping accuracy through better pattern recognition.
Need support for additional data formats? Contact the Lume team for assistance!
A target schema defines the desired output format for your transformed data. It uses JSON Schema format to specify:
Expected data types
Field requirements
Data validation rules
Format specifications
Remember: Property names in Lume’s API cannot contain periods (.).
Don’t know JSON Schema? Lume can automatically generate a target schema from a sample CSV file containing your desired output format. This makes it easy for non-technical users to define their data requirements.
Example Target Schema
Copy
Ask AI
{ "type": "object", "properties": { "full_name": { "type": ["string"], "description": "The full name of the customer, including first name and last name." }, "email_address": { "type": ["string"], "description": "The customer's primary email address used for communication.", "format": "email" } // ... additional fields ... }, "required": ["full_name", "email_address"]}
Required Schema Properties
Type: Defines allowed data types
Copy
Ask AI
{ "type": ["string", "null"]}
Description: Explains the field’s purpose
Copy
Ask AI
{ "description": "The status of the business's registration"}
Optional Schema Properties
Enum: Defines allowed values for classification
Copy
Ask AI
{ "enum": ["CA", "NY", "TX"]}
Format: Specifies data format (email, date, UUID, etc.)
Copy
Ask AI
{ "format": "date-time"}
Pattern: Enforces format via regex
Copy
Ask AI
{ "pattern": "^\\+?[1-9]\\d{1,14}$"}
Advanced Properties
cleaning-instructions: Custom data cleaning rules
Copy
Ask AI
{ "lume-settings": { "cleaning-instructions": "Remove special characters and convert to lowercase" }}
source-enum: Defines acceptable source values for classification
Copy
Ask AI
{ "lume-settings": { "source-enum": ["California", "New York", "Texas"] }}
A flow is your complete data transformation pipeline. It can:
Accept multiple data inputs
Include multiple transformation steps
Join and combine data
Produce final mapped output
Flows help you organize related transformations into logical sequences. Complex transformations can be broken down into manageable steps, making them easier to maintain and modify.