ETL Processing

Compose workflows to automate your data processing and product pipelines.

Automation of pipelines for geospatial data modeling and Extract, Transform, & Load (ETL) Notebook capabilities allow non-developer users to intuitively interact with a multitude of data sources to construct workflows. ETL workflows capabilities include data cleansing, classification, analytics, queries, and filters across a variety of data types.

Unify Stakeholder Access to Enterprise Data Stores, Lakes, and User Repositories

Enterprises rely on secure and efficient access to their data assets. As organizations evolve, data storage options risk becoming increasingly fragmented across individual user repositories, incompatible micro-services, data stores, and data lakes. MapLarge simplifies access to this otherwise siloed data with federated queries, joins, and other operations on geospatial and non-geospatial data.

DATA INGEST AND TRANSPORT
Intelligent Recurring Scheduling
Intelligent Recurring Scheduling
Push/Pull As Appropriate
Push/Pull As Appropriate
Centralized Curated Data Access
Centralized Curated Data Access
Dynamic Configurable Data Synchronization
Dynamic Configurable Data Synchronization
Full ETL Suite
Full ETL Suite
MapLarge ETL Notebooks enables non-developer analysts to systematically apply GIS data transformations on foundation geospatial intelligence (GEOINT) data via repeatable import, cleaning, curation, and export workflows.
DATA INGEST AND TRANSPORT
Intelligent Recurring Scheduling
Intelligent Recurring Scheduling
Push/Pull As Appropriate
Push/Pull As Appropriate
Centralized Curated Data Access
Centralized Curated Data Access
Dynamic Configurable Data Synchronization
Dynamic Configurable Data Synchronization
Full ETL Suite
Full ETL Suite
ETL Notebooks provides a modular workflow approach to integrate AI/machine learning scripts/libraries written in Python and R. This workflow representation provides data scientists greater visibility during the prototyping process.
AI & DATA SCIENCE
AI & Custom Model Integration ONNX TensorFlow, PyTorch, ML.NET
AI & Custom Model Integration ONNX TensorFlow, PyTorch, ML.NET
Script with Python, R, Node, Java, C#
Script with Python, R, Node, Java, C#

Stabilize Data Science Code for Model Lifecycle Management at Scale

Cleaned and normalized data often requires additional transformations to prepare it for use with a data model. ETL workflows can apply additional transformations such as feature selection and feature engineering to increase the model accuracy and reduce computation time. Data Scientists need the capability to code in the tools of their choice and then execute this code in an integrated enterprise workflow for repeatability at production scale. MapLarge natively supports integrations with tools like Jupyter Notebooks, TensorFlow, ML.Net and PyTorch as well as dozens of other tools and frameworks via customizable secure Docker Containers.

AI & DATA SCIENCE
AI & Custom Model Integration ONNX TensorFlow, PyTorch, ML.NET
AI & Custom Model Integration ONNX TensorFlow, PyTorch, ML.NET
Script with Python, R, Node, Java, C#
Script with Python, R, Node, Java, C#

Create Flexible Workflows to Ingest, Normalize, Transform, and Share Data Products

Heterogeneous data stores and lakes, especially including geospatial data, require a series of ETL processing steps to prepare feeds for downstream consumption. ETL workflows provide a means to condition the data and increase its compatibility with analytic, machine learning, mapping, visualization, alerting, and tasking tools. ETL processing must keep up with streaming data and the real-time demands of enterprise applications. No-code ETL Workflows enable product managers and domain experts (analysts, operators, engineers, etc.) to efficiently create data products for their end-customers.

DATA PREPARATION
Outlier Identification and Removal
Outlier Identification and Removal
Data Scaling
Data Scaling
Data Normalization
Data Normalization
Integer Encoding
Integer Encoding
One-Hot Encoding
One-Hot Encoding
Data Visualization
Data Visualization
ETL Notebooks provides automated import/export, cleaning, styling, analytics, and data model instantiation workflow. Raster data products, such as LiDAR and Synthetic Aperture Radar (SAR), are processed to provide geospatial analytics such as viewshed, sensor avoidance and GPS routing and navigation. These features support use cases such as broad-area search for candidate helicopter landing zones (HLZ).
DATA PREPARATION
Outlier Identification and Removal
Outlier Identification and Removal
Data Scaling
Data Scaling
Data Normalization
Data Normalization
Integer Encoding
Integer Encoding
One-Hot Encoding
One-Hot Encoding
Data Visualization
Data Visualization

Dashboards
Dashboards
Analytics
Analytics
Alerting Framework
Alerting Framework
ETL Processing
ETL Processing
Sharing & Collaboration
Sharing and Collaboration
Task Management
Task Management
Mapping Engine
Mapping Engine
Streaming Ingest
Streaming Ingest
Multi-Model Database
Multi-Model Database
Extensibility
Extensibility
Machine Learning Ops
Machine Learning Ops
Security Administration
Security Administration

Platform for Building Geospatial and IoT Applications