400-0011-866

EsDataFactory

An automated, user-friendly next-gen data warehouse solution
EsDataFactory is a next-generation data integration tool developed by ESENSOFT, built on more than a decade of experience implementing data warehouse and business intelligence projects. It incorporates current data warehouse implementation methodologies and is designed for data service-oriented projects. Able to meet the requirements of a wide range of data integration initiatives, EsDataFactory offers a simpler and smarter alternative to traditional ETL (Extract, Transform, Load) tools. It simplifies complex tasks and automates repetitive processes, making data integration more efficient and user-friendly.
Key Features

One-Stop Modeling and Data Processing

Traditional ETL tools lack control over models, whereas in data warehouse project construction, the quality of model design and management is often a critical factor determining the success or failure of the project. EsDataFactory integrates data modeling and ETL design, allowing for simultaneous completion of ETL process design during modeling, providing a one-stop data service.

Intelligent Wizard, Convenient and Efficient

Data integration tools are not easy to master and typically require substantial hands-on experience. Understanding data models, implementing process logic, and verifying each step all involve partially repetitive, mechanical operations. EsDataFactory takes a wizard-guided approach, allowing users to complete modeling and data transformation definitions with minimal learning. Data changes are detected during configuration, so the entire process is driven automatically by AI.

Comprehensive Data Visualization Technology

EsDataFactory leverages a wide range of visualization techniques to transform the traditionally opaque "black box" process of data integration into a transparent and visible workflow. With model visualization, process visualization, and monitoring visualization, each stage of the project is presented in a more intuitive and concise manner, providing clear insights into the construction and operational status.

Cross-Platform Heterogeneous Data Source Integration

In addition to standard database connections, EsDataFactory supports various other data formats, including Excel, text data, PetaBase, and more. It enables direct cross-database data association and retrieval, significantly reducing the workload of traditional data integration, such as converting heterogeneous data into a homogeneous format and synchronizing data.

Genuine "Zero Programming" Experience

EsDataFactory adopts drag-and-drop process definition and a modular transformation approach, simplifying ETL process definition and significantly lowering the entry barrier. Building on this foundation, EsDataFactory internally encapsulates a wide array of data cleansing and transformation rules, so users do not need to learn complex expression syntax or database-specific dialects. No coding is required, which greatly reduces technical barriers. With simple drag-and-drop configuration, data can be put to use easily.

Interconnected Among All Products

ESENSOFT is a leading provider of professional data warehouse and business intelligence solutions in China. The company provides a full-spectrum product suite supporting the entire data lifecycle, from collection, transmission, and loading to storage, analysis, and visualization. EsDataFactory is a vital component of the ESENSOFT product ecosystem. With its model-integrated architecture, it integrates and interconnects with the other products, offering flexible and comprehensive product suite solutions.

Functional Characteristics

Layered Data Warehouse Management

Unlike traditional data integration tools, which tend to have a fragmented organizational structure, EsDataFactory adopts a logical, layered management model for data warehousing. It includes built-in layers such as ODS, Integration Layer, Summary Layer, and Data Marts. Users can freely customize their configuration based on actual needs. Additionally, unique functional guides are designed to align with the characteristics of each data warehouse layer, enabling users to quickly become proficient in the tool.

Multi-Dimensional and Multi-Method Visualization Modeling

EsDataFactory provides a visual modeling interface for creating data models, supporting both custom model development and field selection from existing data structures (such as database tables, views, and files). It offers a global view to visualize the relationships between models and between models and dimensions. Users can build normal-form, star, and snowflake schemas.

Personalized Model Resource Retrieval

EsDataFactory features a search engine-like retrieval system, offering both simple and advanced search modes. The simple mode allows users to search for models based on the English or Chinese names of the model and its fields, with matching items highlighted. The advanced mode enables more complex searches, tailored to user-defined parameters, facilitating efficient retrieval of model resources.

Incremental Capture and Data Loading

EsDataFactory offers several change data capture mechanisms, such as timestamp-based capture, MD5 comparison, triggers, and full table comparison. These processes are transparent to the user, who needs only simple configuration to capture incremental data. The platform supports multiple data loading strategies, including overwrite, append, update, and update-insert, covering the common data loading scenarios.
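The MD5 comparison mechanism and the update-insert strategy can be illustrated with a minimal sketch. This is not EsDataFactory's implementation; the function names (`row_digest`, `capture_changes`, `upsert`) and the in-memory dictionaries standing in for tables are assumptions made only for illustration.

```python
import hashlib

def row_digest(row, columns):
    """Return an MD5 digest over the chosen columns of one row."""
    joined = "\x1f".join(str(row[c]) for c in columns)
    return hashlib.md5(joined.encode("utf-8")).hexdigest()

def capture_changes(source_rows, known_digests, key, columns):
    """Return rows that are new or whose digest differs from the stored one.

    known_digests maps key -> digest from the previous run and is updated in place.
    """
    changed = []
    for row in source_rows:
        digest = row_digest(row, columns)
        if known_digests.get(row[key]) != digest:
            changed.append(row)
            known_digests[row[key]] = digest
    return changed

def upsert(target, rows, key):
    """Update-insert: replace rows with a matching key, insert the rest."""
    for row in rows:
        target[row[key]] = dict(row)

# Hypothetical usage: two successive extractions of the same source table.
target, digests = {}, {}
first = [{"id": 1, "name": "Ann"}, {"id": 2, "name": "Bob"}]
upsert(target, capture_changes(first, digests, "id", ["name"]), "id")

second = [{"id": 1, "name": "Ann"}, {"id": 2, "name": "Robert"}, {"id": 3, "name": "Cy"}]
delta = capture_changes(second, digests, "id", ["name"])  # only ids 2 and 3
upsert(target, delta, "id")
```

Compared with timestamp-based capture, digest comparison does not rely on the source maintaining a reliable modification time, at the cost of reading every source row.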

Drag-and-Drop Process Designer

EsDataFactory features a fully drag-and-drop process designer, enabling users to complete complex ETL tasks and job workflow definitions by simply dragging and dropping data resources and transformation components. With zero coding required, the interface is user-friendly, easy to read, and easy to maintain. The comprehensive graphical editing functions, such as copy, paste, undo, redo, and auto-align, enhance usability and efficiency.

Visual Debugging and Pre-Loading

The platform supports visual debugging, including step-by-step execution and execute-to-step functionality. Users can check the status, execution time, and result set of each step. For developers, EsDataFactory offers pre-loading mechanisms and per-step execution status, making it easier to validate workflows and debug scripts quickly and streamlining the development process.

Rich Data Processing Components

EsDataFactory offers a wide range of data processing components to handle tasks such as data transfer, cleansing, transformation, and loading and offloading. The platform features a flat icon design, along with comprehensive online help manuals and a case library, making it easy for users to get started.

Specialized Data Warehouse Application Components

In addition to traditional data processing components, EsDataFactory encapsulates commonly used data warehouse transformation operations, such as row-to-column, column-to-row, row record splitting, consistent dimension transformations, and slowly changing dimension transformations.
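As an illustration of one of these operations, the following is a minimal sketch of a slowly changing dimension update that keeps history (commonly called Type 2). It is not the component's actual logic; the function name `apply_scd2` and the column names `valid_from`, `valid_to`, and `is_current` are assumptions chosen for the example.

```python
from datetime import date

def apply_scd2(dimension, incoming, key, tracked, today):
    """Apply a history-keeping (Type 2) update to a dimension, in place.

    dimension: list of dicts carrying 'valid_from', 'valid_to', 'is_current'.
    incoming:  list of dicts holding the latest source attributes.
    tracked:   attribute names whose change should open a new version.
    """
    current = {row[key]: row for row in dimension if row["is_current"]}
    for src in incoming:
        old = current.get(src[key])
        if old is not None and all(old[c] == src[c] for c in tracked):
            continue  # nothing changed, keep the current version
        if old is not None:
            old["valid_to"] = today   # close the previous version
            old["is_current"] = False
        new_row = {key: src[key]}
        new_row.update({c: src[c] for c in tracked})
        new_row.update({"valid_from": today, "valid_to": None, "is_current": True})
        dimension.append(new_row)
        current[src[key]] = new_row
    return dimension

# Hypothetical usage: a customer moves from one city to another.
customers = []
apply_scd2(customers, [{"cust_id": 1, "city": "Beijing"}], "cust_id", ["city"], date(2024, 1, 1))
apply_scd2(customers, [{"cust_id": 1, "city": "Shanghai"}], "cust_id", ["city"], date(2024, 6, 1))
```

After the second call the dimension holds two rows for the same customer: the closed Beijing version and the current Shanghai version, so facts can still be joined to the attributes that were valid at the time.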

“Zero” Expression Cleaning Component

EsDataFactory includes a variety of cleaning rules for field-level and record-level data cleaning and transformation, eliminating the need for complex expression definitions. During configuration, the cleaning component shows a real-time preview of the data before and after cleaning, and it automatically merges multiple rules for execution, reducing performance overhead.
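The idea of merging several rules so that each record is visited once, while still offering a before-and-after preview, can be sketched as follows. The rule functions and the `merge_rules` and `clean` helpers are hypothetical names for illustration and do not reflect the product's internal API.

```python
from functools import reduce

# Three illustrative field-level rules; each takes a value and returns a value.
def trim(value):
    return value.strip() if isinstance(value, str) else value

def empty_to_none(value):
    return None if value == "" else value

def upper(value):
    return value.upper() if isinstance(value, str) else value

def merge_rules(*rules):
    """Compose several field-level rules into a single function, applied in order."""
    return lambda value: reduce(lambda acc, rule: rule(acc), rules, value)

def clean(rows, field_rules):
    """Apply the merged rule for each field in one pass over the rows.

    Returns (before, after) pairs so the effect can be previewed.
    """
    merged = {field: merge_rules(*rules) for field, rules in field_rules.items()}
    preview = []
    for row in rows:
        cleaned = {f: (merged[f](v) if f in merged else v) for f, v in row.items()}
        preview.append((row, cleaned))
    return preview

# Hypothetical usage.
rows = [{"code": " ab ", "note": ""}]
rules = {"code": [trim, upper], "note": [trim, empty_to_none]}
preview = clean(rows, rules)  # each entry pairs the original row with its cleaned copy
```

Composing the rules first means the data is scanned once regardless of how many rules are configured, which is the kind of saving the merged execution is meant to provide.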

Practical Batch Processing

For commonly used data warehouse operations, EsDataFactory offers a wide range of batch definitions and shortcut operations. The wizard-style guidance helps users quickly define data models and load data in bulk.

Isolation of Design and Runtime Perspectives

To ensure the stability and reliability of the runtime environment, EsDataFactory provides a mechanism that isolates the design area from the runtime area. All job revisions and changes are made in the design area and, after debugging and test runs, are released to the stable runtime environment through the release mechanism, ensuring that development and execution proceed without interference.

Scientific Data Security and Anonymization Methods

Protecting sensitive private data is crucial for information security, and a breach of confidential data can have severe consequences. EsDataFactory provides a package of common data masking algorithms that preserves data association rules while keeping data usage equivalent between development and production environments. Common anonymization algorithms include fixed value replacement, prefix and suffix addition, data obfuscation, data trimming, segment replacement, MD5 processing, hashing, random filling, and offset encryption.
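A few of the listed algorithms can be sketched in a handful of lines. These are generic illustrations, not the product's algorithms: the function names are hypothetical, and `digit_offset` is only one plausible reading of "offset encryption".

```python
import hashlib

def fixed_value(_value, replacement="***"):
    """Fixed value replacement: every input maps to the same constant."""
    return replacement

def segment_replace(value, start, end, fill="*"):
    """Replace the characters in [start, end) with a fill character, keeping length."""
    end = min(end, len(value))
    start = min(start, end)
    return value[:start] + fill * (end - start) + value[end:]

def md5_mask(value, salt=""):
    """MD5 processing: equal inputs give equal outputs, so joins on masked keys still match."""
    return hashlib.md5((salt + value).encode("utf-8")).hexdigest()

def digit_offset(value, offset):
    """Shift each ASCII digit by a fixed offset modulo 10; other characters pass through."""
    return "".join(
        str((int(ch) + offset) % 10) if ch in "0123456789" else ch
        for ch in value
    )

# Hypothetical usage on made-up values.
masked_phone = segment_replace("13812345678", 3, 7)   # "138****5678"
masked_month = digit_offset("2024-01", 5)             # "7579-56"
```

Deterministic methods such as `md5_mask` keep association rules intact because the same source value always yields the same masked value across tables. MD5 of low-entropy values (phone numbers, ID numbers) can be reversed by enumeration, so a secret salt or a stronger keyed hash is advisable in practice.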

Flexible Scheduling Service

EsDataFactory supports time-based and event-based scheduling, such as arbitrary time intervals, file arrival, and script events. The scheduling settings support crontab expressions, allowing fine-grained control of schedule timing by month, week, day, hour, or minute. A time window can also be set for scheduling, with adjustable retry intervals.
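To show how a crontab expression selects run times, here is a minimal matcher for the classic five-field form (minute, hour, day of month, month, day of week). Whether EsDataFactory uses this five-field form or an extended variant is not stated in this document, so the format is an assumption, as are the names `expand` and `matches`. The sketch also simplifies real cron behavior by requiring the day-of-month and day-of-week fields to both match.

```python
from datetime import datetime

# Allowed value range per field: minute, hour, day of month, month, day of week.
RANGES = [(0, 59), (0, 23), (1, 31), (1, 12), (0, 6)]

def expand(field, lo, hi):
    """Expand one field ('*', 'a', 'a-b', '*/n', 'a-b/n', comma lists) into a set of ints."""
    values = set()
    for part in field.split(","):
        rng, _, step = part.partition("/")
        step = int(step) if step else 1
        if rng == "*":
            start, end = lo, hi
        elif "-" in rng:
            a, b = rng.split("-")
            start, end = int(a), int(b)
        else:
            start = end = int(rng)
        values.update(range(start, end + 1, step))
    return values

def matches(expr, when):
    """Return True if the datetime falls on a minute selected by the expression."""
    fields = expr.split()
    if len(fields) != 5:
        raise ValueError("expected five space-separated fields")
    minute, hour, dom, month, dow = (
        expand(f, lo, hi) for f, (lo, hi) in zip(fields, RANGES)
    )
    cron_dow = (when.weekday() + 1) % 7  # cron counts Sunday as 0, Python counts Monday as 0
    return (when.minute in minute and when.hour in hour and when.day in dom
            and when.month in month and cron_dow in dow)

# Hypothetical schedule: every 15 minutes during the 02:00 hour, Monday to Friday.
nightly = "*/15 2 * * 1-5"
runs_now = matches(nightly, datetime(2024, 6, 3, 2, 30))  # a Monday at 02:30
```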

Powerful Resource Backup and Recovery

EsDataFactory offers a multi-level resource-based backup mechanism, including the backup of resources such as data models, ETL processes, ETL process flows, and scheduling services. Backups can be manually triggered or automatically completed on a scheduled basis by the system. After the backup, the data package can be selectively restored, which is particularly important for system upgrades, migration, and rapid failure recovery.

Multi-Angle Operation Monitoring

EsDataFactory supports both overview-level execution monitoring and detailed process monitoring, allowing operations personnel at different levels to manage the system effectively. It provides graphical and list-based views of operational status, execution duration, process execution, and more, and it can track the execution of each ETL process and even each individual transformation.

Comprehensive Resource Monitoring System

EsDataFactory provides a comprehensive resource monitoring system that tracks system server resource usage, system access status, operation time consumption, system occupancy, and storage usage. By monitoring a series of resources, potential system issues can be detected and resolved promptly, ensuring the smooth operation of the system.

Multi-Level Login Authentication

EsDataFactory employs dual authentication mechanisms, including Ukey signature and user role permissions. This approach controls access based on the source of the request, preventing unauthorized access and reducing the risk of security incidents.

Technical Advantages

Intelligent Execution Engine

Built on an engine technology that integrates both database and in-memory computing, EsDataFactory can address data integration scenarios of any complexity. The execution engine analyzes the current execution environment and transformation complexity, selecting the optimal execution plan to complete transformation tasks. It minimizes or eliminates data storage during execution, ensuring transaction integrity and consistency.

Parallel Processing Technology

The execution engine can automatically optimize and merge transformation components and cleaning rules based on syntax parsing. Multiple components and transformation rules are processed and executed in parallel, significantly improving computational performance.

Universal Technology

Using an object-oriented design approach, EsDataFactory encapsulates a universal model-building method composed of spaces, management domains, models, and fields on top of the physical database layer. It provides a mechanism for creating and managing data models across platforms, databases, and database versions, as well as a complete API service for external integration.

Running Isolation Mechanism

EsDataFactory uses VFS (Virtual File System) technology to partition and store file resources between the physical and logical layers, achieving complete isolation between the design and runtime environments. This ensures the security and stability of the runtime environment and prevents conflicts between development and execution. Additionally, it provides synchronization mechanisms such as publishing and importing, ensuring that the development and runtime environments remain synchronized.

Pure Web-Based

Built using Java and J2EE technologies, EsDataFactory is based on a web service and B/S (browser/server) architecture. After the server application is deployed, clients can access it directly without installing any plugins. This setup supports multi-user collaboration, shared repositories, and agile development, eliminating the complexities of desktop migrations, version conflicts, and manual updates. The platform also supports data partitioning, parallel processing, and database batch loading interfaces to ensure high efficiency in data extraction and loading. At the same time, the execution engine automatically optimizes and merges transformation processes to avoid performance losses from redundant data retrieval, repeated associations, and multiple data storage operations.

High Reliability

EsDataFactory uses technologies such as data caching, breakpoint resumption, and error notifications. Detailed system logs, runtime logs, and operation logs, together with a comprehensive operational and resource monitoring system, ensure the stability and reliability of the whole system.

High Extensibility

EsDataFactory comes with a variety of built-in function libraries that offer algorithm compatibility across different database types and versions, and it supports custom functions. It also provides rich script components, stored procedure components, system command components, and secondary development interfaces to ensure the system's high extensibility.

High Security

EsDataFactory adopts comprehensive and rigorous secure coding practices, with strict validation of unsafe data. Passwords are protected using MD5 together with asymmetric algorithms, with regular reminders for weak passwords and a mandatory password change upon first login. Digital signature and user permission authentication mechanisms address data security risks at the source, and complete audit logs, tracking, and alert notifications help prevent data leakage and loss. Data transmission is encrypted, and a rich set of data masking algorithms is built in.

Contact Us

Telephone: 400-0011-866

Email: yixin@esensoft.com

Follow us on WeChat

Copyright 2006-2024 Beijing Esensoft Limited Liability Company