Connecting multiple data sources and dealing with raw, inconsistent data is a common problem — missing values, wrong formats, and duplicates often break downstream logic. This article shows how to build a structured, code-based process in .NET using a data flow architecture. Data is read, cleaned, validated, and stored in a database, while faulty records are logged. The flow processes records asynchronously and in parallel, making it suitable even for large data volumes.
April 1, 2025 by Andreas Lennartz14 minutes
In Part 2, we enhance the ETLBox data flow by adding custom error messages and implementing Slowly Changing Dimension Type 2 (SCD-2) to track changes in supplier data over time.
April 1, 2025 by Andreas Lennartz11 minutes
In Part 3, we transform our ETLBox data flow from a statically typed setup to a fully dynamic, configuration-driven process using ExpandoObject. This enables flexible handling of changing data structures and metadata-driven transformations.
April 1, 2025 by Andreas Lennartz15 minutes
In today’s world, managing data efficiently is crucial for businesses. One key task in data management is merging new data into existing database tables. Whether you’re adding new information or updating existing records, knowing how to do this properly can make a big difference. However, this task can be complex, and there are various approaches and paradigms you should be aware of.
June 12, 2024 by Andreas Lennartz16 minutes
Explore how to build ETL pipelines in C# using ETLBox inside polyglot notebooks. Learn to extract data from SQL Server, transform it dynamically, analyze it with DataFrames, and visualize results using ScottPlot—all within a single interactive notebook environment powered by .NET Interactive.
June 2, 2024 by Andreas Lennartz7 minutes
Traditional CRUD operations don’t scale well in ETL workflows. Handling data row by row creates unnecessary overhead, slowing down performance. This article explores how to transition from CRUD-based processing to a more efficient dataflow approach using ETLBox.
March 14, 2024 by Andreas Lennartz8 minutes
In the world of data, transformation is key. But not just any transformation—it's about transforming raw data into actionable insights. That's where ETL (Extract, Transform, Load) processes shine, and ETLBox is a .NET star in this domain. ETLBox offers a set of tools and components to build scalable and efficient ETL processes with minimal hassle. Whether you're a seasoned data engineer or a developer venturing into data warehousing, ETLBox could be your toolkit of choice.
November 10, 2023 by Andreas Lennartz20 minutes