Database Type Check
The DbTypeCheck is a component designed to check the type consistency of all properties of incoming records against their corresponding columns in an existing database table. This ensures that data being processed matches the expected types in the database, helping maintain data integrity and prevent errors during data flow operations.
Features
DbTypeCheck
ensures that incoming rows match the data types of the corresponding columns in a specified database table. By providing a TableName
and a ConnectionManager
, it retrieves the TableDefinition
automatically. Optionally, ColumnMapping
(similar to DbDestination
) can be applied. The component verifies that each property’s data type and value align with the mapped column.
As a transformation component, DbTypeCheck
has two outputs:
- Valid rows are passed through
LinkTo
. - Invalid rows can be redirected using
LinkFlawedTo
. If noLinkFlawedTo
is defined, an exception is thrown when encountering flawed records.
Usage Example
public class MyDataTypeRow
{
[DbColumnMap("IntCol")]
public int? SomeInt { get; set; }
[DbColumnMap("StringCol")]
public string SomeString { get; set; }
}
var check = new DbTypeCheck<MyDataTypeRow>(SqlConnection, "destination_table");
var dest = new DbDestination<MyDataTypeRow>(SqlConnection, "destination_table");
var flawedRowsDest = new MemoryDestination<MyDataTypeRow>();
source.LinkTo(check);
check.LinkTo(dest);
check.LinkFlawedTo(flawedRowsDest);
Customizing Type Checks
Provide a Custom Table Definition
By default, DbTypeCheck
retrieves the full table definition from the database. However, a custom TableDefinition
can be provided when only specific columns should be validated.
var tableDef = TableDefinition.FromTableName(SqlConnection, "destination_table");
tableDef.Columns = tableDef.Columns.Where(col => col.Name != "IgnoredColumn").ToList();
var check = new DbTypeCheck<MyDataTypeRow>(SqlConnection, "destination_table");
check.TableDefinition = tableDef;
Handling Invalid Records
Records that match the schema are passed via LinkTo
. Invalid records can be redirected using LinkFlawedTo
. If LinkFlawedTo
is omitted, an exception is thrown upon encountering an invalid row. Alternatively, LinkErrorTo
can be used to capture errors as ETLBoxError
objects.
Column Mapping
Column mapping works the same way as in DbDestination
. Using the same mapping in both components ensures consistency. Mapping can be done via attributes or by explicitly defining a mapping array.
var check = new DbTypeCheck(SqlConnection, "destination_table");
var dest = new DbDestination(SqlConnection, "destination_table");
var colMap = new[]
{
new DbColumnMap() { DbColumnName = "IntCol", PropertyName = "SomeInt" },
new DbColumnMap() { DbColumnName = "StringCol", PropertyName = "SomeString" }
};
dest.ColumnMapping = colMap;
check.ColumnMapping = dest.ColumnMapping;
Additional Configuration
Include Identity Columns
If identity (auto-increment) columns need to be included in validation, enable IncludeIdentityColumn
. This is useful when explicitly inserting values into an identity column.
var check = new DbTypeCheck<MyRow>(SqlConnection, "destination_table");
check.IncludeIdentityColumn = true;
var dest = new DbDestination<MyRow>(SqlConnection, "destination_table");
dest.AllowIdentityInsert = true;
AllPropertiesMustMatch
If true
, any extra properties in the input data that do not exist in the table schema cause the row to be marked as invalid.
CustomCheck
A user-defined validation rule can be applied after type checks. For example, rejecting negative Id
values:
var check = new DbTypeCheck<MyRow>(SqlConnection, "destination_table");
check.CustomCheck = row => row.Id > 0;
IncludeCharLength / IncludeBinaryLength
These options validate the length of CHAR
/VARCHAR
and BINARY
/VARBINARY
columns. If a value exceeds the allowed length, the row is flagged as invalid.
Example
public class MyDataTypeRow
{
public int Id { get; set; }
[DbColumnMap("IntCol")]
public int? SomeInt { get; set; }
[DbColumnMap("StringCol")]
public string SomeString { get; set; }
}
// Create demo table
string tableName = "dbtypecheck_example";
var tableCols = new List<TableColumn>
{
new TableColumn("Id", "INT", allowNulls: false, isPrimaryKey: true, isIdentity: true),
new TableColumn("IntCol", "INT", allowNulls: false),
new TableColumn("StringCol", "VARCHAR(5)", allowNulls: true),
};
DropTableTask.DropIfExists(SqlConnection, tableName);
CreateTableTask.CreateIfNotExists(SqlConnection, tableName, tableCols);
// Define test data
var source = new MemorySource<MyDataTypeRow>(
new List<MyDataTypeRow>()
{
new MyDataTypeRow() { SomeInt = 1, SomeString = "OK" }, // Pass
new MyDataTypeRow() { SomeInt = 2, SomeString = "TOO_LONG" }, // Exceeds VARCHAR(5)
new MyDataTypeRow() { SomeInt = null, SomeString = "X" }, // Null in non-nullable column
new MyDataTypeRow() { SomeInt = -1, SomeString = "X" } // Fails CustomCheck
});
// Configure DbTypeCheck
var check = new DbTypeCheck<MyDataTypeRow>(SqlConnection, tableName);
check.CustomCheck = row => row.SomeInt > 0;
// Define destinations
var dest = new DbDestination<MyDataTypeRow>(SqlConnection, tableName);
var flawedRowsDest = new MemoryDestination<MyDataTypeRow>();
// Link components
source.LinkTo(check);
check.LinkTo(dest);
check.LinkFlawedTo(flawedRowsDest);
// Execute network
Network.Execute(source);
Dynamic Object Support
Like all ETLBox components, DbTypeCheck
supports both strongly-typed objects (DbTypeCheck<MyRow>
) and dynamic ExpandoObject
(DbTypeCheck<ExpandoObject>
or DbTypeCheck
).
Handling Conversion Errors
DbTypeCheck
captures detailed information about failed type checks in the ConversionError
property. This helps in diagnosing data issues by providing explicit error messages for each flawed record.
ConversionError Class
- PropertyName: The column causing the error
- Reason: Why the conversion failed
- Value: The rejected value
Example with POCO
public class MyDataTypeRow
{
[DbColumnMap("IntCol")]
public int? SomeInt { get; set; }
[DbColumnMap("StringCol")]
public string SomeString { get; set; }
public ConversionError ConversionError { get; set; }
}
var check = new DbTypeCheck<MyDataTypeRow>(SqlConnection, "destination_table");
check.IncludeCharLength = true;
var source = new MemorySource<MyDataTypeRow>(
new List<MyDataTypeRow>
{
new MyDataTypeRow { SomeInt = null, SomeString = "Test" }, // Fails
new MyDataTypeRow { SomeInt = 1, SomeString = "Valid" }
});
var flawedRowsDest = new MemoryDestination<MyDataTypeRow>();
source.LinkTo(check);
check.LinkTo(new DbDestination<MyDataTypeRow>(SqlConnection, "destination_table"));
check.LinkFlawedTo(flawedRowsDest);
Network.Execute(source);
foreach (var row in flawedRowsDest.Data)
{
Console.WriteLine($"Error in property {row.ConversionError.PropertyName}: {row.ConversionError.Reason}");
}
Example with Dynamic Object
dynamic CreateExpandoObject(int? someInt, string someString)
{
dynamic expando = new ExpandoObject();
expando.SomeInt = someInt;
expando.SomeString = someString;
return expando;
}
var source = new MemorySource<ExpandoObject>(
new List<ExpandoObject>
{
CreateExpandoObject(null, "Test"), // Fails
CreateExpandoObject(1, "Valid")
});
var check = new DbTypeCheck<ExpandoObject>(SqlConnection, "destination_table");
var flawedRowsDest = new MemoryDestination<ExpandoObject>();
source.LinkTo(check);
check.LinkTo(new DbDestination<ExpandoObject>(SqlConnection, "destination_table"));
check.LinkFlawedTo(flawedRowsDest);
Network.Execute(source);
foreach (dynamic row in flawedRowsDest.Data)
{
Console.WriteLine($"Error in property {row.ConversionError.PropertyName}: {row.ConversionError.Reason}");
}