Difference between revisions of "Import Table Wizard"

From DataSelf Knowledge Base
Jump to navigation Jump to search
(Data Filter column)
Line 146: Line 146:
  
 
==== ''Data Filter'' column ====
 
==== ''Data Filter'' column ====
Filter applied during the data import from each table unioned.
+
Filter applied during the data import for each table unioned.
  
 
==== ''Data Source Name'' column ====
 
==== ''Data Source Name'' column ====

Revision as of 04:33, 29 March 2014

Import Table Wizard - Start window

Import Table Wizard - Start window


This window shows data sources that are found on the host computer’s ODBC Data Source Administrator or connections/data sources that are defined and saved in current ETL project. EDIT (see comments)


Data Source Column
  • For data sources connected via ODBC: Refers to the DSN (name) of an ODBC connection.
  • For data sources connected via other providers (such as Excel): Refers to the data source name saved for each connection.
Driver column
Type column
Add button

The Add button opens the Import Table Wizard - Data Source Settings window which allows the creation of new data sources providers.

Edit button
Delete button
Next button

Import Table Wizard - Data Source Settings Window

Data Source type options of Import Table Wizard - Data Source Settings window.

Navigation to here: From Import Table Wizard - Start click Add.
There are three versions of this wizard depending on the Data Source type. The options are: MS SQL SERVER, MS Excel, and MS Access.

Data Source type = "MS SQL SERVER"

ImportTableWizard DSS MSSLQSereverOption.png

Server Name field.

In the Server Name field enter the instance name of SQL Server. The instance name is the same name as used to login using Microsoft SQL Server Management Services.

SQL Server Instance Names
  • “local” should be written as “(local)”, “localhost” or the name of the machine.
  • If an name/instance requires the “Instance Name>” then it would be “(local)instance name>”.

NOTE: The instance name of an SQL Server is informally known as the server name.
SQL Server Setup sets the instance name to the computer name during installation.

To find an SQL Server’s Instance Name:
  1. open SQL Server Configuration Manager (search for it in the Start menu).
  2. Click on SQL Server Services.
    The instance name of SQL Server is in parenthesis inline with SQL Server service. If it says MSSQLSERVER, then it’s the default instance.
  3. To connect to it in Management Studio, just type . (dot) OR (local) and click Connect.
    If the instance name is different, then use .instance name] to connect to it (for example if the instance name is SQL2008, connect to .2008).

Also make sure SQL Server and SQL Server Browser services are running, otherwise you won’t be able to connect.

Data Source type = “MS Excel”

MS Access option of Import Table Wizard - Data Source Settings window

In versions of the ETL prior to version 2013.002.xx access to Excel files required an ODBC connection. While the older ODBC connections still work the Import Table Wizard - Data Source Settings window configures a connection using a technology called an Excel provider.

Finish button

Clicking the Finish button attempts to configure a data connection to the specified Excel file on the host computer.

NOTES:

  • For Excel files the data connector (the software which runs on the host computer) is also known as an Excel provider.


Excel version pull-down list box

The Microsoft Excel 2007 option is compatible with all versions of Excel files (e.g. .xls, xlsx, xlsm)

Data Source type = “MS Access”

MS Access option of Import Table Wizard - Data Source Settings window


Import Table Wizard - Select Tables window

Navigation to here: From Import Table Wizard - Start click Next.

Import Table Wizard - Select Tables window.

Advanced button


Import Table Wizard - Table ... - Select Fields window

Navigation to here: From Import Table Wizard - Select Tables click Next.

Import Table Wizard - Table ... - Optional: Select Data Grouping window

Navigation to here: From Import Table Wizard - Select Fields click Next.


"var char" message

NOTE: The MediaWiki software does not allow us to show "var char" as a single word.

Import Table Wizard - Finish window

Union option selected.

Import Table Wizard ... - Table Union

Imports and combines data from the AR_Customer tables from the DSDW_Sage100 and DSMAS90200 data sources respectively.

Union Concepts

An ETL Table Union will merge two or more source tables into a single table in the data warehouse. Here are some examples of application of this feature:

  • When combining archived and current data (ex.: invoices) so people can analyze all historical data from a single target table.
  • When consolidating multi-company systems into a single data warehouse. For instance, the invoices from 3 source ERPs will merge into a single target invoice table. Users will be able to easily analyze invoices from all 3 ERPs from a single target table, and then use filters to slice it.

The ETL Table Union is based upon the SQL's UNION operator. In SQL the UNION operator combines the result of two or more SELECT statements.

Target Structure

The fields selected in the Import Table Wizard#Import Table Wizard - Table ... - Select Fields window define the field names and data types of the data that will be combined in the table imported by the union operation; call this the target structure (better name?).


The source tables must have the same column names and data types as the target structure.

For instance:

  • Source table 1 columns: CustName (char 50), CustNo (char 10), CreditStatus (int)
  • Source table 2 columns: CustName (char 50), CustNo (char 10), City (char 50)
  • target structure: CustName (char 50), CustNo (char 10), TableID.


NOTE: The source tables may have columns not listed in the target structure. Those columns will not be imported and otherwise will be ignored by the union operation.

Columns

These columns form the table in the upper panel of the wizard, and show what's been configured for the current Table Union. To change this configuration:

  • Select a line in the white panel and enter new values in the boxes below the white panel.
  • Click "Add Table" button to add a new line and configure its parameters in the boxes below the white panel.

TableId column

The ETL Table Union adds a column named TableID that can be used to indicate the source of the data in each row. On the example pictured above, the table union is combining records from the AR_Customer table from DSDW_SAGE100 and DSMAS90200 data sources. In the target table, records from DSDW_SAGE100 will be flagged with TableID 001 and the other with 002.

Sage 300 and Sage 500 already have internal columns to handle multi-company setups. So you do NOT have to populate the TableID when doing unions for multicompany for these systems.

Table Name column

Name of the source tables to be unioned.

Data Filter column

Filter applied during the data import for each table unioned.

Data Source Name column

Data source from each each table will be imported from.

Test table and field structure check box

When checked the ETL validates if the column imported from the source tables match the target structure.

NOTE: This process can be time consuming when working with large source tables, so you may consider disabling when you are sure the source data structures match.