Professional Documents
Culture Documents
Synopsis:
Introduction to Informatica 7x Importance of Informatica for ETL Applications Informatica Architecture Informatica Client module Informatica Server Transformations in Informatica
Informatica Suite
Informatica PowerCenter
Informatica PowerMart
All product functionality, Includes all features except including the ability to register distributed metadata, multiple multiple servers. registered servers.
A PowerCenter lets you create a single repository that you can configure as a global repository.
Extraction, Transformation, Loading Can extract large volumes of data from multiple platforms, handle complex transformations on the data, and support high-speed loads. It can simplify and accelerate the process of moving data warehouses from development to test to production.
Informatica Architecture
Sources
PowerCenter and PowerMart access the following sources: Relational. File.
Application.
Mainframe. Other. Microsoft Excel and Access.
Informatica Architecture
Targets
PowerCenter and PowerMart can load data into the following targets: Relational.
File.
Application. Other. Microsoft Access. You can load data into targets using ODBC or native drivers, FTP, or external loaders.
Informatica Architecture
Informatica provides the following integrated components: Informatica repository. The Informatica repository is at the center of the Informatica suite. The Informatica Client and Server access the repository to save and retrieve metadata. Informatica Repository Server. The Informatica Repository Server manages connections to the repository from client applications. Informatica Client. Use the Informatica Client to manage users, define sources and targets, build mappings and mapplets with the transformation logic. Informatica Server. The Informatica Server extracts the source data, performs the data transformation, and loads the transformed data into the targets.
Informatica Architecture
Server
Source Analyzer
Sources
Target
Repository Manager
Designer
Warehouse Designer
TransformationDevelope r
Repository
Connectivity
Informatica Client
Repository Manager. Use the Repository Manager to create and administer the metadata repository
Designer. Use the Designer to create mappings that contain transformation instructions for the Informatica Server. Source Analyzer. Import or create source definitions. Warehouse Designer. Import or create target definitions. Transformation Developer. Develop reusable transformations to use in mappings. Mapplet Designer. Create sets of transformations to use in mappings. Mapping Designer. Create mappings that the Informatica Server uses to extract, transform, and load data. Workflow Manager. Use the Workflow Manager to create, schedule, and run workflows. Workflow Monitor. Use the Workflow Monitor to monitor scheduled and running workflows for each Informatica Server.
10
11
Use the Administration Console to add repository configurations to the Console Tree. When you add a repository configuration, you can perform the following actions: Create a repository in a database. Change the Repository Server managing the repository. Upgrade an existing repository from an earlier version.
12
13
16
17
18
Output
19
Navigator. Use to connect to and work in multiple repositories and folders. You can also copy and delete objects and create shortcuts using the Navigator.
Workspace. Use to view or edit sources, targets, mapplets, transformations, and mappings. You can work with a single tool at a time in the workspace. Status bar. Displays the status of the operation you perform. Output. Provides details when you perform certain tasks, such as saving your work or validating a mapping. Overview. An optional window to simplify viewing workbooks containing large mappings or a large number of objects.
20
Transformations
Transformations are the manipulation of data from how it appears in the source system(s) into another form in the data warehouse.
This includes
Data merging: Process of standardizing data types and fields. Cleansing: This involves identifying any changing inconsistencies or inaccuracies. Eliminating inconsistencies in the data from multiple sources. Converting data from different systems into single consistent data set suitable for analysis. Aggregation: The process where by multiple detailed values are combined into a single summary value typically summation numbers representing dollars spend or units sold.
21
Transformations in Informatica
Aggregator Transformation: The Aggregator transformation allows you to perform aggregate calculations, such as averages and sums.
Expression Transformation: Expression transformation to calculate values in a single row before you write to the target. You can use the Expression transformation to perform any non-aggregate calculations.
Advanced External Procedure Transformations : Advanced External Procedure transformations operate in conjunction with procedures you create outside of the Designer interface to extend PowerCenter/PowerMart functionality. External Procedure Transformations : External Procedure transformations operate in conjunction with procedures you create outside of the Designer interface to extend PowerCenter/PowerMart functionality. Filter Transformation : Filter transformation provides allows you to filter rows in a mapping. Rank Transformation: Allows you to select only the top or bottom rank of data. Router Transformation: Is similar to Filter transformation but it used for two or more filter conditions.
22
Transformations in Informatica
Joiner transformation : Joiner transformation joins two related heterogeneous sources residing in different locations or file systems. The combination of sources can be varied. You can use the following sources:
Normalizer Transformation: Normalization is the process of organizing data. use the Normalizer transformation with COBOL sources, which are often stored in a denormalized format
Sequence Generator Transformation: Is used for generates numeric values Stored Procedure Transformation: Is an important tool for populating and maintaining databases.
23
Transformations in Informatica
Sorter transformation Transformation : Sorter transformation allows you to sort data. You can sort data from a source transformation in ascending or descending order according to a specified sort key. Source Qualifier Transformation : Source Qualifier represents the rows that the Informatica Server reads when it executes a session. The Source Qualifier displays the transformation datatypes. The transformation datatypes in the Source Qualifier determine how the source database binds data when the Informatica Server reads it. XML Source Qualifier Transformation: When you add an XML source definition to a mapping, you need to connect it to an XML Source Qualifier transformation. Update Strategy Transformation: To update the target based on the flag values.
24
25
26
27
Informatica Server
The Informatica Server moves data from sources to targets based on workflow and mapping metadata stored in a repository. A session is a type of workflow task. A session is a set of instructions that describes how to move data from sources to targets using a mapping. When a workflow starts, the Informatica Server retrieves mapping, workflow, and session metadata from the repository to extract data from the source, transform it, and load it into the target. The Informatica Server uses the following processes to run a workflow:
The Load Manager process. Starts and locks the workflow, runs workflow tasks, and starts the DTM to run sessions.
The Data Transformation Manager (DTM) process. Performs session validations. Creates threads to initialize the session, read, write, and transform data.
28
29
Questions ? ?
31
Thank You
32