Data Warehouse Interview Questions: Data WareHousing Interview Questions Part 10

Q.What r the joiner caches?
A.Specifies the directory used to cache master records and the index to these records. By default, the cached files are created in a directory specified by the server variable $PMCacheDir. If you override the directory, make sure the directory exists and contains enough disk space for the cache files. The directory can be a mapped or mounted drive.In the source, if we also have duplicate records and we have 2 targets, T1- for unique values and T2- only for duplicate values.

Q.How do we pass the unique values to T1 and duplicate values to T2 from the source to these 2 different targets in a single mapping?
A.source--->sq--->exp-->sorter(with enable select distinct check box)--->t1--->aggregator(with enabling group by and write count function)--->t2
If we want only duplicates to t2 u can follow this sequence --->agg(with enable group by write this code decode(count(col),1,1,0))--->Filter(condition is 0)--->t2.

Q.What are the diffrence between joiner transformation and source qualifier transformation?
A.
1.Source qualifier – Homogeneous source
2.Joiner – Heterogeneous source

Q.While importing the relational source defintion from database, what are the meta data of source you import?
A.
1.Source name
2.Database location
3.Column names
4.Datatypes
5.Key constraints.

Q.What are the unsupported repository objects for a mapplet?
A.
1.Source definitions:Definitions of database objects (tables, views, synonyms) or files that provide source data.
2.Target definitions: Definitions of database objects or files that contain the target data.
3.Multi-dimensional metadata: Target definitions that are configured as cubes and dimensions.
4.Mappings: A set of source and target definitions along with transformations containing business logic that you build into the transformation. These are the instructions that the Informatica Server uses to transform and move data.

5.Reusable transformations:Transformations that you can use in multiple mappings.
6.Mapplets:A set of transformations that you can use in multiple mappings.
7.Sessions and workflows: Sessions and workflows store information about how and when the Informatica Server moves data. A workflow is a set of instructions that describes how and when to run tasks related to extracting, transforming, and loading data. A session is a type of task that you can put in a workflow. Each session corresponds to a single mapping.

Q.What are the types of metadata that stores in repository?
A.Multi-dimensional metadata. Target definitions that are configured as cubes and dimensions.

Q.What are the reusable transforamtions?
A.Reusable transformations can be used in multiple mappings.When you need to incorporate this transformation into maping,U add an instance of it to maping.Later if you change the definition of the transformation ,all instances of it inherit the changes.Since the instance of reusable transforamation is a pointer to that transforamtion,You can change the transforamation in the transformation developer,its instances automatically reflect these changes.This feature can save you great deal of work.

Q.What are the types of maping in Getting Started Wizard?
A.
1.Simple Pass through maping :
Loads a static fact or dimension table by inserting all rows. Use this mapping when you want to drop all existing data from your table before loading new data.
2.Slowly Growing target :
Loads a slowly growing fact or dimension table by inserting new rows. Use this mapping to load new data when existing data does not require updates.

Q.What are the types of maping wizards that r to be provided in Informatica?
A.
1.Simple Pass through
2.Slowly Growing Target
3.Slowly Changing the Dimension

Type1
Most recent values

Type2
1.Full History
2.Version
3.Flag
4.Date

Type3
Current and one previous

Q.What are Dimensions and various types of Dimensions?
A.Set of level properties that describe a specific aspect of a business, used for analyzing the factual measures of one or more cubes, which use that dimension. Egs. Geography, time, customer and product.

Q.What are the session parameters?
A.Session parameters are like maping parameters,represent values you might want to change between sessions such as database connections or source files.
Server manager also allows you to create userdefined session parameters.Following are user defined session parameters:-
1.Database connections:Source file names: use this parameter when you want to change the name or location of
session source file between session runs.
2.Target file name : Use this parameter when you want to change the name or location of session target file between session runs.
3.Reject file name : Use this parameter when you want to change the name or location of session reject files between session runs.

Q.What is Session and Batches?
A.
1.Session - A Session Is A set of instructions that tells the Informatica Server How And When To Move Data From Sources To Targets. After creating the session, we can use either the server manager or the command line program pmcmd to start or stop the session.
2.Batches - It Provides A Way to Group Sessions For Either Serial Or Parallel Execution By The Informatica Server.

There Are Two Types Of Batches :
1.Sequential - Run Session One after the Other.
2.Concurrent - Run Session At The Same Time.

Q.If a session fails after loading of 10,000 records in to the target.How can u load the records from 10001 th record when u run the session next time in informatica 6.1?
A.Running the session in recovery mode will work, but the target load type should be normal. If its bulk then recovery wont work as expected

Q.Whats the diff between Informatica powercenter server, repositoryserver and repository?
A.Repository is a database in which all informatica componets are stored in the form of tables. The reposiitory server controls the repository and maintains the data integrity and Consistency across the repository when multiple users use Informatica. Powercenter Server/Infa Server is responsible for execution of the components (sessions) stored in the repository.

Q.How can you access the remote source into your session?
A.
1.Relational source: To acess relational source which is situated in a remote place ,u need to configure database connection to the datasource.
2.FileSource : To access the remote source file you must configure the FTP connection to the host machine before you create the session.
3.Hetrogenous : When U’r maping contains more than one source type,the server manager creates a hetrogenous session that displays source options for all types.

Q.What is rank transformation?where can we use this transformation?
A.Rank transformation is used to find the status.ex if we have one sales table and in this if we find more employees selling the same product and we are in need to find the first 5 0r 10 employee who is selling more products.we can go for rank transformation.In update strategy target table or flat file.

Q.which gives more performance ? why?
A.Pros: Loading, Sorting, Merging operations will be faster as there is no index concept and Data will be in ASCII mode.
Cons: There is no concept of updating existing records in flat file.As there is no indexes, while lookups speed will be lesser.

Q.What is a command that used to run a batch?
A.pmcmd is used to start a batch.

Q.What are the mapping paramaters and maping variables?
A.Please refer to the documentation for more understanding.
Mapping variables have two identities:
Start value and Current value
Start value = Current value ( when the session starts the execution of the undelying mapping)
Start value Current value ( while the session is in progress and the variable value changes in one ore more occasions)
Current value at the end of the session is nothing but the start value for the subsequent run of the same session.

Data Warehouse Interview Questions

HowToGetSoftwareJob

Monday, 26 March 2012

Data WareHousing Interview Questions Part 10

No comments:

Post a Comment

Stats

About Me