Learn DataStage Course
Course Curriculum
Curriculum
Data Warehouse Fundamentals
- Data Marts
- Introduction of Data warehousing
- purpose of Data warehouse
- Architecture of Data warehouse
- OLAP & OLTP Methods
Data Modeling
- Introduction of Data Modeling
- Entity Relationship Model
- Types of Modeling (Logical & Physical)
- Types of schema’s
- Types of tables
- Dimension & Fact table relationship.
Process of ETL (Extraction, Transaction & Load)
- Introduction of ETL
- Types of ETL tools
- Key tools in the market
- Installation Process
- Database and OS requirements
- SMP & MPP
Introduction of IBM Datastage
- Datastage components
- Designer client
- Director client
- Administrator client
- Web Console
- Datastge Parallelism
- Portioning techniques
- Creation of jobs & types of jobs
- configuration files & Environment Variables
- Parameters & parameter set
- compilation & job run
- Designer & Director Repository
- Parallel Pallet
- General – link, containers
- File stages – Sequential files, Dataset etc
- Database stages – Oracle, DB2, ODBC connectors etc
- Processing stages
- Filter, Sort, Aggregate, Join, Lookup, Merge, Transformer – Stage variables, Constraints, all functions, Aggregate, Copy, Change Capture, Change Apply, Funnel, Remove Duplicate, Surrogate Key Generator, Pivot stage, SCD stages and etc.
- Debugging stages – Peak, row generator, column generator etc.
- Quality stage – Investigate, Standardize, Match Frequency, Reference Match Stage etc.
- Sequencer Pallet
- Job Activity
- Job sequencer
- Start loop Activity
- End loop Activity
- Notification Activity
- Terminator Activity
- Nested Condition Activity
- Exception handling Activity
- Execute Command Activity
- Wait for file Activity
- User variable Activity
- Adding Check Points
- Restart able
Triggering methods
- Types of triggers
- Command line output
- System output
- Ereplace output
Job scheduling, monitoring, status check
- Scheduling jobs using Director & ESP
- ESP Basics
- Monitoring methods
Performance tuning techniques
- Job level tuning
- Stage level tuning
RCP, Multiple instance
- RCP (Run time column propagator) Usage areas
- Ways of using multiple instances
Import & export of DSX & DS job run using UNIX.
- Importing of Datastage jobs.
- Exporting of Datastage jobs.
- QA & Production deployment methods
- Unix commands execution in Datastage jobs.
DataStage Administrator
- Create Project
- Delete Project
- Protect Project
- Permissions
Appendix
- Real time complex scenarios and FAQ
- Resume preparation
- Interview & certification FAQS
- How to face the interview
- Project preparation with data analysis methods.