ETL/Datastage Developer Resume

ETL/Datastage Developer

Primary Skills
ETL, datawarehouse design

US-NJ-Jersey City (will consider relocating)


Professional Summary:

• Over 7 years of experience in Data modeling, Datawarehouse Design, Development and Testing using ETL and Data Migration life cycle using IBM WebSphere DataStage 8.x/7.x
• Expertise in building Operational Data Store (ODS), Data Marts, and Decision Support Systems (DSS) using Multidimensional Model(Kimball and Inmon),Star and Snowflake schema design.
• Experience in analyzing the data generated by the business process, defining the granularity, source to target mapping of the data elements, creating Indexes and Aggregate tables for the data warehouse design and development.
• Data Processing experience in designing and implementing Data Mart applications, mainly transformation processes using ETL tool DataStage (Ver8.0/7), designing and developing jobs using DataStage Designer, Data Stage Manager, DataStage Director and DataStage Debugger.
• Efficient in all phases of the development lifecycle, coherent with Data Cleansing, Data Conversion, Performance Tuning and System Testing.
• Excellent in using highly scalable parallel processing Infrastructure using DataStage Parallel Extender.
• Efficient in incorporation of various data sources such as Oracle, MS SQL Server, and DB2, Sybase, XML and Flat files into the staging area.
• Experience in Mapping Server/parallel Jobs in DataStage to populate tables in Data warehouse and Data marts.
• Proven track record in addressing production issues like performance tuning and enhancement.
• Excellent knowledge in creating and managing Conceptual, Logical and Physical Data Models.
• Experience in dimensional and relational database design.
• Strong in Data warehousing concepts, dimensional Star Schema and Snowflakes Schema methodologies.
• Expert in unit testing, system integration testing, implementation, maintenance and performance tuning.
• Experience in different Scheduling tools like AutoSys for automating and scheduling jobs run.
• Excellent with PL/SQL, T-SQL, Stored Procedures, Database Triggers and SQL * Loader.
• Experience in UNIX Shell Scripting.
• Excellent knowledge of operating systems Windows, UNIX, Macintosh, and databases including Oracle, SQL Server,and DB2.
• Experience in implementing Quality Processes like ISO 9001:2000/Audits.
• Detail oriented with good problem solving, organizational, analysis, highly motivated and adaptive with the ability to grasp things quickly.
• Ability to work effectively and efficiently in a team and individually with excellent interpersonal, technical and communication skills.

Education Qualifications:
Masters in Electrical and Computer Engineering.

Skill Sets:
IBM Information Server V8.1(DataStage, QualityStage, Information Analyzer), Ascential DataStage V7.5 (Designer, Director, Manager, Parallel Extender)., Oracle 8i/9i/10g, MS SQL Server 2005/2008, DB2 UDB, MS Access, Sybase, SQL, PL/SQL, SQL*Plus, Flat files, Sequential files, TOAD 9.6, Erwin, Microsoft Visio, Oracle Developer 2000, SQL*Loader, IBM Cognos 8.0, IBM AIX UNIX, Red Hat Enterprise Linux 4, UNIX Shell Scripting, Windows NT,/XP, Macintosh,C,C++,VB scripting.

Project Summary:

Prudential Financial, Newark,NJ 1/2009- Present
DataStage Developer
Prudential Financial, Inc. is a Fortune Global 500, provides insurance, investment management, and other financial products and services to both retail and institutional customers throughout the United States and in over 30 other countries.
The project objective was to collect, organize and store data from different operational data sources to provide a single source of integrated and historical data for the purpose of reporting, analysis and decision support to improve the client services.

IBM DataStage 8.0 (Designer, Director, Manager, Parallel Extender), Oracle 10g,SQL Server 2008, DB2 UDB, Flat files, Sequential files, Autosys, TOAD 9.6, SQL*Plus, AIX UNIX, IBM Cognos 8.0

• Interacted with End user community to understand the business requirements and in identifying data sources.
• Analyzed the existing informational sources and methods to identify problem areas and make recommendations for improvement. This required a detailed understanding of the data sources and researching possible solutions.
• Implemented dimensional model (logical and physical) in the existing architecture using Erwin.
• Studied the PL/SQL code developed to relate the source and target mappings.
• Helped in preparing the mapping document for source to target.
• Worked with Datastage Manager for importing metadata from repository, new job Categories and creating new data elements.
• Designed and developed ETL processes using DataStage designer to load data from Oracle, MS SQL, Flat Files (Fixed Width) and XML files to staging database and from staging to the target Data Warehouse database.
• Used DataStage stages namely Hash file, Sequential file, Transformer, Aggregate, Sort, Datasets, Join, Lookup, Change Capture, Funnel, Peek, Row Generator stages in accomplishing the ETL Coding.
• Developed job sequencer with proper job dependencies, job control stages, triggers.
• Used QualityStage to ensure consistency, removing data anomalies and spelling errors of the source information before being delivered for further processing.
• Excessively used DS Director for monitoring Job logs to resolve issues.
• Involved in performance tuning and optimization of DataStage mappings using features like Pipeline and Partition Parallelism and data/index cache to manage very large volume of data.
• Documented ETL test plans, test cases, test scripts, and validations based on design specifications for unit testing, system testing, functional testing, prepared test data for testing, error handling and analysis.
• Used Autosys job scheduler for automating the monthly regular run of DW cycle in both production and UAT environments.
• Verified the Cognos Report by extracting data from the Staging Database using PL/SQL queries.
• Wrote Configuration files for Performance in production environment.
• Participated in weekly status meetings.

Kaiser Permanente, Pleasanton, CA 05/2007-- 12/2008
ETL Designer/ DataStage Developer

Kaiser Permanente is an integrated managed care organization, is the largest health care organization in the United States. The Health Plan and Hospitals operate under state and federal non-profit tax status, while the Medical Groups operate as for-profit partnerships or professional corporations in their respective regions.

The project was to design, develop and maintain a data warehouse for their vendor's data, internal reference data and work with their DBA to ensure that the physical build adheres to the model blueprint.

DataStage 7.5.1 Enterprise Edition, Quality Stage, Flat files,Oracle10g, SQL Server -2005/2008, Erwin 4.2, PL/SQL, UNIX, Windows NT/XP

• Involved in understanding of business processes and coordinated with business analysts to get specific user requirements.
• Studied the existing data sources with a view to know whether they support the required reporting and generated change data capture request.
• Used Quality Stage to check the data quality of the source system prior to ETL process.
• Worked closely with DBA's to develop dimensional model using Erwin and created the physical model using Forward Engineering.
• Worked with Datastage Administrator for creating projects, defining the hierarchy of users and their access.
• Defined granularity, aggregation and partition required at target database.
• Involved in creating specifications for ETL processes, finalized requirements and prepared specification document.
• Used DataStage as an ETL tool to extract data from sources systems, loaded the data into the SQL Server database.
• Imported table/file definitions into the Datastage repository.
• Performed ETL coding using Hash file, Sequential file, Transformer, Sort, Merge, Aggregator stages compiled, debugged and tested. Extensively used stages available to redesign DataStage jobs for performing the required integration.
• Extensively used DataStage Tools like Infosphere DataStage Designer, Infosphere DataStage Director for developing jobs and to view log files for execution errors.
• Controlled jobs execution using sequencer, used notification activity to send email alerts.
• Ensured that the data integration design aligns with the established information standards.
• Used Aggregator stages to sum the key performance indicators used in decision support systems.
• Scheduled job runs using DataStage director, and used DataStage director for debugging and testing.
• Created shared containers to simplify job design.
• Performed performance tuning of the jobs by interpreting performance statistics of the jobs developed.
• Documented ETL test plans, test cases, test scripts, and validations based on design specifications for unit testing, system testing, functional testing, regression testing, prepared test data for testing, error handling and analysis.

Macy's, Atlanta,GA 12/2005-- 04/2007
ETL Developer
Macy's (NYSE: M) is a chain of mid-to-high range American department stores delivering fashion and affordable luxury to customers coast to coast. Online shopping is offered through Its selection of merchandise can vary significantly from location to location, resulting in the exclusive availability of certain brands in only higher-end stores.

The aim of the Project was to build a data warehouse, which would keep historical data according to a designed strategy. Flat files, Oracle tables were part of the source data, which came in on a daily, weekly, monthly basis.

IBM Information Server DataStage 7.5, Oracle 10g, SQL, PL/SQL, UNIX, SQL*Loader, Autosys, Business Objects 6.1, Windows 2003, IBM AIX 5.2/5.1, HP Mercury Quality Center 9.0

• Involved in understanding of business processes and coordinated with business analysts to get specific user requirements.
• Used Information Analyzer for Column Analysis, Primary Key Analysis and Foreign Key Analysis.
• Extensively worked on DataStage jobs for splitting bulk data into subsets and to dynamically distribute to all available processors to achieve best job performance.
• Developed ETL jobs as per business rules using ETL design document
• Converted complex job designs to different job segments and executed through job sequencer for better performance and easy maintenance.
• Used DataStage maps to load data from Source to target.
• Enhanced the reusability of the jobs by making and deploying shared containers and multiple instances of the jobs.
• Imported the data residing in the host systems into the data mart developed in Oracle 10g.
• Extensively used Autosys for automation of scheduling jobs on daily, bi-weekly, weekly monthly basis with proper dependencies.
• Wrote complex SQL queries using joins, sub queries and correlated sub queries
• Performed Unit testing and System Integration testing by developing and documenting test cases in Quality Center.
• Validated the report generated using Business Objects using PL/SQL queries.
• Worked on troubleshooting, performance tuning and performances monitoring for enhancement of DataStage jobs and builds across Development, QA and PROD environments.

Citibank Inc., New York, NY 12/2004--11/2005
DataStage Developer
Citibank, the leading global banking company, has some 200 million customer accounts and does business in more than 100 countries, providing services to consumers, corporations, governments and institutions.

The project was to transform the data coming from various sources through multiple stages before being loaded into the data warehouse and maintenance.

Ascential DataStage 7.0(Designer, Manager, Director), Oracle 9i, MS Access, SQL-Server 2000/2005,SQL, PL/SQL, Toad, UNIX

• Involved in understanding of business processes to learn business requirements.
• Extracted data from different systems into Source. Mainly involved in ETL developing.
• Defined and implemented approaches to load and extract data from database using DataStage.
• Worked closely with data warehouse architect and business intelligence analyst in developing solutions.
• Used Erwin for data modeling (i.e. modifying the staging and SQL scripts on Oracle and MS Access Environments).
• Involved in design, source to target mappings between sources to operational staging targets, using DataStage Designer.
• Performed ETL coding using Hash file, Sequential file, Transformer, Sort, Merge, Aggregator stages compiled, debugged and tested. Extensively used stages available to redesign Data Stage jobs for performing the required integration.
• Executed jobs through sequencer for better performance and easy maintenance.
• Involved in unit, performance and integration testing of Data Stage jobs.
• Used Data Stage Director to run and monitor the jobs for performance statistics.
• Involved in performance tuning of the jobs.
• Used T-SQL for validating the data generated at OLAP server.

Wipro Technologies, Bangalore 08/2003--11/2004 Quality Assurance Engineer

Wipro Technologies is an Indian Multinational, a leading provider of integrated business, technology and process solutions on a global delivery platform.

Worked as a QA tester on web based E-billing application. The application has two modules account payable (AP) and account receivable (AR) to keep track of transactions.
Windows XP,,Test Director 7.2,WinRunner 7.0,SQL server 2000

• Active participation in decision making and QA meetings and regularly interacted with the Business Analysts &development team to gain a better understanding of the Business Process, Requirements & Design.
• Analyzed business requirements with perspective to Black Box Testing, system architecture/design and converted them into functional requirements/test cases.
• Used Test Director to document the requirements and created traceability matrices for the requirements.
• Developed Test Plan that included the scope of the release, entrance and exit criteria and overall test strategy. Created detailed Test Cases and Test sets and executed them manually.
• Performed Cross-Browsing testing to verify if the application provides accurate information in different (IE, Netscape, Firefox, Safari) browsers.
• Extensively used Output and Checkpoint for verifying the UI properties and values using VB scripting.
• Back-End Database verification manually and using WinRunner to automatically
• Verify Database with the values entered during automated testing.
• Performed the Back-End integration testing to ensure data consistency on front-end by writing and executing SQL Queries. Provided management with metrics, reports, and schedules and was responsible for entering, tracking bugs.
• Ensured that the Defect was always written with great level of detail.

See above


You must be logged in and have a current resume access subscription. Login or Register »

Related: Resumes | IT Resumes in Jersey City, NJ | IT Resumes in New Jersey