HowToGetSoftwareJob

HowToGetSoftwareJob
HowToGetSoftwareJob Google Hangout

Wednesday, 25 April 2012

how to get job in datawarehouse

Data Warehouse Development
Software projects usually begin by gathering requirements and then building what is needed. Data warehouse (DWH) projects on the other hand typically begin by building what is needed, and only then do you wind up with requirements. This calls for a radically different approach to development and project planning.
Inmon et al (2008): “… the requirements for a data warehouse are often not known when it is first built.” Instead, confusion and ambiguity reign. Gathering DWH requirements involves negotiating among stakeholders which functionality should be delivered first, and ensuring robust alignment with corporate strategy. Given ubiquitous lack of strategic clarity, this is no easy task.
1. Be Ready For The Information Democracy
When organizations embark on their first DWH project, they often strive to establish “one version of the truth.” ‘Spreadmarts’ (data silos maintained by individuals in spreadsheet programs) across the organization need to be surpassed by a central place to turn to for the definitive truth. And indeed a DWH can be very effective for that.
What is much less obvious, and often not anticipated, is that a dramatic change in (informal) power structure will take place. Poignant facts about business success (and lack thereof…) will emerge, waiting for anyone to be discovered in the DWH. What this does is that it shifts the “informal power structure” from those in command to those in control of information gathering skills. This can upset existing hierarchies, and trigger significant resistance.
2. Every DWH Starts With A Business Case
Sometimes data warehouses are needed to meet regulatory requirements. One could argue that your DWH then is a conditio sine qua non, a ticket to market. All other cases should definitely be founded on a solid business case. Not only to “justify” investments being made, but certainly also to provide guidance on setting priorities within the project.
When the going gets tough, your business case reminds you why you got started in the first place. It provides a tangible manifestation of successful execution of business strategy (see also tip# 5), justifying management attention and corporate sponsorship. Every DWH project runs into setbacks, in particular during the extract, transform, load (ETL) phase (see also tip# 4). When you’re having second thoughts, or the project needs to “compete” for resources, the business case will see you through.
3. Deliver Your DWH in Increments
For all the (largely pointless) controversy between Inmon and Kimball, one thing is clear: a DWH needs to be delivered incrementally. Inmon fully agrees but his approach to enterprise data warehouses (EDW’s) appears much less suited for an incremental, bottom-up implementation. For an EDW, however, Kimball’s architecture has some major challenges. Conforming departmental data marts (DM’s) is difficult enough as it is, but changes in grain (the minimal level of detail available) wreak havoc and invariably cause major scrap and rework.
What is required to make incremental delivery of a DWH robust, is a top-down architecture in combination with bottom-up implementation. It’s the only way you can gracefully deal with (unavoidable) change. Since 2/3 of TCO will go to maintenance (itself, largely building in changes) the architecture should account for this.
4. Test For (Acceptable) Data Quality First
As you are mapping information requirements onto the existing data systems in the enterprise, some sources may prove to be more “mission critical” for the DWH than others. Unless data quality programs and a (working) master data management (MDM) solution are in place, you can never take sufficient data quality for granted. Therefore do preliminary data profiling as early as possible, to give management a “feel” for what data quality is like, and work these findings into your project estimates.
One company we worked with wanted to learn from their DWH who their best customers were. However, they implemented a sales force automation tool that had failed spectacularly. Many reps were not providing data at all, and those that did, provided irreconcilable naming. The same client could go under a dozen names, or more! There was no way the DWH could enable tallying up cost and revenue per client. Unwelcome as it may be, you’re better of learning these things sooner than later!
5. Profile Your Source Systems To Plan Your Project
About 60% or more of DWH projects goes to the ETL phase. Incidentally, this is also where the greatest risks for project overruns lie, and certainly not only because it is such a large chunk of the work. Poor data quality, missing, incorrect, or out of data source specifications, a shortage of business domain expertise, transformation complexities, and many other things can go wrong here. Kimball (2008): “Due to all the unknown data realities hidden in your source system data, data staging processes have a well-earned reputation of being nearly impossible to estimate and deliver in time.”
The one way to gather objective information to help you plan this stage better is in-depth profiling of source systems. This also alleviates any hiatus in meta data available about these systems, which will tremendously help ETL programmers do a better job the first time around. Remember that data staging is a “classic” software development task (see also next month’s newsletter on software testing) where testing and trial deployment are bound to be followed by multiple iterations of fixing. If you only planned for development and initial testing, you will drift from the plan – and potentially get lost!
6. Tying Your DWH To Corporate Strategy Is Mandatory
Given the central role a DWH will play in the organization, it is adamant that you closely align it to corporate strategy. If you’re pursuing a product strategy, make sure the right KPI’s are supported. Likewise for operational excellence: defect rates, wing-to-wing time, cost measures, scrap & rework, etc., all should be available.
The best way to surface an organization’s strategy is not to read documentation or corporate promotion on “strategy.” These PR blurbs tend to be overflowing with platitudes, and the strategy as stated there may or may not coincide with practice at all. You need to investigate what “excellent performance” means. Having the latest and greatest technology (product strategy)? Zero out-of-stock positions (operational excellence)? Everybody wants to satisfy their customers these days, but that does not necessarily imply customer intimacy, etc. You infer the ‘true’ corporate strategy by finding out what managers need to accomplish in order to earn an excellent performance review.
7. If It’s Not An EDW, You’re Creating A (Another?) Silo
There is nothing wrong with creating a DWH to meet specific departmental needs. In particular if this is where you can make a profitable business case. If data driven decision making is to become the norm, though, realize that at some point you will want to create horizontal integration of information.
However, somewhere down the line, someone will raise the question how the comptroller’s numbers are related to marketing, how marketing campaigns relate to activity in the call or service center, etc. “We can’t say” because of some concocted, technical explanation doesn’t impress. Certainly not from the DWH team. Yet these questions are only natural and legitimate! Only EDW’s allow you to answer cross-departmental queries. If you embarked on a departmental DWH, managing these expectations is a full-time job.
8. DWH Architecture Is A Genuine Profession
Although data warehousing is still relatively new, enough experience and practices have been established to merit a new and unique role: the data warehouse architect. Like with “traditional” architecture, you need to negotiate (business) needs and wants with technical possibilities. The outcome can take the form of a blueprint (architectural drawings), a plan (construction project), set of components (chosen building materials), or principles (legislation or guidelines). All too often, there is confusion about what exactly is meant by “architecture.”
Unfortunately, tradition has it that when developers have been working in the field from 3-5 years on, they get “promoted” to architect status. This doesn’t necessarily mean they have any formal training in architecture, it usually just means you’re dealing with a developer who’s got hands-on experience, and who has probably seen several mistakes made more than once. Some developers are just very good (and maybe experienced) developers, and some people are more suited for other work. Developing and architecture require completely different people skills, the kinds of (almost innate) talents that don’t necessarily evolve with experience.
9. Consider The Business Case For Real-Time Data
Integrated data are used ever more widely to support operational decision making. This may happen in your call-center, where optimal cross-sell suggestions may be used that were derived via data analytics. Or in your warehouse when last-minute changes to shipments are based on the latest data on availability. The end result is that the pressure on your DWH to deliver (more) timely data increases. Monthly refreshes used to be the norm, then we went to weekly, but many data warehouses are now updated every day, or even more often. Hardware is becoming ever cheaper, and your architecture has a tremendous bearing on the cost to increase data throughput, and update frequencies. So technically, a lot has become possible, but faster (or the holy grail: “real time”) data always come at a (substantial) price.
Most operational data must absolutely be up-to-date for a flawless customer experience. Like contact data in your CRM application, or warehouse and shipment data. But must your cross-sell suggestions be real-time? How often will you make “the wrong” offer if your next best offers are calculated only once per week or month? Everybody always wants their data faster, but has the value of better decision making really been quantified?
10. Data Warehousing Should Be A Programme, Not A Project
Building a data warehouse from the ground up is such a gargantuan effort, requiring specialist skills, that outside help is just about always required. To fence off these consultant resources “some” project needs to be defined, preferably around one or more initial increments (see also tip# 3).
You cannot expect your data warehouse journey to ever end. There is never a fixed end date, and maybe more or less of a start date (the kick-off party?). Kimball (2008): “… each data warehouse is continuously evolving and dynamic.”
Since you want as smooth a transition as possible from building to deployment, this is best accomplished by making the effort part of a programme, right from the start. You’ll typically encompass multiple parallel efforts like data profiling, overhaul of source systems, cataloguing of meta data, data quality efforts, etc. Demand from your consultants that they actively help you transition from a project mode to in-house continuation of DWH operation (if that’s your chosen maintenance model).
Further reading
Some excellent books on Data Warehouse Development:
The Business of Data Vault Modeling.
ISBN# 9781435719149
Dan Linstedt, Kent Graziano & Hans Hultgren (2008)

The Data Warehouse Lifecycle Toolkit, 2nd Edition.
ISBN# 0470149779
Ralph Kimball, Margy Ross, Warren Thornthwaite, Joy Mundy & Bob Becker (2008)

Building a Data Warehouse for Decision Support.
ISBN# 0133711218
Vidette Poe (1996)

The Data Warehouse ETL Toolkit.
ISBN# 0764578578
Ralph Kimball & Joe Caserta (2004)

Corporate Information Factory.
ISBN# 0471197335
Bill Inmon (1998)

The Data Warehouse Toolkit.
ISBN# 0471153370
Ralph Kimball (1996)

Data Warehousing – the Route to Mass Customization.
ISBN# 0471963283
Sean Kelly (1996)

Certified Data Mining and Warehousing Professional VS-1068

Vskills Data Mining and Warehousing Professional assesses the candidate for a company's data mining and warehousing needs. The certification tests the candidates on various areas in data mining and warehousing which include knowledge of planning, managing, designing, implementing, supporting, maintaining and analyzing the organization's data warehouse and covering data mining and On-Line Analytical Processing (OLAP).


Why should one take this certification?
This Course is intended for professionals and graduates wanting to excel in their chosen areas. It is also well suited for those who are already working and would like to take certification for further career progression.

Earning Vskills Data Mining and Warehousing Professional Certification can help candidate differentiate in today's competitive job market, broaden their employment opportunities by displaying their advanced skills, and result in higher earning potential.


Who will benefit from taking this certification?
Job seekers looking to find employment in IT department of various companies, students generally wanting to improve their skill set and make their CV stronger and existing employees looking for a better role can prove their employers the value of their skills through this certification


Test Details:
  1. Duration: 60 minutes
  2. No. of questions: 50
  3. Maximum marks: 50, Passing marks: 30 (60%); There is no negative marking in this module.
Companies that hire Vskills Certified Data Mining and Warehousing Professional
Data Mining and Warehousing professional are in great demand. Companies specializing in Integration Services are constantly hiring knowledgeable professionals. Various banks, telecom and IT companies also need data mining and warehousing professionals for data management and analysis.


Fee Structure:
Rs. 2225.00/- + Service Tax as applicable

More On: http://www.monsterindia.com/vskills/selected_program.html?id=1009

negotiating-a-salary-raise-tips

Negotiating salary is, for most people, the hardest part of the job process and the cause of considerable anxiety. The key is to be prepared, reasonable and confident.
  • Wait For The Right Moment. The right time to ask for a raise is right after you’ve achieved something significant, for example, completed a tough project under budget, or when your boss or other key person has complimented you.

    If money is tight in the company and you have not made any significance contributions lately to help offset that problem, now may not be a good time to ask for a larger share of a dwindling pool of money. Timing also refers to the company’s policies and procedures in terms of the amount of time between reviews and raises — and when it’s “acceptable” to ask for a raise.
  • Broach The Topic Professionally And Stay Emotionally Neutral. Be professional, polite, and respectful. Always negotiate your salary with your direct superior. Never go above his or her head or to the Human Resources department. Set up a meeting with your boss to address this topic. That way you’ll know how much time you have and your boss won’t be taken by surprise.
  • Dress for success – on the day of your meeting, dress as you would for a job interview or business conference. You may even want to develop a script to follow. Just keep it flexible. When making your case, don’t compare yourself to co-workers — stick to the field in general. Anticipate any objections the employer might be able to raise and be prepared to justify your cost effectiveness.
  • Ask for What You Want. When asking for anything in life, you should be certain you know what you want. Otherwise you’re leaving the decision up to someone else and you may come out dissatisfied. You can’t be shy about asking to be paid what you’re worth. Give your boss an estimate of how much your efforts add to the company’s bottom line. Ground your proposal on objective criteria.
  • Create a one pager that includes comparables, and at the bottom, estimate your fair market value in light of those comparables. That will help convince your boss and give your boss something to show to higher-ups to justify giving you a raise. That one-pager will also add to your confidence in the negotiation.
  • Present Your Outline Of Your Accomplishments. Use as many details as possible, such as numbers and facts. You’ll want to take five to seven of your most recent or biggest-impact contributions and present them in a bulleted list. Most bosses are interested in numbers. If you are in marketing, how do the things you do put profit on the bottom line? If you are an administrator, how do you make money for the company, or, how do you save money for the company and how much of that savings drops directly into the profit margin of the organization?
  • Stay Positive. Talk about how you are happy in your current job.. Focus on what you deserve rather than what you need. Emphasize the benefits of your skills to the company. Don’t present your current salary/position as a problem.
  • Don’t Monopolize the Conversation. Know when to listen. Yes, you’ve arranged this meeting and you’re there to tell your side, but don’t dominate the discussion. Say what has to be said and then listen. Listen closely and give your employer plenty of room to talk. Often the more time people are given to talk, the more they will say – even just to fill that silence.  In addition, it is important that you listen to all your boss has to say. You want to be cooperative, not demanding and combative. You will likely gain and understanding of how things work within the company and what the company is both willing and able to do in your favor.
  • Be Flexible And Open To Other Options. Consider negotiating for perks. Maybe a pay raise won’t fly at the moment – in part because it would involve extra taxes and workers’ compensation for your employer. But you can ask for other things, including an extra week of vacation, extra personal days, education benefits or. So include and discuss other types of compensation that would be valuable to you.
  • Have an Exit Strategy. Express your understanding of the boss’ position. If your request for a raise is denied, try to find out where you can improve, so that next time you ask, your boss will have no choice but to reward your efforts.
  • Confirm the Details in Writing. Write a follow-up memo after the meeting. summarizing the meeting, demonstrating your value, and highlighting your accomplishments — and send the memo to your boss as documentation. Document any salary promises. If you were not able to obtain an increase in salary, find out when you will be able to revisit the issue. Be prepared to offer suggestions of what the next steps should be.
Marsha A. Ostrer is a mediator, conflict resolution trainer and lawyer who practices privately through Family Mediation of Cape Cod. Her conflict resolution specialty is successfully entering and defusing highly charged conflicts using a targeted mix of training and consulting.
She is also the founder and developer of http://www.all-things-conflict-resolution-and-adr.com website from which this article was developed see http://www.all-things-conflict-resolution-and-adr.com/Negotiating-Salary-Raise-Tips-Part-I.html for more tips. Her website’s mission is to provide resources and information, so that organizations and individuals will be able to make informed choices in accessing conflict resolution skills, training, and services to manage and stabilize the conflicts in which they are involved.
Article Source: http://EzineArticles.com/?expert=Marsha_A_Ostrer
http://EzineArticles.com/?Negotiating-a-Salary-Raise-Tips&id=5969220
See Other Helpful Articles:
Five Large Employers Posting Data Warehousing Jobs
Find Employers Posting Data Warehousing Jobs
Making a Great Impression During an Interview for a Data Warehousing Job
Business Analyst Salary and Job Description

Data Warehouse Architect

Deloitte Consulting LLP is one of the world's leading management consulting firms for executable strategy, operations, technology, and human capital advisory services. The consulting practice is built around integrated core capabilities - people, process and technology and industry expertise - the capabilities needed to help clients to tackle their most complex challenges
Federal Practice - Deloitte Consulting LLP
Deloitte Consulting's dynamic Federal Practice based in Washington D.C. and the surrounding Metropolitan area has opportunities for you to become part of their high-quality team that delivers innovative solutions to key Federal clients in financial management, business process improvement, strategy and operations, information systems development, package implementation, enterprise transformation, business process and applications outsourcing, and a full range of human capital advisory services.
As a Data Warehouse Architect, you will act as the primary technical architect for our data warehousing projects to solve some of the toughest challenges in business intelligence today. The successful candidate should possess deep technical expertise in database design, ETL, reporting and analytics and will have previous consulting experience utilizing a structured delivery methodology.
Job Requirements:
                     Ability to Obtain a US Govt security clearance
•                     A minimum of 10 years experience architecting large scale data warehouses using Oracle
•                     A minimum of 7 years of relevant consulting experience preferred or 15 years of corporate IT
•                     Expert data modeling skills (i.e. conceptual, logical and physical model design - with both traditional 3rd normal form as well as dimensional modeling (star, snowflake), experience with Operation Data Stores, Enterprise Data Warehouses and Data Marts
•                     Expert with pro's and con's of multiple warehouse architectures
•                     Expert RDBMS knowledge of Oracle (release 9i and 10g) including very large databases
•                     Expert in load design strategies and optimization techniques
•                     Implementation experience with ETL concepts and tools such as: Oracle Warehouse Builder, Informatica, DataStage, and ab Initio
•                     Implementation experience with one or more of the following BI/Enterprise Reporting tools: Oracle Discoverer, Business Objects, Cognos, Brio, and Microstrategy
•                     Possess strong communication and client management skills.
•                     Ability to effectively manage multiple projects simultaneously
•                     BS in Computer Science or equivalent experience in a relevant field
Responsibilities:
•         Translate client user requirements into technical architecture vision and implementation plan
•         Design and develop the architecture for all data warehousing components (e.g. tool integration strategy; source system data ETL strategy, data staging, movement and aggregation; information and analytics delivery; and data quality strategy)
•         Design of the data warehouse data storage strategy/technique (ODS, EDW, DM, ROLAP, MOLAP, etc.)
•         Design of data warehouse sub-system (e.g. QA/QC, ETL and Query metrics)
•         Oversight of tool evaluation and selection
•         Hardware recommendations, sizing and configuration
•         RDBMS installation, configuration and tuning
•         Oversight and/or coordination of system testing phase
•         Problem and issue resolution
•         Mentoring and developing junior staff
•         Participation in proposal development and client presentations
Deloitte is one of the leading professional services organizations in the United States, specializing in audit, tax, consulting and financial advisory services with clients in more than 20 industries. We provide powerful business solutions to some of the world’s most well-known and respected companies, including more than 75 percent of the Fortune 100.

At Deloitte, you can have a rewarding career on every level. In addition to challenging and meaningful work, you’ll have the chance to give back to your community, make a positive impact on the environment, participate in a range of diversity and inclusion initiatives, and find the support, coaching, and training it takes to advance your career. Our commitment to individual choice lets you customize aspects of your career path, your educational opportunities and your benefits. And our culture of innovation means your ideas on how to improve our business and your clients’ will be heard.

Visit www.deloitte.com/us/careers to learn more about our culture, benefits and opportunities.



About Deloitte

As used in this document, “Deloitte” means Deloitte LLP and its subsidiaries. Please see www.deloitte.com/us/ about for a detailed description of the legal structure of Deloitte LLP and its subsidiaries. Deloitte LLP and its subsidiaries are equal opportunity employers.

More On: http://careers.deloitte.com/jobs/eng-US/details/j/E12ROSCMGRRV123/data-warehouse-architect

Job Openings at Safari Books Online: Data Warehouse Developer/Data Architect

Job Openings at Safari Books Online: Data Warehouse Developer/Data Architect

We currently have a job opening at Safari Books Online for a Data Warehouse Developer/Data Architect:
Data Warehouse Developer/Data Architect
JOB SUMMARY:  
Data Warehouse Developer/Data Architect with very strong data integration skills to design data models, schemas, ETL, Ad-Hoc Queries, build dynamic/interactive reports, and contribute to the design and development of our delivery platform.
ESSENTIAL DUTIES AND RESPONSIBILITIES:
The following reflects management’s definition of essential functions for this position but does not restrict the tasks that may be assigned. Management may assign or reassign duties and responsibilities to this job at any time due to reasonable accommodations or other purposes.
  • Technical leadership role in the data warehouse and enterprise modeling efforts
  • Technical leadership to establish, define, maintain and communicate data architecture reference architectures, guidelines, standards and best practices.
  • Technical leadership for solution architecture development and review with regard to data integration architecture design and implementation
  • Own and manage the conceptual and logical data models for the DW and other data platforms
  • Own the data dictionary, data repository and data governance
  • Provide technical database leadership to DBAs.
  • Interacting with client, functional experts and developers to understand the system requirements.
  • Guide the development and implementation database administration and operational procedures.
KNOWLEDGE, SKILLS AND ABILITIES                                                 
  • Designing and developing Business Intelligence and data warehouses
  • Exemplary writing skills with a strong ability to communicate project status
  • Expert-level knowledge of SQL, stored procedure optimization, and triggers
  • Expert in the creation, maintenance, and tuning of ETL jobs to populate the data warehouse
  • Strong Experience with MySQL, Pentaho, SQL Server, Data modeling, BI,
  • A Bachelor’s degree in Computer Science, Information Systems, Engineering, or other related degree
Read More; http://blog.safaribooksonline.com/2012/03/21/job-openings-at-safari-books-online-data-warehouse-developerdata-architect/

Data Management & Warehousing

Job scheduling - fixed and relative timing

This article looks at the relatve merits of two types of processing schedule, once based on a fixed times (similar to 'cron' on a unix system) and onebased on relative times (similar to 'at' on a unix system)

Introduction

Much of data warehousing is dependent on running regular jobs to get collect data and load it into the system. In order to demonstrate both fixed and relative timing methods we use an example as outlined below:

A warehouse has two source systems. The first system produces an extract file once a night at a given time (for this example at 8pm). The second system provides a set of transactions for a period of time (for this example every half an hour) then sends a file to the data warehouse server. Once the extract from the first system and all the available files from the second system are loaded a data mart can be built from the data warehouse

Note that since this is an example of scheduling the complexity has been removed, however more discussion on the complexity of the load process and the use of directed graphs (or digraphs) can be found in our knowledge base.

Fixed time job scheduling

Fixed time job scheduling (also known as absolute time job scheduling) assumes a number of known times for actions to occur. In our example the first system provides a file at 8pm and we might therefore reasonably schedule our job at 8.30. It will need to check if the file exists, and if not it may choose to sleep for a period of time before checking again. At some point, after say three tries it will decide to fail the job for that night and try again tomorrow.

The files from the second system can either be loaded by a second cron job, or as either a dependency or precursor to the first job. In either case the elapsed time will contain some element of contingency against the planned file delivery time.

Furthermore the loading of the second system will all occur during the critical batch window for the day and (assuming our half hour pushes) will have forty-eight files to load.

Relative time job scheduling

Unix and many commercial schedulers offer an alternate to fixed time. In unix this is implemented via the 'at' command. This simple mechanism allows a much more granular approach to scheduling data warehouse loads.

Once again we can assume that the file exists and is ready to load. If it is then the file will be loaded. However whether a file is loaded or not the script executes the following:

at -f script 2>>script.log now +5 minutes

where script is replaced with the path and name to the script and script.log is the path and name to the log file you want.

What now happens is the script will run again 5 minutes after the completion of the previous script. The 'at' interval can be tuned from minutes to years. It may appear that we could achieve the same thing by setting the cron interval to 5 minutes but this is not so.
Cron versus At
Cron versus At

Comparison of methods

If we use the fixed method and schedule every five minutes 'cron' the we have no problem with the first extract, however the second extract has an issue around controlling the number of files being processed. The first job that starts will grab the available file. It will still be processing this file when the second job starts and therefore two things may happen. Firstly without careful programming the file may be re-processed. Secondly the system resource load is rising as I now have two jobs serving the same purpose running.

With the relative method the first extract again has no issues although the delay will now be much lower than the scheduled version as no contingency is required. The second system extract will however pick up the first file. Only once it has finished processing the file will it start waiting for another five minutes. This avoids the risk of re-processing files and manages system resource load becuase there is only ever one job running.

If we use relative scheduling we also find that our batch window has shrunk because having loaded files thoughout the day as they become available they are now no longer part of the major load during the critical batch window.

Conclusion

In practice both forms of scheduling have a place in building a data warehouse however the use of relative timing verses fixed timingis often over looked and therefore many data warehouse batch schedules are longer than they need to be.
With some simple analysis and the use of relative timing batch window time usage is greatly reduced. Article Manager module by by George! Software.

business-analyst-salary-and-job-description

We are starting a new series of blog posts that will profile the key positions in the data warehousing industry. We think this information will be useful to current data warehousing job seekers, as well as those that are looking to advance their career in the future.
Position
Business Analyst
Description
Business analysts are primarily responsible for investigating problems and proposing solutions. Effective Business Analysts will have a thorough understanding of the company’s business activities, operational systems, business intelligence program and IT infrastructure.


Their responsibilities include:
  • Query and Interpret data using techniques ranging from statistical analysis to complex data mining
  • Analyze the data-flow of Source Systems to Staging Environment to Data Warehouse tables
  • Gather Business Requirements
  • Develop Technical Specifications and Mapping Documents
  • Facilitate Decisions leveraging both technical and business resources
  • Feed information to ETL Developers and Data Architects
  • *Responsibilities of Senior Business Analysts may also include presenting to leadership and leading departmental projects.
Salary
Indeed.com – National Average*: $86,000
Salary.com – National Average*: $86.060 (Business Analyst III)
Salary.com – Range*:
10th percentile = $69,598
90th percentile = $102,267
*Does not include benefits
Education
According to Salary.com 50% of Business Analysts have a bachelor’s degree. 33% have a Master’s Degree and will generally receive a higher salary.
Professional Experience
According to Salary.com most Business Analysts have 5-10 years of experience. In general, some experience is necessary, less than 9% had less than 1 year of experience.
Qualifications
  • Knowledge of Industry
  • Knowledge of Business Intelligence tools and methodologies
  • Knowledge of project management (waterfall, agile…etc.)
  • Experience with running queries and reports in a BI tool
  • Experience with coding in SQL based clients
  • Dependable team member
Top/Senior Candidates
Top/Senior candidates will have many of the following qualifications:
  • More than 5 years of experience
  • Experience with full range of Business Intelligence Tools
  • Experience with generating and interpreting statistical studies
  • Experience leading projects across multiple departments
  • Experience with presenting to large groups and senior leadership
Below are Related Job Searches at Indeed.com
Business Analyst Average Salary = $86,000
Senior Business Data Analyst Average Salary = $105,000
Business Data Analyst Average Salary = $68,000
Project Manager Business Analyst Average Salary = $97,000
See Other Helpful Articles:

One Response so far.

datawarehouse interview demo

The main operation of a data warehouse is to load data from source systems to the warehouse and mart. Think of all the events that can occur. There are many.

How will data be obtained? Will the warehouse link into source systems and pull the data out? If so, what kind of systems are they? Cobol systems with data stored in ISAM/VSAM files. Perhaps Orcale or SQL Server systems. Will you use an ETL tool to get at those file structures? Perhaps it is easier to ask the source system people who are knowledgeable about their systems to create you flat file feeds. That is my preference, since I do not have to get bogged down figuring out where all the data is for each source. If files are sent to you, where are they sent? For example, I have all my sources FTP their flat files to a UNIX directory. Sounds easy, but there are problems associated with that. What if somebody mistakenly sends a file and issues a delete command? What stops him from deleting other files sent by other people? What if your load process starts processing the file before the FTP transmission completes. How do you check if the file is still "hot" and how do you ignore it? What happens if the file contains too many errors to process? Do you continue processing the next file? What if you receive or process a file in the wrong order. Your parent/child relationships between tables will be violated. That will produce incomplete end-user reports when inner joins between tables drop records. How do you enforce parent/child relationships if at all? How do you handle records that were once sent to the warehouse but have since been deleted from the source system? Deletions are a huge challenge since sources may be unable to send deleted records. One way to handle this is to accept a full feed from the source. Any records in the warehouse that are missing from the feed represent implicit deletions. The problem with full feeds is that they can be huge which will give you performance issues. Incremental feeds are preferable if they can provide deletions. Incremental feeds can be problematic too. An incremental feed should contain all data that has changed from the previous time data was sent. So the previous date and time needs to be tracked. It is a bad idea to assume the previous time was yesterday. Also, how do you identify records that have changed? Typically the time stamp on the source record is used, but is it reliable? Perhaps someone can make changes or do maintenance without updating the time stamp. Then there are challenges and quirks associated with data modeling, such a slowly changing dimensions. Do know what Type 1, 2 and 3 are. I have been asked this twice.

I can go on and on. In fact I wrote a 100 page document that discusses these issues and how they are resolved.

I hope this helps you get started. Good luck in the interview
Read More: http://www.tek-tips.com/viewthread.cfm?qid=1660406
 
sqlwarehouse (Programmer)
10 Sep 11 18:42
hi dkyrtata, tks a lot for ur help...must admit ..have never done dw job before...i wanto get a job..so applied ,,iam developer..and used sql for backend purpose..created sps,views,dml triggers,udfs & job schedules ...etc... but never worked on dw side... dont want to miss th opportunity.... would u recommend any books ..or just practical approach ..i can quickly learn ...i just want to give the presentation on how would i establish the operational(i.e service) requirements for the data warehouse....which i have no clue now ..sorry to bother you ..much appreciated  
dkyrtata (Programmer)
12 Sep 11 10:49
Be aware that the two DW gurus are Ralph Kimball and Bill Inmon, (there is one other, Claudia something I think). They take two different approaches to DW. So you may be asked to briefly describe their methodologies. Most DWs take the ideas from both methodologies as DW is not an exact science.

Occasionally, I reread parts of Kimball's book, "The Data Warehouse Etl Toolkit". He describes all the possible problems and things you should do in a warehouse. It leaves me wonder how any DW project can ever deliver anything to its end users when there is just so many things to do. DW projects are huge and often lead to expensive failures - probably more often than not.

 
blom0344 (TechnicalUser)
13 Sep 11 10:52
Claudia Imhoff.  More recent developments center around Dan Linstedts 'Datavault' approach for corporate DW development
Ties Blom

 
sqlwarehouse (Programmer)
16 Sep 11 13:52
anyone ....about operational  requirements for the dataware house....

how about these.. can i say these following are operational requirements ....

-Requirement Gathering
-Physical Environment Setup
-Data Modeling
-ETL
-OLAP Cube Design
-Front End Development
-Report Development
-Performance Tuning
-Query Optimization
-Quality Assurance
-Rolling out to Production
-Production Maintenance
-Incremental Enhancements 

How to Enable Data Warehouse Job Schedules

How to Enable Data Warehouse Job Schedules

This topic has not yet been rated - Rate this topic
Updated: December 1, 2010
Applies To: System Center Service Manager 2010 SP1
By default, the schedules for the extract, transform, and load (ETL) jobs are not enabled. Use the following procedure to enable the schedule for the ETL jobs; however, this procedure can be used to enable the schedule for any of the data warehouse jobs. In this release of Service Manager, you can enable the schedules only by using Windows PowerShell. Additionally, in this release, it is not possible to query for the status of a data warehouse job schedule. If you have to know the status of a particular job, run the command to enable it.
noteNote
To run the commands in this topic, the execution policy in Windows PowerShell must be set to RemoteSigned. Type the command Set-ExecutionPolicy RemoteSigned to set the execution policy.

To enable a schedule for a data warehouse job by using a Windows PowerShell cmdlet

  1. On the computer that hosts the data warehouse management server, click Start, point to Programs, point to Windows PowerShell 1.0, right-click Windows PowerShell, and then click Run as administrator.
  2. At the Windows PowerShell command prompt, type the following command, and then press ENTER:
    Add-PSSnapIn SMCmdletSnapIn
    
  3. Type the following commands, and then press ENTER after each command:
    Enable-SCDWJobSchedule –JobName Extract_<data warehouse management group name>
    
    Enable-SCDWJobSchedule –JobName Extract_<Service Manager management group name>
    
    Enable-SCDWJobSchedule –JobName Transform.Common
    
    Enable-SCDWJobSchedule –JobName Load.Common
    
  4. Type exit, and then press ENTER.

    Read More: http://technet.microsoft.com/en-us/library/ff461164.aspx

Data warehouse jobs

Before your interview, make sure to understand what are the platforms used for data warehousing in the company you are interviewing for.

Here are a series of questions that can be asked for data warehouse developer jobs, data warehouse analyst jobs, data warehouse architect jobs, data warehouse manager jobs...


Can you give 3 to 5 advantages Data Mining can have compared to normal systems?

Could you explain the usage of Degenerated dimension and where is it appropriate to deploy it?

Could you explain how SCD can be useful in real time?

Could you explain why a Data Warehouse can or cannot be used for Transaction Database?

Could you explain the role and liabilities of an ETL developer or head of ETL development?

In AB Initio, could you briefly explain the function XFR?

In AB Initio, how is it possible to calculate the whole memory requirement of a graph?

Could you explain the major advantages of Business Performance Management?

Under Cognos, could you explain what Cubes and Sub Cubes mean?

In regards to ETL, Could you explain the importance on cleaning data before loading it into the warehouse?




Search and apply for data warehouse jobs now

Read More: http://www.randstadtechnologies.com/find-jobs/career-tips/it-interview-preparation/data-warehouse-jobs.html

Data Warehousing Specialist Jobs

Data Warehousing Specialist Job Description

Data warehousing specialists design and implement warehouses of database information, which end users use for analysis and reporting.
Responsibilities for data warehousing specialist jobs include:
  • Designing database design structures.
  • Developing data extraction techniques to pull data from other systems, such as billing, claims, or administration.
  • Developing process models for loading, transformation, sourcing, and extraction.

Data Warehousing Specialist Salaries by State

Hover over your state to get an idea of what Data Warehousing Specialists make in your area.
How to use this salary data.
Job seekers can use it while negotiating a salary.
Employers can use it to help set appropriate wage levels while writing job descriptions.

Data Warehousing Specialist Salaries

Job TitleMean Annual Salary
Data Warehousing Specialist $79,240 
All Jobs $39,336 
Data Warehousing Specialist salaries can vary depending on your experience, the location, company, industry, and benefits provided. Nationwide, most data warehousing specialists make between $59,400 - $99,600 per year, or $28.53 - $47.90 per hour.

Employment Outlook for Data Warehousing Specialists

There are currently 209,330 data warehousing specialists in the United States, with 7,260 new data warehousing specialist job openings created each year.
Data Warehousing Specialist jobs are not expected to see much growth beyond their current levels in the next decade.

Education / Training Requirements for Data Warehousing Specialists

Associate degree is normally required for data warehousing specialist jobs.

Salaries for Jobs Related to Data Warehousing Specialists

Job TitleMean Annual Salary
Database Administrators $73,490 
Software Developers, Applications $87,790 
Geospatial Information Scientists and Technologists $79,240 
Clinical Data Managers $72,830 
Librarians $54,500 
Library Technicians $29,860 
Geographic Information Systems Technicians $79,240 

More about Data Warehousing Specialists

To learn more about data warehousing specialist jobs, salaries and responsibilities, visit the Bureau of Labor Statistics, the US Department of Labor or Career One Stop.

Job Tips and Job News – 10 Myths about Data Warehousing

Debunking some common myths about data warehousing to bring the focus back to business
There are several reasons why data warehousing projects are expensive and fail to produce sufficient value for the business. I consider following 10 myths to be the main cause of the problem:
Myth #1 – A data Warehouse can create competitive advantage
Myth #2 – A data Warehouse is required for business intelligence
Myth #3 – Data Warehousing starting point is an enterprise data model
Myth #4 – You need both an Operational Data Store and a Data Warehouse to cover the entire spectrum of business reporting
Myth #5 – Data Warehousing requires an engineering approach
Myth #6 – Data Warehousing fails due to problems with transaction-processing systems
Myth #7 – We can’t predict what questions will be asked from a data warehouse
Myth #8 – Data Warehousing improves decision-making
Myth #9 – Data Warehousing empowers front-line and business staff to do their own analysis
Myth #10 – Data Warehousing reduces overall cost of reporting on business performance and opportunities
I’m sure that this will come as a sacrilege to those who have spent a career building their knowledge and practice of data warehousing. I’ve examined this topic in a lot more detail at A Myth Buster Anthology of Data Warehousing.
Read more

Data Warehouse Job Descriptions

  1. Data Base Administrator (DBA)

    • Data Base Administrators (DBAs) are responsible for the structure and performance of a data warehouse. DBAs are skilled in data administration, data modeling and tuning, ensuring development teams have the optimum environment in which to deliver applications.

    ETL Developer

    • ETL developers make source data available to the warehouse. Using either in-house developed or off-the-shelf tools, ETL developers ensure the warehouse is populated with timely and accurate data.

    Business Intelligence (BI) Developer

    • Business Intelligence (BI) developers use warehouse data to solve organizational problems through reports, analysis and data visualization.

    Solution Architect

    • Solution Architects work at data warehouse vendors to help customers implement their data warehouse. Solution architects are able to recommend data warehouse solutions to meet the organization's specific needs.

    Data Warehouse Sales

    • Account executives for data warehouse companies work with organizations to identify business problems requiring data warehouse solutions and develop proposals to solve those problems. Account executives typically work with solutions architects and other consultants who know the industry in order to develop a thorough business case.

Hadoop, MongoDB, and Data Warehouse - Selecting The Right Tool for The Job

What is Big Data?

The term, Big Data has been around for a while, but during recent years more and more enterprises and internet services have turn their attention to Big Data. But what is Big Data, and how big it is?
The notion of big is relative. There is no size threshold that defines big and non-big data. It all depends on the organization that is dealing with the data. For some organizations, terabytes can define big. For others, petabytes or more. Big Data is not a precise term. It is rather a characterization of the process and problem of continuously accumulating more and more data. Its main characteristics are:
  • Big volume
  • Big variety of data sources and formats
  • Big rate of volume increase and data change
Big data is also characterized by technology implementation challenges of scope and scale. Datasets grow so large they become more difficult to work with. Related challenges in storage, search, sharing, analytics, and visualization also emerge.

Typical Challenges

Some organizations are trying to collect as much data as possible. They try to measure every aspect related to each transaction and operation. The problem is not in the storage of big volumes of data. The situation is quite opposite – storage becomes cheaper every year and allows organizations to store larger and larger volumes of data. The problem is how to extract real value out of the increasing volumes of data.
Other organizations are dealing with the unlimited demand of Internet consumers for popular new web and mobile services. Some of the largest social network applications and Internet services have successfully moved away from SQL based data management. The leading commercial platform for this space is Hadoop. Large scale content and document management platforms have also emerged for dealing successfully with rapidly scaling demand for storage and retrieval capacity. Leading platforms in this space include MongoDB and CouchDB.

Apache Hadoop

Hadoop is a scalable, fault-tolerant and distributed data storage and processing system. Hadoop is designed to store terabytes and even petabytes of data on commodity hardware with the ability to process such volumes of data. Hadoop automatically detects and recovers from hardware, software and system failures. Two base components of Hadoop ecosystem are:
  • HDFS is a distributed file system that provides high-throughput access to application data. HDFS is organized into a cluster of servers where each server stores a small fragment of the complete data set, and each piece of data is replicated on more than one server.
  • Hadoop MapReduce is a software framework for distributed processing of large data sets on compute clusters.
Hadoop Ecosystem

Mongo DB

MongoDB (from "humongous") is an open source, document-oriented, NoSQL database system written in the C++ programming language. In MongoDB, any field can be queried at any time. MongoDB supports range queries, regular expression searches, and other special types of queries in addition to exactly matching fields. Queries can also include user-defined JavaScript functions (if the function returns true, the document matches). Queries can return specific fields of documents (instead of the entire document), as well as sorting, skipping, and limiting results. Queries can "reach into" embedded objects and arrays.

Data Warehouses

A data warehouse is a central repository for all enterprise data. Data can be collected from various systems inside an organization. Usually a data warehouse is hosted on a relational database. The difference from traditional relational databases used in OLTP systems is that it is designed purely for reporting and analytics. For large enterprises with a wide set of business lines, departments, tools and data formats, it is often very difficult to trust the data. This is critical for tasks of analysts and executives who usually analyze a high-level view of the enterprise's data. In this case the Data Warehouse plays the role of "a single version of truth".
Basic attributes of data warehouses are:
  • Wide ranges of historical data
  • Consolidated, conformed and valid data
  • De-normalized data structures (usually multidimensional)
  • Fast report response time
  • Flexible ad-hoc analysis
In addition to traditional row-based relation databases, data warehouses now support alternate technical approaches to store and access the data. Recently new trends have been evolving very rapidly:
  • Column-oriented RDBMSs which store the data by columns; this allows great compression rates and improved I/O performance
  • In-memory data storage engines, which allows database queries in memory without affecting a slow disk systems
Data Warehouse
Main approaches in DW design are Inmon's style (Top-Down) and Kimball's style (Bottom-Up). Within a Top-Down approach, the data warehouse is being designed for the entire enterprise, and then propagated to the needs of certain departments. The Bottom-Up approach recommends building the data for small tasks (like single department or service needs), and then integrating them into the enterprise data warehouse.

Summary

As always, in real life, there is no silver bullet and selecting the right tool for each task is the key. In this short article we have tried to cover the main definitive characteristics of Hadoop, Mongo DB, and more traditional Data Warehouses.

Data Warehouse jobs in London

Datawarehouse Developer

Coal IT Services - East London
The Datawarehouse Developer plays a key role in the Datawarehouse team by designing and developing solutions to business problems and by ensuring that end user...
CWJobs.co.uk - 9 hours ago - save job - block - email - more...

Datawarehouse Architect

Glasgow
If you fit the above description and are available, please contact me. As part of a large transformation project, my client is building a new Data Warehouse...
Jobs-North - 5 days ago - save job - block - email - more...

Datawarehouse Manager

North London - +1 location
Understanding Recruitment is acting as an employment agency for this vacancy. Key skills: Data Warehouse, Greenplum, ETLs, data-marts and analytics, Java,... £45,000 - £55,000 a year
EngineerBoard.co.uk - 2 days ago - save job - block - email - more...

Oracle Datawarehouse Developer

York
- Using Oracle Warehouse Builder (OWB), design, create and maintain ETL routines for the. Interviews are taking place ASAP, so please send an updated CV to be...
OnQ Jobs - 3 days ago - save job - block - email - more...

Datawarehouse Developer

C.O.A.L IT. Services Ltd - London
The Datawarehouse Developer plays a key role in the Datawarehouse team by designing and developing solutions to business problems and by ensuring that end user... £55,000 - £65,000 a year
Jobsite UK - 1 hour ago - save job - block - email - more...

BI/Datawarehouse Developer

Capgemini - AMS - Telford
Software component design and development within a development team. Proactive and able to work from own initiativeAble to work as part of a team....
icrunchdata.com - 3 days ago - save job - block - email - more...

Business Analyst (Datawarehouse)

London
To be considered, candidates must possess a strong track record of business analysis, data warehousing and dimensional modeling (Kimball), process re...
HereIsTheCity.com - 7 days ago - save job - block - email - more...

Database/Datawarehouse Test Engineer

Project Partners (Hydrogen Group) - London
Nicholas House, 3 Laurence Pountney Hill, London EC4R 0EUD: +44 (0)20 7002 0058 T: +44 (0)20 7929 1800 F: +44 (0)20 7929 1200 W: www.projpartners.comHydrogen is... £300,000 a year
Technojobs - 4 days ago - save job - block - email - more...

Datawarehouse Architect

RZ Group - Glasgow
If you fit the above description and are available, please contact me. As part of a large transformation project, my client is building a new Data Warehouse...
Jobsite UK - 5 days ago - save job - block - email - more...

Oracle PL/SQL Datawarehouse Developer

IT Connections - Camberley
Oracle PL/SQL Data warehouse Developer role within one of the UK's leading provider of consultancy, managed services and product development for the retail...
CWJobs.co.uk - 1 day ago - save job - block - email - more...