Tuesday, August 14, 2012

Big Data Warehousing


IBM's Big Data platform has data warehousing as an integral part of the framework. The IBM data warehouse capabilities for Big Data include a massively parallel processing engine, high-performance OLAP, and support for mixed operational and analytic workloads. Accordingly, IBM's workload-optimized systems (comprising integrated software and hardware appliances) include both a Deep Analytics appliance and configurable Operational Analytics appliances, along with data warehousing software.

Among the existing solutions available from IBM:
  • IBM Netezza appliance is used for high-performance analytic queries without DBA tuning or storage administration, where high-speed batch ingest and hundreds of queries per second are required.
  • IBM Smart Analytics System is used for both analytic and real-time transactional workloads on the warehouse. Here the workload includes point transactional queries against detailed data and extreme concurrent query volumes (more than several hundred queries per second), and continuous or trickle-feed data ingest is required.
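
To make the distinction between the two concrete, here is a rough sketch of the selection criteria above as a small Python function. This is only my illustration, not an IBM sizing tool: the `Workload` fields, the `suggest_appliance` name and especially the `extreme_qps` cutoff of 500 are assumptions made for the example, since the text only says "several hundred" queries per second.

```python
from dataclasses import dataclass


@dataclass
class Workload:
    """Hypothetical description of a warehouse workload (illustrative fields only)."""
    queries_per_second: int
    point_transactional_queries: bool  # point lookups against detailed data
    continuous_ingest: bool            # trickle feed rather than batch loads


def suggest_appliance(workload: Workload, extreme_qps: int = 500) -> str:
    """Map the criteria from the list above to one of the two appliances.

    extreme_qps is an assumed stand-in for "more than several hundred
    queries per second"; it is not a figure published by IBM.
    """
    needs_operational = (
        workload.point_transactional_queries
        or workload.continuous_ingest
        or workload.queries_per_second > extreme_qps
    )
    if needs_operational:
        return "IBM Smart Analytics System"
    # Otherwise: batch ingest and analytic queries at hundreds per second.
    return "IBM Netezza appliance"


if __name__ == "__main__":
    batch_analytics = Workload(200, False, False)
    mixed_operational = Workload(200, True, True)
    print(suggest_appliance(batch_analytics))
    print(suggest_appliance(mixed_operational))
```

A real selection would of course weigh data volume, budget and existing infrastructure as well; the sketch only captures the workload characteristics named above.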

Look out for a newer set of solutions from IBM's stable in the near future. Also, here is an earlier blog post of mine on selecting a data warehouse solution.