Greenplum hadoop

  • Published on
    22-Apr-2015

  • View
    752

  • Download
    5

Embed Size (px)

DESCRIPTION

 

Transcript

  • 1. Copyright 2012 EMC Corporation. All rights reserved. 1
  • 2. Greenplum Enable Big Data Analytics Jimmy Chiu /EMC Greenplum Taiwan Copyright 2012 EMC Corporation. All rights reserved. 2
  • 3. Volume, Variety, Velocity, Value + ComplexityNew insights on Contextual andcustomers, products, Velocity Volume location-awareand operations delivery to any Big Data device Variety Complexity Documents Transactional Smart Grid Images Audio Text Video Data Volume: data volumes approaching multiple petabytes Velocity: data being generated and ingested for analysis in real-time Variety: tabular, documents, e-mail, metering, network, video, image, audio Complexity: different standards, domain rules, and storage formats per data type Gartner March 2011 Copyright 2010 EMC Corporation. All rights reserved. 3
  • 4. Sample Big Data Scenarios LOAN PROCESSING AUTO INSURANCE SMART GRID ANALYTICS IN BANKING IN P&C INSURANCE IN UTILITIES/ENERGY REAL-TIME STATISTICALPROACTIVE EMERGENCY RESPONSE VIDEO ANALYTICS IN HEALTHCARE IN RETAIL PROCESS CONTROL IN MANUFACTURING Copyright 2010 EMC Corporation. All rights reserved. 4
  • 5. Big Data Analytics For CompetitiveAdvantage Suppliers Suppliers Who are my most valuable Manufacturing customers? Manufacturing Inventory Inventory Physical Assets Physical Assets What are my most Distribution important Services Distribution products? Personal Marketing Services Mass Additional Marketing Profits What are my most successful campaigns? Customers Customers Todays Business Model Big Data Analytics Business Model Copyright 2010 EMC Corporation. All rights reserved. 5
  • 6. Big Data meets Fast Data Social and Personal Every Minutes: Google gets more than 2 million search queries About 47,000 people download an App Some 100,000 tweets hit Twitter Almost 300,000 people log on to Facebook Business and Transactional: CERN (European Organization for Nuclear Research) generates 40TB/sec of scientific data Wal-Mart 1 million transactions per hour Worlds top systems currently trade at faster than 50 microseconds New York Stock Exchange generates 1TB of new trading data daily Copyright 2010 EMC Corporation. All rights reserved. 6
  • 7. Working together, they enable entirely New Business Models Big Data allows you to find opportunities you didnt know you had. Fast Data allows you to respond to opportunities before they are gone. In the Financial Services Industry, large quantities of historical data need to be processed against a growing number of fast-moving data feeds. Batch processing is no longer a suitable solution! Copyright 2010 EMC Corporation. All rights reserved. 7
  • 8. Effective Customer Segmentation is all about blending Structured and Unstructured Data Transaction data (structured data) tells you what the customer did. Unstructured data can tell you why they did it, why some others did not, what else they need or want, and what problems they may have. Copyright 2010 EMC Corporation. All rights reserved. 8
  • 9. Big Data Architecture Solving Big Data challenge involves more than just Requirements managing volumes of data. Gartner Multiple data types: structured, semi-structured, unstructured Integrated data stores: real-time, traditional, data warehouse Modern development tools: Java, lightweight messages, mobile-enabled Cloud-enabled: elastic scale, self-healing Beware point solutions integration is critical! Copyright 2010 EMC Corporation. All rights reserved. 9
  • 10. Greenplum Overview Copyright 2010 EMC Corporation. All rights reserved. 10
  • 11. Greenplum Product Line Copyright 2010 EMC Corporation. All rights reserved. 11
  • 12. Architecture of GreenplumFlexible framework for processing large datasetsProcess large datasets with support for SQLboth SQL and MapReduce MapReduce Master MasterMaster servers optimize queriesfor the most efficient query executionInterconnect for conti