How to Use NoSQL, as Seen Through Case Studies - db tech showcase Tokyo 2015, 2015/06/11

© 2015 MapR Technologies. Not Just RDB! How to Use NoSQL, as Seen Through Case Studies. MapR Technologies K.K., Sales Engineer 梅川 真人, 2015/6/11

Upload: mapr-technologies-japan

Post on 04-Aug-2015


Category: Data & Analytics



TRANSCRIPT

1. 2015 MapR Technologies RDB! NoSQL 2015/6/11
2. NoSQL MapR-DB Use Case #1 Use Case #2 MapR-DB 5.0
3. NoSQL No SQL Not Only SQL NoSQL RDB RDB
4. Web Web 1.0 Web 2.0 IO Write Read Write Read Write Read memcached Coherence RDBMS IO RDBMS BigData Web
5. NoSQL Join SQL API
6. NoSQL: Google FileSystem, Google MapReduce, Google BigTable, Apache Hadoop, HyperTable, Apache HBase, Facebook Cassandra, Amazon S3, Amazon Dynamo, Amazon SimpleDB, LinkedIn Voldemort
7. NoSQL: Dynamo, Cassandra; BigTable, HBase
8. NoSQL DISK MEMORY
9. NoSQL RDBMS LSM (Log Structured Merge) Tree NoSQL IO Read IO Write Read B+Tree RDBMS Write, IO Read
10. MapR-DB (MapR Enterprise Database Edition)
11. HBase NoSQL Eventually Consistent Eventually Inconsistent Ring NoSQL (Cassandra, Riak) Region HDFS Hadoop HBase MapReduce
12. HBase Key (Region) Region (RegionServer) () CF1 CF2 CF3 CF4 CF5 R1 R2 R3 R4
13. : RDBMS : / /SaaS RDBMS NoSQL
14. MapR-DB MapR Enterprise Database Edition (MapR M7 Edition) HBase API NoSQL MapR M7 HBase
15. MapR-DB 1. 2. 3. 4. 5. (LSM-Tree B+Tree) 6.
16. MapR-DB: Disks MapR JVM HBase Disks ext3 JVM HDFS JVM HBase Apache Disks Unified
17.
    $ pwd
    /mapr/default/user/dave
    $ ls
    file1 file2 table1 table2
    $ hbase shell
    hbase(main):003:0> create '/user/dave/table3', 'cf1', 'cf2', 'cf3'
    0 row(s) in 0.1570 seconds
    $ ls
    file1 file2 table1 table2 table3
    $ hadoop fs -ls /user/dave
    Found 5 items
    -rw-r--r-- 3 mapr mapr 16 2012-09-28 08:34 /user/dave/file1
    -rw-r--r-- 3 mapr mapr 22 2012-09-28 08:34 /user/dave/file2
    trwxr-xr-x 3 mapr mapr 2 2012-09-28 08:32 /user/dave/table1
    trwxr-xr-x 3 mapr mapr 2 2012-09-28 08:33 /user/dave/table2
    trwxr-xr-x 3 mapr mapr 2 2012-09-28 08:38 /user/dave/table3
18. HBase vs M7
19. HBase MapR-DB MapR-DB HBase CAP CP CP MicroWAL+MapR WAL+MemStore+HFile 100 64 3
20. Use Case #1
21. Aadhaar PEOPLE 20 BILLION BIOMETRICS
22. Aadhaar 2000(160 x 12) 8 20156
23. Aadhaar ID (, ) 12 Aadhaar number (100) HBase
24. Template Tables Audit Tables Auth, BFD, OTP Auth Data Server FMS, Portals Template Tables Audit Tables Auth, BFD, OTP FMS, Portals B Data Center A Data Center Read Read Write Write Read Read Write Sync Sync Aadhaar
25. Apache Hadoop Hadoop Tasktracker DR NameNode JobTracker ()
26. Use Case #2
27. 1TB 1200ms RDB (1CPU) 1
28. RDB RDB //ETL /ETL FTP FTP
29. MapR-DB MapRDB Web MapR-DB ETL (1) FTP FTP NFS NFS
30. MapR-DB 5.0
31. Elasticsearch (Storm, Spark Streaming) (Drill, Impala, Presto, ...) MapR Data Platform DISTRIBUTION INCLUDING HADOOP MapR-DB (MR, Spark, Hive, Pig, ) WEB MapR-FS Data Access
32. Traditional vs. Real-Time Search Indexing Traditional NoSQL Applications HBase API Elasticsearch API HBase Elasticsearch Batch indexing HBase API MapR-DB Updates Real-time indexing Updates Searches MapR-DB/Elasticsearch HBase/Elasticsearch Real-Time NoSQL Applications
33. MapR LOG FILES, CLICKSTREAMS SENSORS BLOGS, TWEETS, SNS DATA WAREHOUSE Data Movement Data Access BI, BA Ad-hoc integrated analytics ETL / Recommendation (Fraud Detection) MAPR DISTRIBUTION FOR HADOOP (Spark Streaming, Storm) MapR Data Platform MapR-DB MAPR DISTRIBUTION FOR HADOOP (MapReduce, Spark, Hive, Pig) MapR-FS (Drill, Impala)
34. MapR-DB 3. 2. 1. HBase
35. MapR Technologies
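
Slide 9 sets the LSM (Log Structured Merge) tree used by NoSQL stores against the B+Tree used by an RDBMS, but the explanation did not survive in the transcript. As a rough illustration of the general LSM idea only (not how MapR-DB or HBase implement it), the sketch below buffers writes in memory, flushes them as sorted immutable runs, and reads from newest to oldest. The class name `TinyLSM` and the `flush_threshold` parameter are hypothetical, chosen for this example.

```python
import bisect


class TinyLSM:
    """Toy LSM-style key-value store (illustrative only, not MapR-DB/HBase)."""

    def __init__(self, flush_threshold=2):
        self.flush_threshold = flush_threshold  # hypothetical tuning knob
        self.memtable = {}   # recent writes, held in memory
        self.runs = []       # flushed sorted runs, oldest first

    def put(self, key, value):
        # A write only touches memory until the memtable fills up.
        self.memtable[key] = value
        if len(self.memtable) >= self.flush_threshold:
            self._flush()

    def _flush(self):
        # Emit the memtable as one sorted, immutable run.
        self.runs.append(sorted(self.memtable.items()))
        self.memtable = {}

    def get(self, key):
        # Newest data wins: memtable first, then runs from newest to oldest.
        if key in self.memtable:
            return self.memtable[key]
        for run in reversed(self.runs):
            i = bisect.bisect_left(run, (key,))
            if i < len(run) and run[i][0] == key:
                return run[i][1]
        return None

    def compact(self):
        # Merge all runs into one, keeping only the newest value per key.
        merged = {}
        for run in self.runs:  # oldest first, so later runs overwrite
            merged.update(run)
        self.runs = [sorted(merged.items())] if merged else []


db = TinyLSM(flush_threshold=2)
db.put("a", 1)
db.put("b", 2)   # second write fills the memtable and triggers a flush
db.put("a", 3)   # newer value for "a" stays in the memtable
print(db.get("a"), db.get("b"), db.get("z"))
```

In this sketch a write never rewrites existing data in place, which is the usual reason LSM designs are described as write-friendly, while a read may have to consult several runs until compaction merges them. A B+Tree makes the opposite trade: it updates pages in place and finds a key through a single tree traversal.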