bigquery, fluentd and tagomoris #gcpja

21
BigQuery, Fluentd, and tagomoris gcp ja night #28 2014/09/16 TAGOMORI Satoshi(tagomoris)

Upload: satoshi-tagomori

Post on 24-Jan-2015

2.281 views

Category:

Technology


1 download

DESCRIPTION

 

TRANSCRIPT

Page 1: BigQuery, Fluentd and tagomoris #gcpja

BigQuery,Fluentd,and tagomoris

gcp ja night #282014/09/16

TAGOMORI Satoshi(tagomoris)

Page 2: BigQuery, Fluentd and tagomoris #gcpja

Satoshi Tagomori (@tagomoris)LINE Corporation

Analytics Platform Team

Page 3: BigQuery, Fluentd and tagomoris #gcpja

スーパードゥラアアアアァァァァァァアアアアアアアアァァイイイイイイイイイイイイイイイイィィィィィイイイイイイイイィイィイイイイィイィ

Page 4: BigQuery, Fluentd and tagomoris #gcpja

極度乾燥(しなさい)。

Page 5: BigQuery, Fluentd and tagomoris #gcpja

BigQuery

What all of us love :)

Page 6: BigQuery, Fluentd and tagomoris #gcpja

Fluentd

It is that with cute logo :)

Page 7: BigQuery, Fluentd and tagomoris #gcpja

Fluentd

Readable configuration

Flexible buffer system

Various input/output plugins

Very simple/easy plugin system

High performance for many uses

Page 8: BigQuery, Fluentd and tagomoris #gcpja

fluent-plugin-bigquery

Insert events from Fluentd into BigQuery

over streaming inserts

Table Sharding inserts

Page 9: BigQuery, Fluentd and tagomoris #gcpja

versions

v0.0.x ~ v0.1.x

by @tagomoris

many patches from @yugui

v0.2.x

KAIZEN platform Inc (@naoya_ito)

Page 10: BigQuery, Fluentd and tagomoris #gcpja

tagomoris: I want new maintainer for fluent-plugin-bigquery, who uses BQ actually....

Page 11: BigQuery, Fluentd and tagomoris #gcpja

naoya_ito: OK, KAIZEN platform will do!

Page 12: BigQuery, Fluentd and tagomoris #gcpja

tagomoris: Great! I’ll transfer my repository to KAIZEN’s account... Can I have commit bit for Fluentd related fixes?

Page 13: BigQuery, Fluentd and tagomoris #gcpja

naoya_ito: Sure!

Page 14: BigQuery, Fluentd and tagomoris #gcpja
Page 15: BigQuery, Fluentd and tagomoris #gcpja

( д) ゚ ゚

Page 16: BigQuery, Fluentd and tagomoris #gcpja

Disclosure: I’m an employee of LINE now!

Page 17: BigQuery, Fluentd and tagomoris #gcpja

performance

Use Table Sharding inserts

tables table1,table2,table3,table4

Use many threads for concurrent insertion

num_threads 4

(Same with tables)

Page 18: BigQuery, Fluentd and tagomoris #gcpja

FeaturesAuthentication

auth_method compute_engine

auth_method private_key

Schema

specs per fields

schema_path

fetch_schema

Page 19: BigQuery, Fluentd and tagomoris #gcpja

API Quotanumber of records over streaming inserts

10,000 rows per sec per table

10MB per sec per table

Use Table Sharding

Max row size: 20KB

Max data size per insert: 1MB

Max rows per request: 500

Page 20: BigQuery, Fluentd and tagomoris #gcpja

Hobby programming & Cloud services

For hobby programming

To setup environments is very troublesome...

Cloud services are easy to use!

Cloud service specific limitations/restrictions are funny to play with!

Page 21: BigQuery, Fluentd and tagomoris #gcpja

Enjoy!