At 2:15, Jagan says:
"Data is stored in Colossus, which is BigQuery's columnar storage which is encrypted, replicated and distributed making it highly durable against failures"
He is mixing up two concepts: Colossus (Google's distributed file system) and Capacitor (BigQuery's columnar storage format).
Jagan should have said:
"Data is stored in Capacitor, which is BigQuery's columnar storage format. In turn, Capacitor files are stored in Colossus, which is Google's encrypted, replicated and distributed file system, making the data highly durable against failures."
It's nice to have the PowerPoint presentation, but what about a technical walkthrough of how we can actually implement this in real time?
At 11:41, you mention 'Ingest from GCS or HTTP POST...' Could you please explain what you mean by the 'HTTP POST' method, or give an example?
Loading data from a local data source: cloud.google.com/bigquery/docs/loading-data-local
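To make the 'HTTP POST' part concrete, here is a rough sketch of what a local-file load looks like on the wire, as far as I understand it: a load job plus the file contents sent in one multipart POST to the BigQuery upload endpoint. The project, dataset, table and token values are placeholders I made up, and `build_load_request` is a hypothetical helper, not part of any Google library. It only builds the request and does not send it. In practice you would more likely let the `bq load` command or the client library's `load_table_from_file` do this for you.

```python
import json
import urllib.request

# Upload endpoint for BigQuery load jobs (multipart upload variant).
UPLOAD_URL = (
    "https://bigquery.googleapis.com/upload/bigquery/v2/"
    "projects/{project}/jobs?uploadType=multipart"
)


def build_load_request(project, dataset, table, csv_bytes, access_token):
    """Build (but do not send) an HTTP POST that loads CSV bytes into a table.

    Hypothetical helper for illustration; all argument values are placeholders.
    """
    # Part 1 of the body: the load job configuration as JSON.
    job = {
        "configuration": {
            "load": {
                "sourceFormat": "CSV",
                "skipLeadingRows": 1,
                "autodetect": True,
                "destinationTable": {
                    "projectId": project,
                    "datasetId": dataset,
                    "tableId": table,
                },
            }
        }
    }
    boundary = "bq_load_boundary"  # arbitrary separator, assumed not to occur in the data
    # Part 2 of the body: the raw file contents.
    body = b"\r\n".join([
        f"--{boundary}".encode(),
        b"Content-Type: application/json; charset=UTF-8",
        b"",
        json.dumps(job).encode("utf-8"),
        f"--{boundary}".encode(),
        b"Content-Type: application/octet-stream",
        b"",
        csv_bytes,
        f"--{boundary}--".encode(),
        b"",
    ])
    return urllib.request.Request(
        UPLOAD_URL.format(project=project),
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": f"multipart/related; boundary={boundary}",
        },
    )


if __name__ == "__main__":
    req = build_load_request(
        "my-project", "my_dataset", "my_table",
        b"name,age\nalice,30\n", "ACCESS_TOKEN",
    )
    print(req.get_method(), req.full_url)
```

Sending it would be `urllib.request.urlopen(req)` with a real OAuth access token. The response describes a load job, which you would then poll until it finishes.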
Awesome, Jagan
Thank You
He is just reading slides