Now Google BigQuery is truly enterprise-ready DW (DML without limits; Integer range partitioning)
I have been using BigQuery for a couple of years and designing/migrating enterprise data warehouses but the new announcement on limitless DML inspired me to write about its beauty.
I do have strong hands-on experience with on-prem traditional data warehouse & data warehousing across cloud providers (Redshift, SQL Data Warehouse a.k.a. Synapse). I always considered Google BigQuery as the next generation DW but following two nonexistence bothering me all the time:
- Limited DML operations: It’s limitless now since Feb 26, 2020 announcement.
- Date only partition: It used to support Date & Timestamp(it is an illusion, practically it’s also a date partition). Now BigQuery also supports tables partitioned on an integer column as well. I hope in the near future there will be more partitioning options like the oracle.
DML was always a struggle while migrating data from traditional enterprise data warehouses and especially dealing with the CDC(change data capture). In my recent Oracle EDW migration to BigQuery, managing update/delete on the new CDC was a big challenge.
Top 10 BigQuery features which I do consider it is the next generation EDW:
- Quick setup & the most cost-effective DW
- Fully managed and server-less
- Scales on-demand from gigabytes to petabytes
- Seamless access due to data replication so the data is always available
- Automatic snapshots and data backup reduces worries about unexpected data changes
- Encrypted, durable, and highly available
- Built-in machine learning for predictive analytics
- In-memory BI Engine for blazing-fast reporting
9. This is how BigQuery removes the traditional data warehousing barriers:
10. Real-time insights: Insert up to 100,000 rows of data per second and analyze business events as they unfold.
Read more about BigQuery here.
Happy coding & keep spreading happiness and kindness :)