How to build a sustainable data ecosystem on Google Cloud

779 views

PracticalGCP

Comments: 14
@WiktorJurek 9 months ago
This is bang on. It would be awesome to see how this works in practice - as in, how all of this looks in the console, how to set it up, and practically how you can oversee/manage this kind of setup.
@practicalgcp2780 9 months ago
There's quite a lot of effort involved, but the foundation isn't that difficult to set up. It's not as if there is a single UI where everything can be done, though. I think the entry point for data management and discovery for a large group of users can be the catalog tool, and a platform team can own the tooling for things like quality scans and Analytics Hub while making them self-service. Some things, especially the data quality check rules, I would prefer to keep in version control so it's much easier to control the changes and the quality of the checks, whereas for other things, like Analytics Hub, the UI should be sufficient as long as there is a way to recover if something goes wrong.
@alifarah9 8 months ago
Really appreciate these high-quality videos! Seriously, your videos are better than the official GCP videos. What makes them invaluable is that you teach from first principles and talk about problems that will be faced in any cloud environment, not just GCP.
@practicalgcp2780 8 months ago
Thank you so much 🙏 You are right, the principles are very much the same no matter which cloud provider it is. My focus is GCP because I believe that, as an ecosystem, it's much more powerful yet remains the easiest to implement and scale compared to other cloud providers.
@alexanderpotts8425 9 months ago
Knocking it out of the park as usual. I'm trying to get adoption of some of these in my team already!
@practicalgcp2780 9 months ago
Amazing to see you find it useful. I believe a lot of the things I covered are what we are already doing every day; I was trying to put everything together in a more structured way, hopefully to help a wider crowd adopt these technologies and methods.
@JohnMcclaned 9 months ago
Would love to see a video about how to use AlloyDB to an ordered pubsub topic
@practicalgcp2780 9 months ago
I thought about the challenges of event-based data consumption from message queues, but decided not to cover them in this video. Event-based data consumption in real time has very different challenges, and I don't believe it causes the same pain as data stored in analytic databases. Sure, managing those is important, but in my experience event-based applications are very bespoke, already have clear data contracts because they are mission-critical, are mostly built by data engineering teams, and are well maintained. Unfortunately, the same cannot be said for data consumed in analytic databases. As for AlloyDB, I assume you are using it for more bespoke use cases, as it's not typically used to store all data permanently for a large group of teams to consume.
@JohnMcclaned 9 months ago
@practicalgcp2780 I am building an event-sourced event store and I need a way to have ordered changes propagated out. I am defaulting to 1-second interval polling, though I am exploring other solutions.
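The interval-polling approach mentioned in this reply can be sketched roughly as follows. This is a minimal illustration, not the commenter's implementation: it uses an in-memory SQLite table as a stand-in for the event store, and the table name `events`, the `seq` column, and the helper functions are all hypothetical. It assumes each event gets a monotonically increasing sequence number and that the consumer remembers the last one it processed.

```python
import sqlite3
import time


def create_store(conn):
    # Hypothetical event table: seq gives a total order over appended events.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events ("
        "seq INTEGER PRIMARY KEY AUTOINCREMENT, payload TEXT NOT NULL)"
    )


def append_event(conn, payload):
    # Append one event and return the sequence number assigned to it.
    cur = conn.execute("INSERT INTO events (payload) VALUES (?)", (payload,))
    conn.commit()
    return cur.lastrowid


def fetch_after(conn, last_seq, limit=100):
    # Read the next batch of events strictly after last_seq, in order.
    return conn.execute(
        "SELECT seq, payload FROM events WHERE seq > ? ORDER BY seq LIMIT ?",
        (last_seq, limit),
    ).fetchall()


def poll(conn, handler, last_seq=0, interval=1.0, max_polls=None):
    # Poll repeatedly; sleep only when a poll finds nothing new.
    # max_polls bounds the loop for demos and tests (None means run forever).
    polls = 0
    while max_polls is None or polls < max_polls:
        batch = fetch_after(conn, last_seq)
        for seq, payload in batch:
            handler(seq, payload)
            last_seq = seq  # advance the checkpoint only after handling
        polls += 1
        if not batch:
            time.sleep(interval)
    return last_seq
```

One caveat: in a PostgreSQL-compatible database with concurrent writers, sequence values are assigned before commit, so a higher number can become visible before a lower one. A checkpoint on `seq` alone could then skip events, and a real implementation would need to account for that.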
@SwapperTheFirst 9 months ago
I like this format of battle stories/coaching.
@practicalgcp2780 9 months ago
Thanks ☺️ I thought I might try a different way to present; it feels like more people can relate to this.
@SwapperTheFirst 9 months ago
Any examples of such tools for cataloging, certification and lineage? Especially OSS? I had some experience with Qlik Catalog, but I'm not sure if it is a good choice for GCP or how well it integrates with BQ. Beyond the usual suspects (Collibra, Immuta, ...)
@practicalgcp2780 9 months ago
There are a few GCP partners with very good GCP integration, which saves engineers a lot of time on metadata integration. Collibra is one of them, as you already mentioned; you can also look at Atlan, a newer player in the field that has some powerful features too. Those are the two I am aware of that, in my view, have pretty good integration and features, but please do your own research: there are pros and cons, and these are not recommendations. By OSS, do you mean support systems like JSM?
@SwapperTheFirst 9 months ago
@practicalgcp2780 nope, I mean open source software, like Apache Airflow for workflow management, from which you can also build managed solutions like Astronomer or Cloud Composer. I think something should exist in this space too?