Easy way to learn and implement the Microsoft technologies.
Showing posts with label google cloud functions tutorial. Show all posts
Showing posts with label google cloud functions tutorial. Show all posts
Wednesday, May 8, 2024
Cloud Functions - How to Read PDF Files on GCS Events and Store in BigQuery
In this tutorial, you will learn "How to create an event-driven Cloud Function that reads PDF files from Google Cloud Storage (GCS) and pushes their contents into BigQuery" in GCP.
Labels:
google cloud functions Gen2,
google cloud functions tutorial,
google cloud functions tutorial python,
Read PDF file by Cloud Function
With over 17 years of experience in the Data Engineering stack across a variety of cloud and on-premises systems, I have successfully delivered more than ten complete business product solutions. My expertise lies in building robust infrastructure and architecture to support data engineering, data analytics, and machine learning processes. These solutions have significantly improved collaboration among cross-functional teams, including data scientists, business analysts, software engineers, and stakeholders.
Key Contributions
Data Modelling and Integration
• Data Modeling: Developed various data models to produce suitable data for business users, data analytics, data science, and data visualization teams.
• Legacy Systems and Cloud Technologies: Integrated legacy systems with modern cloud-based technologies (AWS, Azure, GCP), data lakes, and data warehouses.
• Streamlined Data Pipelines: Built efficient data pipelines, data warehouses, BI reports, and dashboards to streamline data access and insights.
Thursday, February 8, 2024
Google Cloud Platform - How to Create Gen2 Cloud Function
In this article, you will learn how to Create Python based Gen2 Cloud function in Google Cloud Platform.
Cloud Functions are a serverless computing service offered by Google Cloud Platform (GCP) which are an easy way to run your code in the cloud.
It supports Java, Python, Ruby, Node.js, Go, and .Net.
Currently, Google Cloud Functions support events from the following providers- HTTP, Cloud Storage, Cloud Firestore, Pub/Sub, Firebase, and Stackdriver.
Gen1 is more lightweight, one concurrency per instance, simple features and less knob to tweak, cheaper, it's pretty much deploy and forget, it is actually an AppEngine standard,
while gen2 is on Cloud Run (on GKE), you have more control, up to 1k concurrency per instance, larger resources, longer timeouts, etc, If you don't need it, just use gen1.
To complete the tasks outlined above, you must have a GCP account and appropriate access.
Data validation — Data validation is the process of checking the data against predefined rules and standards, such as data types, formats, ranges, and constraints.
- 💫Schema Validation: Verify data adherence to predefined schemas, checking types, formats, and structures.
- 💫Integrity Constraints: Enforce rules and constraints to maintain data integrity, preventing inconsistencies.
- 💫Cross-Field Validation: Validate relationships and dependencies between different fields to ensure logical coherence.
- 💫Data Quality Metrics: Define and track quality metrics, such as completeness, accuracy, and consistency.
- 💫Automated Validation Scripts: Develop and run automated scripts to check data against predefined rules and criteria.
To learn more, please follow us -
🔊 http://www.sql-datatools.com
To Learn more, please visit our YouTube channel at —
🔊 http://www.youtube.com/c/Sql-datatools
To Learn more, please visit our Instagram account at -
🔊 https://www.instagram.com/asp.mukesh/
To Learn more, please visit our twitter account at -
🔊 https://twitter.com/macxima
Labels:
Event based Cloud functions,
firebase cloud functions,
GCP Cloud Function,
google cloud functions tutorial,
google cloud functions tutorial python,
google-cloud/functions-framework
With over 17 years of experience in the Data Engineering stack across a variety of cloud and on-premises systems, I have successfully delivered more than ten complete business product solutions. My expertise lies in building robust infrastructure and architecture to support data engineering, data analytics, and machine learning processes. These solutions have significantly improved collaboration among cross-functional teams, including data scientists, business analysts, software engineers, and stakeholders.
Key Contributions
Data Modelling and Integration
• Data Modeling: Developed various data models to produce suitable data for business users, data analytics, data science, and data visualization teams.
• Legacy Systems and Cloud Technologies: Integrated legacy systems with modern cloud-based technologies (AWS, Azure, GCP), data lakes, and data warehouses.
• Streamlined Data Pipelines: Built efficient data pipelines, data warehouses, BI reports, and dashboards to streamline data access and insights.
Subscribe to:
Posts (Atom)