Welcome to my tech blog, where I share my tutorials and projects.
Don’t fear the unknown; embrace it.
Experience is the ultimate teacher. Dive in, get your hands dirty, and uncover the power of learning by doing.
Ivy Wang
BigQuery Analysis for Beginners
BigQuery is a fully managed, AI-ready data analytics platform and enterprise warehouse that helps you maximise value from your data.
BigQuery is designed to handle large datasets efficiently and also supports standard SQL syntax. Learning BigQuery can be valuable for data professionals, analysts, and organisations looking to leverage the power of cloud-based data warehousing and analytics for their data-driven decision-making processes.
Redis Tutorial
Redis is a powerful and versatile open-source, in-memory data structure store that can be used as a database, cache, and message broker. This beginner’s tutorial aims to introduce users to the fundamental concepts and functionalities of Redis.
The tutorial covers basic data structures supported by Redis, including strings, lists, sets, sorted sets, and hashes. It provides clear examples and explanations of how to perform CRUD (Create, Read, Update, Delete) operations on these data types.
Furthermore, the tutorial delves into more advanced features such as transactions, pub/sub messaging, and key expiration. It demonstrates how to utilize these features effectively to build robust applications.
MongoDB Tutorial
MongoDB is a popular NoSQL database that offers flexibility and scalability for modern applications. This tutorial will introduce you to the basics of MongoDB, including installation, data modelling, querying, and more.
The tutorial begins with an introduction to NoSQL databases, exploring the fundamental concepts behind their rise in popularity and their distinct advantages over traditional relational databases. We delve into the various types of NoSQL databases, with a particular emphasis on MongoDB’s document-oriented architecture and its suitability for a wide range of use cases.
BigQuery ML Tutorial
You will learn how to build machine learning models directly within BigQuery, without the need for complex data transfers or additional infrastructure. You’ll learn how to select appropriate features, choose the right model type, and evaluate model performance using familiar SQL syntax.
- Regression model
- Classification model
- Clustering model
- Time Series model
- Anomaly detection model
- Matrix Factorisation model
PySpark Tutorial
PySpark is a powerful tool for processing large-scale data using the Python programming language. This tutorial is designed for beginners to help them get started with PySpark and gain a solid understanding of its core concepts and functionalities.
The tutorial begins by introducing PySpark and explaining its importance in big data processing. Additionally, it covers advanced topics such as DataFrame and SQL operations in PySpark, providing users with alternative ways to analyze and manipulate data.
Airflow Tutorial
Apache Airflow is an open-source platform used for orchestrating and scheduling complex workflows. The tutorial covers the fundamental concepts of Airflow, including its architecture, components, and workflow definition through Directed Acyclic Graphs (DAGs).
Through practical examples you will learn how to create, schedule, and monitor workflows efficiently using Airflow’s rich set of features such as operators, sensors, and hooks. Ultimately, you’ll attain proficiency in designing workflows tailored for data processing, ETL pipelines, and seamless task automation.