Andrew Gilliland

Data Governance

Last Updated: November 19, 2024

What is Data Governance?

Data governance is the process of managing the availability, usability, integrity, and security of the data used in an organization. It involves establishing policies, procedures, and standards to ensure data is accurate, consistent, and used responsibly. Effective data governance helps organizations make better decisions, comply with regulations, and protect sensitive information. The following tools from AWS help organizations achieve data governance:

AWS Glue

A fully managed extract, transform, and load (ETL) service designed to make it easy for users to prepare and load their data for analytics.

AWS Glue Data Catalog

A fully managed metadata repository provided by AWS Glue that serves as a central metadata store for all your data assets. This makes data easier to discover, organize, and manage.

AWS Athena

An interactive query service provided by AWS that allows you to analyze data directly in Amazon S3 using standard SQL.

AWS Lake Formation

A fully managed service provided by AWS that simplifies the process of building, securing, and managing data lakes.

AWS Redshift

A fully managed data warehouse service provided by AWS designed to handle large-scale data analytics and processing.

Amazon DataZone

A data management service provided by AWS that helps organizations catalog, discover, share, and govern their data across various AWS services and on-premises data sources. Amazon DataZone helps organizations unlock the value of their data by making it more accessible and manageable while maintaining governance and security controls.