Data Standard Inspection Tool (1): Overview

This article introduces a data standard inspection tool for field data architects (DAs) who are burdened by repetitive manual work during data standardization. By cutting the time spent on manual steps, you can focus on more productive tasks and reduce or eliminate overtime.

This tool was built with Excel VBA. (Reference: Excel VBA Course (1): Overview of Excel VBA)

1. Overview of the Data Standard Inspection Tool

Data standardization, one of the tasks of a data architect, is quite a daunting task.

You collect the current attribute names (logical and physical), data types, and lengths from database table/column comments, table definitions, and similar sources. You then split each name into words by semantic unit and define a single term, so that columns with the same meaning share the same logical name, physical name, data type, and length. This process requires many manual repetitions.
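As a rough illustration of why this step is repetitive, the sketch below groups collected columns by logical name and reports the names that are defined in more than one way. It is not taken from the tool itself (which is written in Excel VBA); the sample columns and the function name `find_inconsistent_terms` are hypothetical.

```python
from collections import defaultdict

# Hypothetical sample of collected column metadata:
# (table, logical name, physical name, data type, length)
columns = [
    ("CUSTOMER", "Customer Name", "CUST_NM", "VARCHAR", 50),
    ("ORDERS", "Customer Name", "CUSTOMER_NAME", "VARCHAR", 100),
    ("ORDERS", "Order Date", "ORD_DT", "DATE", None),
]


def find_inconsistent_terms(columns):
    """Return logical names that have more than one distinct definition."""
    definitions = defaultdict(set)
    for _table, logical, physical, data_type, length in columns:
        definitions[logical].add((physical, data_type, length))
    # Keep only logical names whose physical name, type, or length differ.
    return {
        name: sorted(defs, key=str)
        for name, defs in definitions.items()
        if len(defs) > 1
    }


inconsistent = find_inconsistent_terms(columns)
for name, defs in inconsistent.items():
    print(name, defs)
```

Each name reported here is a candidate for one standard term; deciding which definition wins is the manual part.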

(Note: This article does not explain the concept of data standardization in detail. Please refer to the URL below.)
https://dataonair.or.kr/db-tech-reference/d-guide/da-guide/?mod=document&uid=269

Data standardization can be summarized as a process of repeatedly reviewing the basic dictionaries and the current attribute names, building word/term/domain dictionaries in Excel sheets, and then supplementing and finalizing them through reviews with the people responsible for the standards.

Data standardization workflow summary

In this process, several detailed tasks are executed, such as:

  • Change an attribute's logical name
  • Change a word's physical name
  • Mark or unmark a word as an attribute classifier
  • Specify the domain classification name for an attribute classifier
  • Register two (or more) words as a compound word
  • Specify synonym relationships: make only one word canonical and the other synonyms non-canonical
  • Change a domain's data type or length
  • etc.
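To make the compound-word and synonym operations above concrete, here is a minimal sketch, assuming a simple in-memory word dictionary. The dictionary contents and the function `standardize_words` are hypothetical and do not reflect how the tool stores its data.

```python
# Hypothetical word dictionary: logical word -> physical abbreviation.
# "Zip Code" is registered as a compound word.
WORDS = {"Customer": "CUST", "Name": "NM", "Zip Code": "ZIP"}

# Hypothetical synonym relationships: non-canonical word -> canonical word.
SYNONYMS = {"Client": "Customer"}


def standardize_words(logical_name, words=WORDS, synonyms=SYNONYMS):
    """Split a logical name into canonical dictionary words."""
    tokens = logical_name.split()
    result = []
    i = 0
    while i < len(tokens):
        # Try the longest span first so registered compound words win.
        for size in range(len(tokens) - i, 0, -1):
            candidate = " ".join(tokens[i:i + size])
            candidate = synonyms.get(candidate, candidate)
            if candidate in words:
                result.append(candidate)
                i += size
                break
        else:
            raise KeyError(f"unregistered word: {tokens[i]}")
    return result


print(standardize_words("Client Zip Code"))
```

Here the non-canonical "Client" is replaced by the canonical "Customer", and "Zip Code" is matched as one compound word rather than two separate words.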

These operations are repeated for each current attribute name and its data type/length in order to 1) use only standard words, 2) assign a standard domain, and finally 3) define a standard term for the attribute.
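The three steps can be sketched as follows. This is only an illustration under stated assumptions: the dictionaries are hypothetical samples, the last word of a name is assumed to be its attribute classifier, and the physical name is assumed to be the word abbreviations joined by underscores. None of this is taken from the tool's actual logic.

```python
# All dictionary contents below are hypothetical samples.
WORDS = {"Customer": "CUST", "Order": "ORD", "Name": "NM", "Date": "DT"}
CLASSIFIER_DOMAIN = {"Name": "Name", "Date": "Date"}  # classifier -> domain
DOMAINS = {"Name": ("VARCHAR", 100), "Date": ("DATE", None)}


def build_standard_term(logical_name):
    """Derive a standard term from a space-separated logical name."""
    words = logical_name.split()
    if not words:
        raise ValueError("empty logical name")

    # 1) Every word must be a standard word.
    unknown = [w for w in words if w not in WORDS]
    if unknown:
        raise ValueError(f"non-standard words: {unknown}")

    # 2) Assumption: the last word is the attribute classifier,
    #    and the classifier selects the standard domain.
    classifier = words[-1]
    if classifier not in CLASSIFIER_DOMAIN:
        raise ValueError(f"no domain for classifier: {classifier}")
    data_type, length = DOMAINS[CLASSIFIER_DOMAIN[classifier]]

    # 3) The standard term combines the names with the domain's type/length.
    return {
        "logical_name": logical_name,
        "physical_name": "_".join(WORDS[w] for w in words),
        "data_type": data_type,
        "length": length,
    }


print(build_standard_term("Customer Name"))
```

A name that fails step 1 or step 2 is exactly the kind of case that sends the DA back to the word or domain dictionary, which is why the operations are repeated so often.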

Most data architects (DAs) who perform data standardization in the field do the work manually in Excel. Various vendors offer metadata management solutions that manage data standards, but most of these products focus on managing the standard data dictionary after it has been created rather than on creating it.

I also ran into many difficulties while performing data standardization work at various clients. I first attempted to build a tool in 2012 and, by working on it steadily in my spare time, had a tool with basic functions by April 2016. The tool was developed using Excel VBA, and its screen is shown below.

Data standard inspection tool screen

This tool can be found in the GitHub repository below.

https://github.com/DAToolset/ToolsForDataStandard

Alternatively, you can download directly from this URL.

https://github.com/DAToolset/ToolsForDataStandard/blob/main/Property standard check tool_v1.33_20210606_1.xlsm?raw=true


In the next article, we will take a look at the composition of the data standard dictionary and the standard checking function.


2 Responses

  1. 한경훈 says:

    Black teacher error occurs

    No error occurs when opening property standard check tool_v1.36_20230505_1.xlsm,
    but when you press the “check button” or any other button, the following error message appears:

    &H8000FFFF (-2147418113) A system error occurred.

    out of memory

    -> The monitor is set to 100%.
