Method and device for determining relevancy of fields in database table

A correlation, database technology, applied in the computer field, can solve the problems of inability to quickly and accurately identify field types, inability to meet, lack of correlation analysis of database table fields, etc., to achieve the effect of visual output

Pending Publication Date: 2021-12-07
BEIJING WODONG TIANJUN INFORMATION TECH CO LTD +1
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This technology allows users to quickly analyze large amounts of databases by comparing their values across multiple tables without having them manually input each row separately. It also helps identify correlations that may exist among certain attributes within these tables. By grouping similar rows together, this technique makes it easier than traditional methods like linear regression to calculate an average level called correlation Index. Additionally, if there were any differences between numerical fields being compared, indicating potential problems such as misclassification, we could use this knowledge to create a graph showing how well they compare against another dataset containing related ones. Overall, this system improves efficiency and accuracy in relational analysis tasks while providing intuitive visual representations through its own set of rules.

Problems solved by technology

This patents describes two technical problem addressed by these previous researches related to determining correlations among multiple databases' rows. Firstly, current techniques require manual identification of each row separately but may result in confusion if an incorrect interpretation occurs due to factors like missing values from irrelevant categories. Additionally, they donot provide accurate results even on specific applications where datasets vary widely across columns. Finally, conventional approaches involve comparing entirety numbers over all possible combinations of column names without considering their importance based solely upon similarity scores calculated through pearson-correlating coefficients alone.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for determining relevancy of fields in database table
  • Method and device for determining relevancy of fields in database table
  • Method and device for determining relevancy of fields in database table

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0035] figure 1 is a schematic diagram of the main steps of the method for determining the field correlation in the database table according to the embodiment of the present invention. Such as figure 1 As shown, the method for determining the field correlation degree in the database table in the embodiment of the present invention can be specifically executed according to the following steps:

[0036] Step S101: For any two fields to be analyzed in the database table, determine the field type to which the field belongs according to the elements of each field.

[0037] In this step, the two fields to be analyzed may be fields of the same database table, or any fields in multiple database tables that can be associated with each other. For example, if the employee performance appraisal table and the employee basic information table can be related through the common employee name field, then correlation analysis can be performed on any two fields in the two database tables. In the

Embodiment 2

[0082] Figure 4 It is a schematic diagram of the specific execution of the method for determining the field correlation degree in the database table in the embodiment of the present invention. Such as Figure 4 As shown, the method for determining field correlation in a database table in the embodiment of the present invention may include three parts: preprocessing, correlation analysis, and result input.

[0083] In the preprocessing part, data cleaning needs to be performed on the database tables first. For example, if the database table is not in csv (Comma-Separated Values, comma-separated values) format, you need to split the header row and each row of records; unify the format of the elements in each field, and remove redundant spaces and punctuation , garbled characters, etc., unify invalid values ​​such as NULL, None, etc., and missing values ​​into empty characters. After that, the field type of the field to be analyzed can be determined according to the method in th

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method and device for determining relevancy of fields in a database table, and relates to the technical field of computers. A specific embodiment of the method comprises the following steps: for any two to-be-analyzed fields in a database table, judging a field type to which the field belongs according to an element of each field, wherein the field type comprises a numeric field and a classification field, and elements in the classification field belong to at least two element categories; when one of the two fields is a numeric field and the other field is a classified field, determining elements belonging to the same element category in the classified field, and forming an analysis group by the elements in the numeric field corresponding to the element; and determining an inter-group variance and an intra-group variance for each analysis group, and obtaining a correlation index of the two fields according to the inter-group variance and the intra-group variance. According to the embodiment, the relevancy of the numeric field and the classification field in any database table can be quantitatively calculated, and unified analysis of the relevancy of different types of fields can be achieved.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner BEIJING WODONG TIANJUN INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products