Intelligent Indexing
Intelligent indexing is a type of coarse-grained index that includes data statistics.
When data is stored, every 65536 rows of data are packed into a Data Cell (DC)
package, and an intelligent index is automatically created for each package when the
data is imported. During data querying, statistics values can be obtained and data
GBase 8a MPP Cluster Technical White Paper
- 12 -
General Data Technology Co., Ltd.
filtering can be performed without unpacking the package, which can further reduce
I/O and significantly optimize complex queries. As shown in the figure below, the
"SmartIndex of Col 1" on the left represents the intelligent index built on the first
column data, with each data package (DC) as the unit.
Figure 3-3. Diagram of an intelligent index.
1)indexes without requiring users to manually create and maintain them.
2)Intelligent indexes themselves occupy a small space (about 1%), and do not
cause data inflation.
3)Intelligent indexes are established based on data packets, with fast creation
speed and unaffected by existing data packets.
Compared with traditional database indexing techniques, intelligent indexes are
established
on
data
packets
(coarse-grained
indexing),
and
each
field
is
automatically indexed, while traditional indexes are established on each row of data
(fine-grained indexing). Therefore, accessing intelligent indexes requires less I/O
than accessing traditional indexes. At the same time, the space occupied by
intelligent indexes is about one percent of the data, while traditional database
indexes can occupy 20-50% of the data.
GBase 8a MPP Cluster Technical White Paper
General Data Technology Co., Ltd.
- 13 -