What is an index if there is no index , The number of records scanned is greater than the number of records indexed

The index stores the value of the index column ( such as id Index column , Then store the value of the index column ), Address of the row corresponding to the index value in memory ( Or directly store the data of this row )

SELECT * FROM user WHERE username= 'jiajun'
,username Indexing , If the data structure of the index is hash surface , So at this time , By calculation jiajun of hash value ,O(1) Complexity can find the location of the record

hash Index under equivalent search , None at this time hash conflict , In this case , Efficiency is very high

But under the scope search , because hash Not orderly , Then search the range ,hash The advantages of table cannot be brought into play .

stay hash Under conflict , Search efficiency will decrease

Disk read disk read steps : Fixed cylinder , Fixed track , Fixed magnetic block

Disk time is mainly consumed on the positioning cylinder , So if you want to improve the speed , With the same amount of data , Put as much data as possible on the disk block , Then this can reduce the number of times the head positioning cylinder moves , reduce IO Times of .

The value of all nodes in the left subtree of the binary search tree is less than the value of its root node

The value of all nodes in the right subtree is greater than that of its root node

The left and right subtrees of any node are binary search trees

No node with equal key value

Analyze the complexity of binary search tree lgn

But is there any way to reduce it IO Times of , That is, can we reduce the height of the tree

B- tree

(m Step tree m/2<=k<=m) At least two child nodes of the root node

All leaf nodes are on the same layer

Intermediate node contains k-1 Elements and k Children

The elements in the node are arranged from small to large

Each node contains the value of the index column , And this data record ( Or the value of this data record )

Analysis relative to binary lookup tree ,B The tree became short and fat , Because each node stores more elements , So with the same element , Reduced the height of the tree , Then you can reduce IO Times of

Each node stores data ( The value of this line record or the address of this line record in memory ), So different query performance is different .

B+ Tree in B- On the basis of trees

Except for leaf nodes , Other nodes do not contain records ( Rows in the database ) Location of

The leaf node contains all index values , And arranged from small to large , And records ( Row in database ) Location of

Analysis if the nodes are the same size , So if we except leaf nodes , Other nodes do not contain data , Then you can put more elements ( Index value ), In this way, the tree will become more short and fat , that IO The number of times can be further reduced

Because the elements of leaf nodes are arranged in order , And a linked list is formed between leaf nodes , Then the efficiency of range query can be improved during orderly search

be relative to B tree , Because all data is stored in leaf nodes , That means that every search must find the leaf node from the root , This means that the query performance is average .

Summary index is a kind of data , Can avoid full table query , It can be compared with catalogues and books .

Indexes need a data structure to store

Use hash table (hash) The query complexity can reach O(1), But when querying the range ,hash Can't improve performance

IO Operation is time-consuming , To improve query performance , Can reduce IO Times of

For the storage structure of the tree , To improve performance , reduce IO Times of , Can lower the height of the tree

Read a node once IO, With the same amount of data , If each node can store more elements , Then the height of the tree can be reduced .

B The tree lowered the height of the tree , When the node size is the same , because B The nodes of the tree store elements and data , and B+ The tree stores all the data in the leaf node , So in that case , Each node can store more elements , Then you can lower the height of the tree again

B+ The query performance of the tree is more stable , And it is more conducive to range search

If it is a clustered index (InnoDB engine ), Then the data of this record stored in the node , The data file itself is an index file

If it is a nonclustered index (MyISAM engine ), Then the node stores the address of the record in this line . Index file and data file are separated

Type of index general index , Allow the same content

unique index , Unique index value , Allow null values

primary key , Automatically create a primary key index when creating a primary key , Unique and cannot be empty

Composite index , Multi column composite index

Use of indexes ALTER TABLE table_name ADD INDEX index_name (column_list) Add general index

ALTER TABLE table_name ADD UNIQUE (column_list) Add unique index

ALTER TABLE table_name ADD PRIMARY KEY (column_list) Add primary key index

Note that if this is username,age,sex Build joint index

Leftmost match means that the leftmost index is matched first ,(username)(username,age)(username,age,sex), As long as the query criteria use the leftmost column , Generally, index will be used . The order can be different , such as (age,username), This is the credit of the query optimizer .

Fuzzy query only % No. is not in the first character , Index can be used , such as username like '%jiajun' So it is not adopted

If or There is a condition in which there is no index ,sql Statement does not use index , such as usernmae ='jiajun' or pwd='666', The index is not adopted at this time

In composite index , If the query condition is not the first column of the index , Index may not be adopted , For example, at this time where age =1

If the column is character type , such as username It is a character type and an index column , If this is a query username=1 , Without quotation marks , Then the index will not be used at this time

Can be used show status like 'Handler_read%' To view index usage .

It is suggested to give priority to practice

Index principle index should be designed in where Column after , instead of select Column after

Indexes should be built on columns with large discrimination , For example, the status is only 1 and 2 There is no need to build an index

When indexing strings , A prefix length should be specified , For example, a column is char(200), If the first few characters need to be distinguished , Then index the first few characters , This reduces the footprint , Also increased speed

Don't create too many indexes , Index takes up space , And the speed will be reduced when updating , And if there are too many indexes ,Mysql When implementing the plan , Each index will be considered , It's also a waste of time

There is no doubt about the advantages and disadvantages of indexing , When used correctly , Index can improve query speed

Indexing can also improve the speed of grouping and sorting

When deleting and adding due to modification , To maintain index files , Adjust the tree , So the performance is reduced

Index files also need space

Technology