The cluster environment has been configured zookeeper,hadoop. And distributed database hbase. In this stage, we will start the practice of building a data warehouse . Software involved :
mysql, hive . 

      background :

         

      ( I used to think that a data warehouse is a database of a database ..)

          so ,hive The essence is a hadoop Tools for . 

   mysql Naturally, you don't have to search too much .  It should be noted that it is in the centos7 But there are some fastidious installation .    originally centos use yum
It can be very convenient to mysql Service installed , No environment variables need to be configured , But my network environment is not good , I have to download the installation package !! 
  stay ubutu Installed on mysql Of tar.gz package , stay windows It can also be downloaded from zip package , however centos The top needs the next  
bundle Bag of , And pay attention to the version .  adopt bundle Installation is still the first time , So there are a lot of problems . 

step :

      1. centos 7 Installation on mysql:

                  Download and unzip the downloaded bundle package , And adopt rpm Install . 

                 

                Change password policy and change password :

               

              mysql Some of its own operation task commands :

                   

                   

                   

                   

                    

                    

            A lot of orders are Mysql Required by itself , Although it has been used many times , But every time you use it, you still need to check it . 
It is estimated that it was not recited as an ancient poem in class , Or is it that your brain doesn't feel the need to remember ..

   2.hive Server installation :

          2.1 Extract and configure environment variables : 

                   

          2.2 stay hive Server side , take mysql Client connections for : mysql-connector Copy to :hive Of lib lower . 
be careful , this mysql-connector Just copy to hive Server side , Because only the server needs to follow the mysql signal communication .
hive The client only needs to communicate with the server , So there are no steps . 

          2.3 to configure hive Server configuration :  conf/hive-env.sh

               

               

          2.4 to configure hive Server configuration :  conf/hive-site.xml

              

              

 

    3. hive Client installation :

                3.0 Client needs and hadoo signal communication .  hadoop And hive Of jline Package versions need to be unified .
It can be downloaded from Hive Of lib Package will jline Of jar Package copy to hadoop Of lib position . Pay attention to the situation here : hive client , hadoop Server side . 

                3.1 Extract and configure environment variables .

                3.2 modify conf/hive-env.sh, newly added hdoop route . 

                3.3 to configure conf/hive-site.xml:

        

   5. hive Verification of data warehouse : 

                Server :       

                   

                    

            client :

                     

                   

                            

Technology