In the self-learning material for PGP DE - Big Data Hadoop Spark Developer, section 6.27 is a non-gradable practice project.
Are there step-by-step instructions for working through this project? A set of step-by-step instructions would help me a lot in learning how to use the tool (namely, Hive). The same goes for the other practice projects.
For Project #3 in the Assessment, Stock Exchange Data Analysis, the steps to perform include: 1) Create a data pipeline using Sqoop to pull the data from the table below from the MySQL server into Hive.
My question is: are we supposed to Sqoop the data from MySQL into HDFS and then into Hive, or Sqoop the data directly from MySQL into Hive?
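
To make the two options concrete, here is a rough sketch in Python that only assembles the command lines and prints them; it does not run anything. The JDBC URL, user name, table names, and HDFS path are all made-up placeholders, not values from the project brief. The Sqoop flags used (`--hive-import`, `--hive-table`, `--target-dir`, `--fields-terminated-by`, `-m`, `-P`) are standard `sqoop import` options. As far as I understand, `--hive-import` still stages the data in HDFS before loading it into Hive, so option A is effectively option B done in one job.

```python
import shlex

# Hypothetical placeholders; replace with the values from the project brief.
JDBC_URL = "jdbc:mysql://mysql-host:3306/stockdb"
DB_USER = "student"
MYSQL_TABLE = "stock_prices"


def base_import():
    """Arguments shared by both options."""
    return [
        "sqoop", "import",
        "--connect", JDBC_URL,
        "--username", DB_USER,
        "-P",                    # prompt for the password interactively
        "--table", MYSQL_TABLE,
        "-m", "1",               # single mapper, so no split column is needed
    ]


def sqoop_direct_to_hive(hive_table):
    """Option A: one Sqoop job that ends with the data in a Hive table."""
    return base_import() + ["--hive-import", "--hive-table", hive_table]


def sqoop_to_hdfs(target_dir):
    """Option B, step 1: import into a plain HDFS directory."""
    return base_import() + [
        "--target-dir", target_dir,
        "--fields-terminated-by", ",",
    ]


def hive_load_statement(target_dir, hive_table):
    """Option B, step 2: HiveQL that moves the HDFS files into an existing table."""
    return f"LOAD DATA INPATH '{target_dir}' INTO TABLE {hive_table};"


if __name__ == "__main__":
    print("Option A:")
    print(shlex.join(sqoop_direct_to_hive("stocks.prices")))
    print("Option B:")
    print(shlex.join(sqoop_to_hdfs("/user/student/stock_prices")))
    print(hive_load_statement("/user/student/stock_prices", "stocks.prices"))
```

For option B, the Hive table would have to exist already, with a comma field delimiter matching the import, before the `LOAD DATA` statement is run.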