Machine Learning Advanced Certification||Aug 29,30 Sep 5,6,12,13,19,20,26,27 Oct 3||Vaishali

Discussion in 'Big Data and Analytics' started by Abhishek_Tripathy, Aug 30, 2020.

  1. Abhishek_Tripathy

    Abhishek_Tripathy Moderator
    Staff Member Simplilearn Support Alumni

    Joined:
    Sep 24, 2018
    Messages:
    29
    Likes Received:
    0
  2. Vaishali_26

    Vaishali_26 Active Member

    Joined:
    Sep 12, 2019
    Messages:
    45
    Likes Received:
    2
    Day 2 datasets for practicing Pandas
     

    Attached Files:

    #2
  3. Ravneet Kaur Nagpal

    Ravneet Kaur Nagpal Active Member

    Joined:
    Jul 31, 2020
    Messages:
    15
    Likes Received:
    5
    Hi Vaishali,

    Can you please provide steps to install python Jupyter noebook on local machine? I have installed anaconda but not able to proceed further.
     
    #3
  4. OMAR FAISAL

    OMAR FAISAL Customer
    Staff Member Customer

    Joined:
    May 27, 2020
    Messages:
    16
    Likes Received:
    19
    #4
  5. Ogunsola Ayodeji samuel

    Joined:
    Nov 18, 2019
    Messages:
    3
    Likes Received:
    0
    Hi could you help out with my project on
    MERCEDES-BENZ GREENER MANUFACTURING IN ML
     
    #5
  6. Sandip Sankar Banerjee

    Joined:
    May 8, 2020
    Messages:
    3
    Likes Received:
    0
    Hi Vaishali,

    I have some queries on the practice project -SFO Public Department. Could you please guide me on the below queries-

    1)When I imported the dataset, I observed that the data type of 'Year' column became 'Integer'. How will I convert it to date format-"YYYY".
    2)Another question is -"How much total salary cost has increased
     
    #6
  7. Vaishali_26

    Vaishali_26 Active Member

    Joined:
    Sep 12, 2019
    Messages:
    45
    Likes Received:
    2
    Hi Ravneet,

    Jupyter notebook comes as part of the anaconda package.

    Please refer the below link to get to know the different ways in which Jupyter notebook can be accessed.
    https://pythonforundergradengineers.com/opening-a-jupyter-notebook-on-windows.html
     
    #7
  8. Vaishali_26

    Vaishali_26 Active Member

    Joined:
    Sep 12, 2019
    Messages:
    45
    Likes Received:
    2
    Hi Ogunsola,

    We will be discussing about projects in one of our Live classroom sessions.
     
    #8
  9. Vaishali_26

    Vaishali_26 Active Member

    Joined:
    Sep 12, 2019
    Messages:
    45
    Likes Received:
    2
    Hi Sandeep,
    Please let me know the name of the lesson in which this project is present and the module number.
     
    #9
  10. Prashant Garg_2

    Joined:
    May 12, 2020
    Messages:
    3
    Likes Received:
    0
    how to deal with highly imbalance data?
     
    #10
  11. Sandip Sankar Banerjee

    Joined:
    May 8, 2020
    Messages:
    3
    Likes Received:
    0
    Hi Vaishali

    The lesson number is 3.22 under Machine Learning module.

    Regards
    Sandip
     
    #11
  12. Support Simplilearn(4685)

    Staff Member Alumni

    Joined:
    Feb 11, 2010
    Messages:
    247
    Likes Received:
    27
    #12
  13. Shantanu_33

    Shantanu_33 Member

    Joined:
    Jun 30, 2020
    Messages:
    2
    Likes Received:
    0
    How to convert a numpy array into 1267 rows and 6 columns class ??
     
    #13
  14. Ravneet Kaur Nagpal

    Ravneet Kaur Nagpal Active Member

    Joined:
    Jul 31, 2020
    Messages:
    15
    Likes Received:
    5
    Hi Vaishali,

    I'm stuck with practice project - Ensemble (9.16)
    Mtcars, an automobile company in Chambersburg, United States, has recorded the production of its cars within a dataset. In order to classify cars, the company has come up with two classification models (KNN and Logistic Regression).

    Objective: Perform a model selection between the two models above using the sampling technique as 10-fold cross-validation.

    I'm not able to get input(X) and output (y) variable for this dataset. Can you please help me with this.

    Dataset attached
     

    Attached Files:

    #14
  15. Shaoni Chakravarthy

    Joined:
    Jun 22, 2020
    Messages:
    3
    Likes Received:
    0
    Hi Vaishali,

    In SVM, when we use higher dimensionality , how we get the data point co-ordinates for that dimension?
    like - if we have a 2D data set, and we are imposing higher dimensionality to it. Now in 3D, where from we will get the coordinates for Z axis?
     
    #15
  16. Nirmal Chandra Dash

    Joined:
    Nov 22, 2019
    Messages:
    6
    Likes Received:
    0
    Hi Vaishali,

    I have doubt on correlation. I am working with horse data set where multiple features are there. When i apply correlation function as well as plot them in heatmap, i am not able to find any insight. Can you please guide me how to deal with that..
     
    #16
  17. Hitesh H S

    Hitesh H S Active Member
    Staff Member Simplilearn Support

    Joined:
    May 27, 2020
    Messages:
    35
    Likes Received:
    9
    Hi Shantanu_33,

    Please find the below example for your reference.

    import numpy as np
    import pandas as pd

    # Creating a 2 dimensional numpy array
    data = np.array([[5.8,2.8],[6.0,2.2]])
    print(data) data
    array([[5.8,2.8],[6.,2.2]])

    # Creating pandas dataframe from numpy array
    dataset = pd.DataFrame({'Column1': data[:,0],'Column2': data[:,1]})
    print(dataset)
    Column1 Column2
    0 5.8 2.8
    1 6.0 2.2

    or you can even use this

    a = np.array([[1, 2], [3, 4]])
    a
    array([[1, 2],
    [3, 4]])

    a.transpose()
    array([[1, 3],
    [2, 4]])

    a.transpose((1, 0))
    array([[1, 3],
    [2, 4]])

    a.transpose(1, 0)
    array([[1, 3],
    [2, 4]])

    I hope this helps you.

    Happy learning !!
     
    #17
  18. Ravneet Kaur Nagpal

    Ravneet Kaur Nagpal Active Member

    Joined:
    Jul 31, 2020
    Messages:
    15
    Likes Received:
    5
    Hi Vaishali Mam,

    I was working on Demo project 9.19 (Ensemble Learning)
    I have tried using XGBoost in this case study. Can you please have a look at the code and check if its correct.

    Attaching code file and dataset.
     

    Attached Files:

    #18
  19. Surya Chaturvedula

    Simplilearn Support

    Joined:
    Jun 21, 2020
    Messages:
    2
    Likes Received:
    0
    Can you please help me fixing this Error while importing xgboost:
    ---------------------------------------------------------------------------
    ModuleNotFoundError Traceback (most recent call last)
    <ipython-input-11-e9ddc4c00522> in <module>
    ----> 1import xgboost as xgb
    2 from sklearn.metrics import r2_score
    3 from sklearn.model_selection import train_test_split

    ModuleNotFoundError: No module named 'xgboost'


    Installed: using command ::: conda install -c anaconda py-xgboost
    (base) C:\Users\surya>conda install -c anaconda py-xgboost
    Collecting package metadata (current_repodata.json): done
    Solving environment: failed with initial frozen solve. Retrying with flexible solve.
    Solving environment: failed with repodata from current_repodata.json, will retry with next repodata source.
    Collecting package metadata (repodata.json): done
    Solving environment: failed with initial frozen solve. Retrying with flexible solve.
    Solving environment: \
    Found conflicts! Looking for incompatible packages.
    This can take several minutes. Press CTRL-C to abort.
    failed
    UnsatisfiableError: The following specifications were found
    to be incompatible with the existing python installation in your environment:
    Specifications:
    - py-xgboost -> python[version='>=2.7,<2.8.0a0|>=3.6,<3.7.0a0|>=3.7,<3.8.0a0|>=3.5,<3.6.0a0']
    Your python: python=3.8
    If python is on the left-most side of the chain, that's the version you've asked for.
    When python appears to the right, that indicates that the thing on the left is somehow
    not available for the python version you are constrained to. Note that conda will not
    change your python version to a different minor version unless you explicitly specify
    that.
    (base) C:\Users\surya>conda install -c anaconda py-xgboost
     
    #19
  20. Support Simplilearn(4685)

    Staff Member Alumni

    Joined:
    Feb 11, 2010
    Messages:
    247
    Likes Received:
    27
    Hi Ravneet,

    We request you to kindly submit the practice project as well for evaluation from your LMS. The concerned team will evaluate it.

    Thank You
     
    #20
  21. Raghavendra B M

    Raghavendra B M Well-Known Member
    Staff Member Simplilearn Support

    Joined:
    Jan 6, 2020
    Messages:
    64
    Likes Received:
    36
    Hi Surya,

    This issue happens because the environment is not set up properly for the installation.

    Kindly follow the below steps:
    conda create --name myenv
    conda activate myenv

    And then run : conda install -c anaconda py-xgboost

    This should fix the issue.

    Regards,
    Team Simplilearn
     
    #21
  22. Shaoni Chakravarthy

    Joined:
    Jun 22, 2020
    Messages:
    3
    Likes Received:
    0
    Hi Vaishali/Support Team,

    I was trying with the practice project on supervised learning.
    4.28 Practice Project: Health Insurance Cost. I did not find the data set in the problem . Please check if the data set is missing from simplilearn end. If its not missing then how to proceed as there is no data.
     
    #22
  23. Support Simplilearn(4685)

    Staff Member Alumni

    Joined:
    Feb 11, 2010
    Messages:
    247
    Likes Received:
    27
    Hi Shaoni,

    The dataset is available kindly click on the arrow mark and download the folder you will find "insurance2.csv" file in side the folder.

    Thank You
     

    Attached Files:

    #23

Share This Page