I'm a bit stumped on project 1, the one with the Portuguese bank. I don't think the training materials ever covered how to do some of the required operations.

I start off by loading the file into Spark:

    val input = sc.textFile("/user/vshideler_gmail/project_1_data.csv")

Running input.count() shows that it is reading the file properly. Then I create a case class named Bank to define the schema:

    case class Bank(age: Int, job: String, marital: String, education: String,
                    default: String, balance: Int, housing: String, loan: String,
                    contact: String, day: String, month: String, duration: Int,
                    campaign: Int, pdays: String, previous: Int, poutcome: String,
                    y: String)

Then I split each line on semicolons:

    val input_split = input.map(line => line.split(";"))

And create an RDD from the data:

    val bankrdd = input_split.map(x => Bank(x(0).toInt, x(1), x(2), x(3), x(4),
      x(5).toInt, x(6), x(7), x(8), x(9), x(10), x(11).toInt, x(12).toInt,
      x(13), x(14).toInt, x(15), x(16)))

Then create a data frame:

    val bankDF = bankrdd.toDF()

So far, so good. Now the challenging part begins. I need to find the marketing success rate. By this, I assume that I need to find the "success" entries under "poutcome" and compare them to the total number of entries. So I filter the successes, assign them to a data frame, and divide the row counts of the two data frames:

    val success = bankDF.filter($"poutcome" === "success")
    val successDF = success.toDF()
    val k = bankDF.count()
    val z = successDF.count()
    val x = k/z

I'm hoping that this is correct. I don't know if you can verify it, since it has to do with the project answers, but I'm hoping you can at least confirm that the format is correct.

The next problem is to figure out the "maximum, mean, and minimum age of average targeted customer" and also "check...average balance, median balance of customers." This is something I don't know how to do, and I'm pretty sure we were never taught it.
I know that Spark includes functions for means, averages, medians, etc., but I'm not sure how to use them. Please assist.
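
For reference, here is a minimal sketch of one way these figures might be computed with the DataFrame API. It assumes Spark 2.0 or later (for SparkSession and stat.approxQuantile). The object name BankStats and the few in-memory rows are hypothetical stand-ins for the real bankDF, not project data or answers. Note that a success rate is normally the number of successes divided by the total, and that dividing two Long counts truncates to an integer unless one side is converted to Double.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.{avg, max, min}

// Hypothetical standalone wrapper; in spark-shell only the body is needed.
object BankStats {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("BankStats")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Made-up sample rows standing in for the real bankDF (assumption).
    val bankDF = Seq(
      (30, 1787, "unknown"),
      (33, 4789, "failure"),
      (35, 1350, "success"),
      (59, 0, "unknown")
    ).toDF("age", "balance", "poutcome")

    // Success rate: successes over total, using floating-point division.
    val total = bankDF.count()
    val successes = bankDF.filter($"poutcome" === "success").count()
    val successRate = successes.toDouble / total
    println(f"Success rate: $successRate%.4f")

    // Maximum, mean, and minimum age in a single aggregation.
    bankDF.agg(max("age"), avg("age"), min("age")).show()

    // Average balance.
    bankDF.agg(avg("balance")).show()

    // Median balance: the 0.5 quantile; a relative error of 0.0 asks for
    // an exact result, which can be expensive on large data.
    val medianBalance =
      bankDF.stat.approxQuantile("balance", Array(0.5), 0.0).head
    println(s"Median balance: $medianBalance")

    spark.stop()
  }
}
```

On the real data, the same agg and approxQuantile calls would be applied to the bankDF built from the CSV file. On a Spark 1.x installation, approxQuantile is not available, so the median would need a different approach.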