MA12009 Data Analysis for Business Coursework 2 | UoB

Published: 08 Apr, 2025
Category Assignment Subject Computer Science
University University of Bath Module Title MA12009 Data Analysis for Business

Exploratory Data Analysis 

There are two workbooks available on Moodle:

1. “bristol_airbnb_listings.xlsx”

  • These data pertain to Airbnb listings in Bristol from the last few months of 2024 (last recorded on 25/12/2024).
  • The Excel file contains two worksheets:

a. “Data description”: This worksheet includes the legend and description of the variables collected.
b. “Bristol Airbnb Listings”: This worksheet contains the raw data (approximately 2,700 listings and 51 variables).

2. “spotify_songs.xlsx”

  • These data relate to the technical and artist descriptions of tracks stored in Spotify from various playlists.
  • The Excel file contains two worksheets:

a. “Data description”: This worksheet includes the legend and description of the variables collected.
b. “Spotify Songs”: This worksheet contains the raw data (approximately 30,000 tracks and 23 variables).

Choose only one of the two datasets and perform exploratory data analysis on the data of your choice. Your exploratory data analysis should include at least the following:

  • Graphical, tabular and / or numerical summaries for the core variables of interest that you find relevant. Discuss your findings.
  • Explore possible associations, either by graphical, tabular and / or numerical summaries, between the core variables of interest that you have found relevant. Discuss your findings.
  • Report a general section for conclusions and possible points of discussions for improving the analysis in the future. For example, are there any other variables that might be collected? Are there any weaker aspects in the analysis that might be strengthened in the future?

Get the Solution to this Assessment. Hire Experts to solve this assignment Before your Deadline

Order Non-Plagiarized Assignment

Probability and Statistical Inference Analysis 

1. Let 𝑋 denote the random variable “number_of_reviews” in a year for a randomly selected property from the “bristol_airbnb_listings.xlsx” file. [5/30]

a. What is the distribution of 𝑋? (I.e., choose between Binomial, Poisson, Uniform, Exponential, Normal, or none of the above.) Explain the reasons for your choice.
b. Suppose 𝑋 follows a Poisson distribution with mean πœ‡ = 52 reviews in a year.

  • What is the probability that a randomly selected property has more than 60 reviews?
  • What is the probability that a randomly selected property has at most 50 reviews?
  • What is the probability that a randomly selected property will have a number of reviews between 40 and 80?
  • Compare the probabilities in (i), (ii) and (iii) with the empirical probabilities (i.e., relative frequencies) of “number_of_reviews”. Are they comparable or are they different? Explain any differences you might see.

2. Let 𝑋 denote the random variable that count the number of super hosts out of the 2709 listings in the “bristol_airbnb_listings.xlsx” file. [5/30]

a. What is the distribution of 𝑋? (I.e., choose between Binomial, Poisson, Uniform, Exponential, Normal, no-one of the above.) Explain the reasons of your choice.

b. Suppose 𝑋 follows a Binomial distribution with probability 𝑝 calculated from the proportion of superhosts you have in the data set.

  • What is the probability that there are more than 350 superhosts?
  • What is the probability that there are less than 500 superhosts?
  • What is the average number of superhosts? What is the standard deviation of the number of superhosts?

3. Let 𝑋 denote the random variable “price” that describes the daily price of a randomly selected property in the “bristol_airbnb_listings.xlsx” file. [10/30]

a. What is the distribution of 𝑋? (I.e., choose between Binomial, Poisson, Uniform, Exponential, Normal, or none of the above.) Explain the reasons for your choice.

b. What is the distribution of 𝑋 (the average daily price of a property)? (I.e., choose between Binomial, Poisson, Uniform, Exponential, Normal, or none of the above.) Explain the reasons for your choice.

c. Suppose the data in “price” in “bristol_airbnb_listings.xlsx” is the population data. What are the values of the population mean and the population standard deviation for 𝑋?

d. Let 𝑋 be normally distributed with population mean πœ‡ and standard deviation 𝜎/√𝑛 (i.e., standard error), where the values of πœ‡ and 𝜎/√𝑛 are defined as in (c).

  • What is the probability that the average daily prices are above £250?
  • What is the probability that the average daily prices are between £150 and £276?
  • What is the minimum average price for the top 1% most expensive properties listed on Airbnb in Bristol?
  • What is the maximum average price for the top 1% less expensive properties listed on Airbnb in Bristol?

4. Now work on the “spotify_songs.xlsx”. 

  • Generate a simple random sample of size 𝑛 = 25 of the “danceability” of Spotify’s pop songs.
  • Calculate a 90%, 95% and 99% confidence interval for πœ‡, the average danceapop songbility of a pop-song on Spotify. Do you detect any differences in the confidence intervals? If yes, what are those differences?
  • From previous research studies in the musical sector, the average danceability of pop songs should be at least 0.60. It has been shown that the population standard deviation of danceability is 𝜎 = 0.10. Run a hypothesis test to challenge the null hypothesis 𝐻0: πœ‡ ≥ 0.60 using a significance level of 𝛼 = 0.05. What is your conclusion?
  • The hypothesis test and the confidence interval in (c) and (b), respectively, are reliable if and only if the population from which the sample is taken is normally distributed. Is it true in this case? Discuss.

Looking for expert guidance in MA12009 Data Analysis for Business? Worry no need! We are here to help you with assignments. Whether you need computer science assignment help or well-structured solutions, we will provide everything. You will get free coursework examples that will make your study material stronger. our expert team are providing all assignment services, you will get accurate, clear, and original content. Now, stop worrying about marks and complete your assignments hassle-free with expert support. So what's the delay? Get connected with us now and make your academic journey easy!

Workingment Unique Features

Hire Assignment Helper Today!


Latest Free Samples for University Students

RBP020L063H Leadership and Change Management Assignment Sample

Category: Assignment

Subject: Management

University: University of Roehampton

Module Title: RBP020L063H Leadership and Change Management

View Free Samples

HRMM080 Ethical and Responsible Leadership AS2 Reflective Portfolio Sample

Category: Assignment

Subject: Management

University: University of Northampton

Module Title: HRMM080 Ethical and Responsible Leadership

View Free Samples

ACAD1346 The child’s live Experience Developing Confidence Learners Assignment Sample

Category: Assignment

Subject: Education

University: University of Greenwich (UOG)

Module Title: ACAD1346 The child’s live Experience Developing Confidence Learners

View Free Samples

NUR7011 Developing Healthcare Leaders Assignment Sample | BPP

Category: Assignment

Subject: Nursing

University: BPP University

Module Title: NUR7011 Developing Healthcare Leaders

View Free Samples

Project Management, Leadership and Skills: Planning & Control Portfolio Example

Category: Assignment

Subject: Management

University: University of Salford Manchester

Module Title: Project Management, Leadership and Skills: Planning & Control

View Free Samples
Online Assignment Help in UK