CB0494 Introduction to Data Science and AI Assignment Questions | NTU

Published: 04 Apr, 2025
Category: Assignment
Subject: Computer Science
University: Nanyang Technological University
Module Title: CB0494 Introduction to Data Science and AI

Assignment Brief

1. Objectives

  • To relate potential real-life problems to data science.
  • To apply what has been taught and learnt in CB0494 to a real dataset and perform data analysis. The workflow is very important: it includes why your team uses certain tools and how these tools help in your analysis.
  • To conclude with potential findings and predictions as part of the solution to the problem.

This mini project is meant to demonstrate your understanding of the tutorials and the data science content from your lecture slides, and your competence in basic data science analysis. The focus remains on the thought process and each team’s creativity in bringing the best out of the dataset, using the tools taught in the course, for the problem statement chosen by the team.

2. Assessment Criteria

(Reminder: Overall weightage is 30%.)

(a) Project Content and Analysis:

Relating problem with data science: Marks
Relevant use of visualizations: Marks
Relevant use of machine learning techniques (taught in the course): Marks
Data science and programming good practices and clarity in conclusions: Marks
Team work: Marks

(b) Project Report:

Organization: Marks
Clarity: Marks

(c) Presentation:

Organization: Marks
Clarity: Marks

Dataset 2: Regarding data on lung diseases (From Kaggle):

URL: https://www.kaggle.com/datasets/samikshadalvi/lungs-diseases-dataset

Dataset: lung_disease_data.csv

Data Description: Please refer to the URL above for information on the data fields.

Upon selecting your dataset, explain what real problem your team is going to solve and relate it to the data science questions. Study the dataset and select the appropriate data for your analysis in the Jupyter notebook. Each team should select suitable tools taught in class to analyse the data, make sense of the analysis in relation to its stated problem statement, and conclude its findings with some prediction.

The analysis should follow the thought process taught in class. Each team needs to provide evidence and justification for its choices, for example why certain predictors are selected, and should choose predictors that give the best goodness-of-fit score possible.
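As one way to back up a predictor choice with evidence, a team might compare the outcome rate across the levels of a candidate predictor. The sketch below is illustrative only: the column names `Smoking Status` and `Recovered`, the helper `recovery_rate_by_group`, and the inline rows are assumptions made up for this example, not values taken from lung_disease_data.csv. Check the Kaggle page for the actual fields.

```python
import csv
import io
from collections import defaultdict

# Made-up rows standing in for the real CSV (hypothetical columns and values).
SAMPLE_CSV = """Smoking Status,Recovered
Yes,No
Yes,No
Yes,Yes
No,Yes
No,Yes
No,No
No,Yes
,Yes
"""


def recovery_rate_by_group(csv_file, predictor, target, positive="Yes"):
    """Return {level: share of rows whose target equals `positive`}.

    Rows with a missing predictor or target value are skipped.
    """
    counts = defaultdict(lambda: [0, 0])  # level -> [positives, total]
    for row in csv.DictReader(csv_file):
        level = (row[predictor] or "").strip()
        outcome = (row[target] or "").strip()
        if not level or not outcome:
            continue
        counts[level][1] += 1
        if outcome == positive:
            counts[level][0] += 1
    return {level: pos / total for level, (pos, total) in counts.items()}


rates = recovery_rate_by_group(
    io.StringIO(SAMPLE_CSV), "Smoking Status", "Recovered"
)
for level, rate in sorted(rates.items()):
    print(f"{level}: {rate:.2f}")
```

A large gap between groups suggests the predictor carries information about the outcome, while a negligible gap is a reason to question including it. In the notebook, the same comparison can be made with whichever tools were taught in class.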

A team does not need to be “obsessed” with pushing the goodness-of-fit score to an exceptionally high value; in some cases that may be impossible because of the quality of the dataset, which is beyond the team’s control. What still matters is the thought process: how each team tries its best to analyse the data with the tools taught and thereby obtains a justified goodness-of-fit score.
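The brief does not say which goodness-of-fit measure to report. For a binary outcome such as recovered versus not recovered, one common choice (an assumption here, not a course requirement) is classification accuracy together with the true positive and false positive rates from a confusion matrix. A minimal sketch with made-up labels and hypothetical helper names:

```python
def confusion_counts(actual, predicted, positive=1):
    """Count true/false positives and negatives for a binary classifier."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    tp = fp = tn = fn = 0
    for a, p in zip(actual, predicted):
        if p == positive:
            if a == positive:
                tp += 1
            else:
                fp += 1
        elif a == positive:
            fn += 1
        else:
            tn += 1
    return {"tp": tp, "fp": fp, "tn": tn, "fn": fn}


def classification_scores(actual, predicted, positive=1):
    """Return accuracy, true positive rate and false positive rate."""
    c = confusion_counts(actual, predicted, positive)
    total = c["tp"] + c["fp"] + c["tn"] + c["fn"]
    pos = c["tp"] + c["fn"]  # rows that are actually positive
    neg = c["tn"] + c["fp"]  # rows that are actually negative
    nan = float("nan")
    return {
        "accuracy": (c["tp"] + c["tn"]) / total if total else nan,
        "true_positive_rate": c["tp"] / pos if pos else nan,
        "false_positive_rate": c["fp"] / neg if neg else nan,
    }


# Hypothetical labels: 1 = recovered, 0 = not recovered.
actual = [1, 1, 1, 0, 0, 0, 1, 0]
predicted = [1, 0, 1, 0, 1, 0, 1, 0]
scores = classification_scores(actual, predicted)
for name, value in scores.items():
    print(f"{name}: {value:.2f}")
```

Reporting these scores on both the train and test sets, alongside the reasoning behind each modelling choice, is one way to show that a score is justified rather than simply high.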

Each team can identify the limitations of its work and list recommendations for future work. The recommendations may suggest tools not taught in class, with explanations.

Please do not blindly follow the steps without justification and reasons. Doing so will certainly mean an F grade, as it does not demonstrate any understanding of the class and hence fails to apply the tools taught for data analysis. Marks will be deducted for using tools incorrectly or in an unsuitable way.

Each team must first convince itself that the thought process and result of its mini project are realistic, reasonable and meaningful before it can convince me of that.

Please organise your code and add suitable comments in your mini project Jupyter notebook file before submission so that it is readable and easily understood. There is no limit on the length of your Jupyter notebook file for your mini project.

Problem Statement:

Lung diseases remain a significant public health concern in Singapore, particularly among the elderly and chronically ill populations. While access to healthcare is robust, patient recovery outcomes still vary due to multiple clinical and demographic factors. This project aims to develop a predictive model to classify whether a patient is likely to recover from lung disease based on their medical and demographic information. By identifying patients at risk of non-recovery early, healthcare providers can tailor interventions, reduce complications, and improve overall treatment efficiency.
