COS10022 Data Science Principles Assignment 2 Questions Semester 1 | SUT

Published: 13 May, 2025
Category: Assignment
Subject: Computer Science
University: Swinburne University of Technology
Module Title: COS10022 Data Science Principles
Assessment Title: Data Cleaning, Integration, and Analysis
Assessment Weighting: 20%
Due Date: Sunday, 25th May 2025 at 11.59 pm (AEDT)

Assessable Item: 

  • One (1) written report of no more than 10 pages, with the signed Assignment Cover Sheet.
  • One (1) zip file containing your KNIME workflow, the input file, the output file, and any intermediate files produced during workflow execution.
  • The submitted report must pass the Turnitin check on Canvas with no more than 30% similarity, except for the parts from the template or the short answers. 

The submitted report should answer all questions listed in the assignment task section in sequence. 

You must include a digitally signed Assignment Cover Sheet with your submission. Submitting the zip file containing the input/output files and your fully functional KNIME workflow is essential for your submission to be marked. If the submitted zip file cannot execute properly or the execution result differs from what you have in the report, you will not get the mark even if you put in the correct answer. 

Purpose of Assignment 

This assignment aims to evaluate students' achievement of the following unit learning outcomes: 

  • Appreciate (and explain) the key concepts, techniques, and tools for handling the data and producing analytic outcomes.
  • Experience data cleaning and integration for a data science project. 

This is an individual assignment. Refer to the Unit Outline for the late submission penalty policy. You can ignore the high similarity that appears on the cover page, the template wording, and the short answers. You must make sure your submitted report has a similarity lower than 30% in total and less than 6% from a single source. Otherwise, your report will not be marked. 

Key Lessons: 

You are asked to use the specified dataset and then build models in the KNIME analytics platform and explain your design concept. Two source datasets are provided, and you are expected to perform data cleaning and integration to create a combined dataset. Furthermore, you are expected to perform specified analysis with the produced dataset. KNIME must be used to find answers to all questions except the part asking you to observe the data. 
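The clean-then-integrate idea described above can be illustrated outside KNIME with a minimal Python sketch. It is only an illustration of the logic, not part of the required KNIME workflow. The sample rows are made up, and joining "Resident ID" to "Customer ID" is an assumption made for the example; the brief does not state which columns link the two files.

```python
import csv
import io

# Hypothetical sample rows (invented for illustration); the real inputs are
# 2025_a2_src_d1.csv and 2025_a2_src_d2.csv.
SRC_D1 = """Resident ID,Resident,Income
 r001 ,Alice,52000
R002,Bob,61000
R003,Cara,48000
"""
SRC_D2 = """Customer ID,Name,Item A
R001,Alice,1
r002,Bob,0
R004,Dan,1
"""


def normalise_key(raw):
    """Trim whitespace and upper-case an ID so equivalent keys compare equal."""
    return raw.strip().upper()


def load(text, key_column):
    """Read CSV text into a dict of rows keyed by the cleaned ID."""
    rows = {}
    for row in csv.DictReader(io.StringIO(text)):
        rows[normalise_key(row[key_column])] = row
    return rows


def integrate(left, right):
    """Inner-join two keyed tables and list keys present on only one side."""
    matched = {k: {**left[k], **right[k]} for k in left.keys() & right.keys()}
    unmatched = sorted(left.keys() ^ right.keys())
    return matched, unmatched


d1 = load(SRC_D1, "Resident ID")   # assumed join key for the first file
d2 = load(SRC_D2, "Customer ID")   # assumed join key for the second file
combined, unmatched = integrate(d1, d2)
print(sorted(combined))  # keys that joined after cleaning
print(unmatched)         # keys worth inspecting before trusting the join
```

Without the key clean-up, " r001 " and "r002" would fail to match, which is the kind of problem that tends to surface only at integration time.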

Introduction

Assignment Goal 

This assignment aims to give students experience in cleaning data before integrating multiple data sources into a combined dataset, and in explaining the outputs. A small discovery and research component is included to expand students' skill sets.

Hints and Supplement Materials 

  • Here is the list of nodes that require you to research how to use them. You may use them at least once in the workflow to answer all questions in this assignment:
  • Here is a YouTube video regarding using the Crosstab node for the Chi-Square Test in KNIME: YouTube
  • This assignment involves a large amount of data cleaning. Observation is the key to completing the assignment correctly. Pay close attention to the values the data contains. You may need to do more processing than the questions ask for to get the correct result.
  • The Chi-Square table is provided below:

[Image: Chi-Square table]
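As a rough guide to what a Chi-Square test on a cross-tabulation computes, here is a minimal Python sketch of the Pearson statistic and its degrees of freedom. The counts in `table` are invented for illustration, and `chi_square` is a hypothetical helper, not a KNIME function. The assignment itself still requires the result to come from KNIME.

```python
def chi_square(observed):
    """Return the Pearson chi-square statistic and degrees of freedom
    for a contingency table given as a list of rows of counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    total = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(observed):
        for j, obs in enumerate(row):
            # Expected count under independence of the two variables.
            expected = row_totals[i] * col_totals[j] / total
            stat += (obs - expected) ** 2 / expected

    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, dof


# Invented 2x2 counts, for illustration only.
table = [[10, 20], [30, 40]]
stat, dof = chi_square(table)
print(round(stat, 4), dof)  # → 0.7937 1
```

The statistic is then compared with the critical value from a Chi-Square table for the same degrees of freedom and the chosen significance level. For 1 degree of freedom at the 0.05 level the critical value is 3.841, so the invented counts here would not indicate a significant association.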

Assignment Task 

We have collected two data sets from different sources at different times. Your task is to clean the data sets according to the requirements and your observations. The source files are specified in the tasks. The report should be prepared using the template. A table of contents is not required. 

The data sources contain many details for each record. We aim to clean the data sets and then integrate them into one complete dataset before performing data analytics on it. Please follow the steps below to prepare your data after loading it into KNIME. When preparing the zip file, please put all input files under the same path as your KNIME workflow. This will make the marking easier for the tutors. You can keep growing the workflow across multiple steps or start an independent one, depending on what you need to complete the tasks. Your submitted workflow must reveal all results corresponding to all questions in the task sheet: 

  • Create a KNIME workflow to load the source file "2025_a2_src_d1.csv". Arrange the columns in the order "Resident ID," "Resident," "DoB," "Current Age," "Education," "Location," and "Income." Observe the content and perform data cleaning processes in KNIME. You may need to go back and forth in this process, as some abnormal cases will not be discovered until data integration.
  • Load the second source file, "2025_a2_src_d2.csv", in KNIME. Arrange the columns in the order "Customer ID," "Name," "Birthday," "Age," "Education Level," "City," "Purchase Date," "Shopping List," "Item List," "Item A," "Item B," "Item C," "Item D," "Item E," "Item F," "Item G," "Item H," "Item I," "Item J," "Item K," "Item L," "Item M," "Item N," "Item O," "Item P," "Item Q," "Item R," "Item S," "Item T," "Item U." Observe the content and perform data cleaning processes in KNIME. You may need to go back and forth in this process, as some abnormal cases will not be discovered until data integration. 
  • For both data sets, drop the unused columns according to the instructions in the assignment sheet and then integrate them into a complete record.
  • Perform association rule analysis on the part of the data that is suitable for it.
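To make the association rule step more concrete, the following Python sketch computes the three usual rule measures (support, confidence, lift) for a single rule. The transactions are invented and only borrow the "Item A", "Item B" and "Item C" column names. `support` and `rule_metrics` are hypothetical helpers written for this example, and the real analysis is expected to be done in KNIME.

```python
# Invented transactions: each set holds the items bought together.
transactions = [
    {"Item A", "Item B"},
    {"Item A", "Item C"},
    {"Item A", "Item B", "Item C"},
    {"Item B"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def rule_metrics(antecedent, consequent, transactions):
    """Support, confidence and lift for the rule antecedent -> consequent.

    Assumes both sides occur at least once; otherwise a division by zero occurs.
    """
    sup_both = support(set(antecedent) | set(consequent), transactions)
    confidence = sup_both / support(antecedent, transactions)
    lift = confidence / support(consequent, transactions)
    return sup_both, confidence, lift


sup, conf, lift = rule_metrics({"Item A"}, {"Item B"}, transactions)
print(round(sup, 4), round(conf, 4), round(lift, 4))  # → 0.5 0.6667 0.8889
```

A lift below 1, as in this toy data, means the two items appear together less often than they would if they were independent. Only columns that behave like per-transaction item indicators fit this kind of analysis, which is one reading of "the part of the data that is suitable".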
