S222 HIT140 FOUNDATIONS OF DATA SCIENCE
Assignment Content
1.
About
This assignment should be completed in a group of 4 students.
This is the first of a two-part assessment, the second part being the “Data science group project 2”. Since professional projects often require that you work together with others, the assessments must be completed as a team.
The current assignment evaluates your ability to:
· utilise essential data wrangling techniques (Learning Outcome 2)
· perform descriptive and inferential statistical analyses (Learning Outcome 3)
In addition, it seeks to develop the following attributes:
· ability to work effectively in a team
· professionalism
· ethical conduct
2.
The Case Study
The district of Fairfield conducted an environmental study on freshwater reservoirs in its region. These include lakes, creeks, and public ponds. The study was instigated by recent concerns voiced by a local environmental protection group that fish in these reservoirs may have been so contaminated by mercury that they are no longer safe for human consumption.
Mercury is a toxic metal that occurs naturally in the environment. At times, however, human activities may result in unnatural releases of mercury into water bodies, which could in turn enter fish. Consuming mercury-contaminated fish can lead to severe neurological and physiological disorders in humans.
Fairfield’s officials identified 943 water reservoirs (including natural lakes) that have significant fisheries and are relatively accessible, based on information found in a previous survey carried out a decade ago. Of these, 142 reservoirs were selected for the current study using simple random sampling. Fish samples were then collected from only the 122 selected reservoirs that contained a targeted group of predator fish species of interest to the researchers. The researchers used certain criteria to decide which fish species to target.
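To make the sampling design concrete, the following is a minimal Python sketch of simple random sampling without replacement, using only the counts stated in the case study (943 identified, 142 selected, 122 with targeted fish). The integer reservoir IDs, the function name `draw_simple_random_sample` and the seed value are assumptions made for illustration; the brief does not say how the officials actually drew their sample.

```python
import random

TOTAL_RESERVOIRS = 943   # reservoirs identified from the earlier survey
SAMPLE_SIZE = 142        # reservoirs selected for the current study
WITH_TARGET_FISH = 122   # selected reservoirs that contained the targeted fish


def draw_simple_random_sample(population_size, sample_size, seed=None):
    """Draw reservoir IDs without replacement; every reservoir is equally likely.

    The IDs 1..population_size are hypothetical labels, not names from the dataset.
    """
    rng = random.Random(seed)  # a seed makes the draw reproducible
    ids = range(1, population_size + 1)
    return sorted(rng.sample(ids, sample_size))


# Seed chosen arbitrarily for this illustration.
selected = draw_simple_random_sample(TOTAL_RESERVOIRS, SAMPLE_SIZE, seed=140)

sampling_fraction = SAMPLE_SIZE / TOTAL_RESERVOIRS
yield_rate = WITH_TARGET_FISH / SAMPLE_SIZE

print(f"Selected {len(selected)} of {TOTAL_RESERVOIRS} reservoirs")
print(f"Sampling fraction: {sampling_fraction:.1%}")
print(f"Share of selected reservoirs with targeted fish: {yield_rate:.1%}")
```

Under these figures, roughly 15% of the identified reservoirs were selected, and about 86% of those selected supplied fish samples. The 20 selected reservoirs without targeted fish are therefore absent from the dataset, which is worth bearing in mind when generalising results to all 943 reservoirs.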
Fish were collected by angling, gill nets, trap nets, dip nets or beach seines. Up to 5 fish were obtained, chosen according to a hierarchical order of preferred predator species. Care was taken to keep fish clean and free of contamination. In the laboratory, the fillet (muscle) of each fish was extracted, and the fillets from each reservoir were ground up, combined and homogenised. The tissue was then subsampled to analyse the mercury levels.
In addition to collecting fish samples, the officials examined other possible factors that could contribute to elevated mercury levels in fish. They reckoned that this information could be useful for policy-making by members of the Fairfield legislature.
Following completion of the field study, you were handed a dataset containing 122 records of the studied reservoirs. Each record is described by the following variables:
· Reservoir: name of the reservoir
· Fish: number of fish sampled