EEB330 – Precept 08: Stats in Python

Author: Michelle White

Date: November 7, 2024

Output: html

GitHub Assignment Link

All exercises are to be completed in Python
Due 1 week from today

Exercise 1: Hypothesis Testing with `scipy`

Example from geeksforgeeks.org.

Instructions:

Assess the homogeneity of variance in the sample datasets
Determine if the sample data is approximately normally distributed
Make sure there are no significant outliers in the sample data
Perform a two-sample t-test on the sample data and report the p-value

Deliverables:

Sufficient evidence that the assumptions of the t-test have been considered
A t-test p-value and corresponding interpretation of the p-value
Responses to the following questions: What is the difference between a two-sample versus paired t-test? Which scipy functions would you use for each?

# import the necessary libraries
import numpy as np
import scipy.stats as stats

# create sample data
data_group1 = np.array([14, 15, 15, 16, 13, 8, 14,
                        17, 16, 14, 19, 20, 21, 15,
                        15, 16, 16, 13, 14, 12])

data_group2 = np.array([15, 17, 14, 17, 14, 8, 12,
                        19, 19, 14, 17, 22, 24, 16,
                        13, 16, 13, 18, 15, 13])

# Your code here!

Exercise 2: Fitting Curves with `scipy`

Example from geeksforgeeks.org.

Instructions:

Create a function sine_fit that accepts an independent variable (your data), the amplitude, and the phase shift of a sine wave and returns the resulting y-value
Call scipy's curve_fit to estimate the parameters of your sample data (Hint: the first argument is your callable sine_fit function)
Plot the sample data in red and overlay the fitted curve as a dashed blue line

Deliverables:

A plot of the sample data overlaid with the fitted curve
Responses to the following questions: What is the purpose of using np.random.normal to create the sample data? How does knowing the sample data takes the form of a sine wave help inform the way you define sine_fit?

# import the necessary libraries
import numpy as np
import matplotlib.pyplot as plt
from scipy.optimize import curve_fit

# create sample data
x = np.linspace(0, 10, num = 40)
y = 3.45 * np.sin(1.334 * x) + np.random.normal(size = 40)

# Your code here!

Exercise 3: Sensitivity Analysis with `sensitivity`

Example from the Sensitivity Analysis Documentation.

Instructions:

Create a Python dictionary where x1 and x2 are the keys
Use SensitivityAnalyzer and my_model to obtain a DataFrame and hexbin plot of the results
Add a third key x3 with values [5, 10, 15] to your dictionary
Analyze and plot the pairwise sensitivities of all three variables using my_model2

Deliverables:

Three hexbin plots of the pairwise sensitivities among x1, x2, and x3 based on my_model2
Responses to the following questions: What do my_model and my_model2 represent in this example? How is the sensitivity between x1 and x2 different from the sensitvity between x2 and x3 for the given my_model2?

# import the necessary libraries
import pandas as pd
from sensitivity import SensitivityAnalyzer

# define some functions for known models
def my_model(x1, x2):
    return x1 ** x2

def my_model2(x1, x2, x3):
    return x1 * x2 ** x3

# create sample data
x1_vals = [10, 20, 30]
x2_vals = [1, 2, 3]

EEB330 – Precept 08: Stats in Python

Author: Michelle White

Date: November 7, 2024

Output: html

GitHub Assignment Link

Exercise 1: Hypothesis Testing with scipy

Example from geeksforgeeks.org.

Instructions:

Deliverables:

Exercise 2: Fitting Curves with scipy

Example from geeksforgeeks.org.

Instructions:

Deliverables:

Exercise 3: Sensitivity Analysis with sensitivity

Example from the Sensitivity Analysis Documentation.

Instructions:

Deliverables:

Exercise 1: Hypothesis Testing with `scipy`

Exercise 2: Fitting Curves with `scipy`

Exercise 3: Sensitivity Analysis with `sensitivity`