
It contains 1338 rows of data and the following columns: age, gender, BMI, children, smoker, region and insurance charges. The data contains medical information and costs billed by health insurance companies. This dataset was inspired by the book Machine Learning with R by Brett Lantz. The dataset includes the fish species, weight, length, height and width. Fish market dataset for regressionīuilt for multiple linear regression and multivariate analysis, the Fish Market Dataset contains information about common fish species in market sales.

CDC data: nutrition, physical activity, obesityįrom the Behavioral Risk Factor Surveillance System at the CDC, this dataset includes information about physical activity, weight and average adult diet. Along with the dataset, the author includes a full walkthrough on how they sourced and prepared the data, their exploratory analysis, model selection, diagnostics and interpretation. This dataset includes data taken from about deaths due to cancer in the United States. Linear regression datasets for machine learning

Additionally, some of the datasets on this list include sample regression tasks for you to complete with the data. For those of you looking to learn more about the topic or complete some sample assignments, this article will introduce open linear regression datasets you can download today.

Every data scientist will likely have to perform linear regression tasks and predictive modeling processes at some point in their studies or career.
