Разделы презентаций


Статистическая обработка данных

Содержание

Methods SectionFrom JAMA (impact factor - 47.661):In the Methods section, describe statistical methods with enough detail to enable a knowledgeable reader with access to the original data to reproduce the reported

Слайды и текст этой презентации

Слайд 1Статистическая обработка данных
Prepared by Artur Galimov M.D.

Статистическая обработка данныхPrepared by Artur Galimov M.D.

Слайд 2Methods Section
From JAMA (impact factor - 47.661):

In the Methods section,

describe statistical methods with enough detail to enable a knowledgeable

reader with access to the original data to reproduce the reported results.
Methods SectionFrom JAMA (impact factor - 47.661):In the Methods section, describe statistical methods with enough detail to

Слайд 3Study Designs in Medical Research

Study Designs in Medical Research

Слайд 4Distinguishing Between Study Designs

Distinguishing Between Study Designs

Слайд 5Common types of experiments

Common types of experiments

Слайд 6Experiment
Introduce a treatment to observe its effects
Might not involve randomization
Might

not even have a control group

ExperimentIntroduce a treatment to observe its effectsMight not involve randomizationMight not even have a control group

Слайд 7Randomized Experiment
The gold standard for demonstrating causality
Units (people, animals, groups,

etc.) are randomly assigned to receive either treatment or control.
If

the sample is large enough, we can assume that on the average, everything else about the two groups is similar because the two groups were randomly selected.
So any difference between the two groups after the experiment must be due to the treatment.

Randomized ExperimentThe gold standard for demonstrating causalityUnits (people, animals, groups, etc.) are randomly assigned to receive either

Слайд 8Quasi-experiment
There is a control group, but no random assignment to

treatment vs. control
Usually happens because it’s impossible or unethical to

do random assignment
Assignment to conditions occurs by self-selection (some people choose to smoke or exercise or join a program)
Example: effects of a new health media campaign that’s introduced in one community but not others
The main problem is that the groups are different in other ways (people who become smokers have different demographics and genetics)
Quasi-experimentThere is a control group, but no random assignment to treatment vs. controlUsually happens because it’s impossible

Слайд 9Natural experiment
(Not exactly an experiment because the experimenter didn’t manipulate

the cause, but the cause occurred)
Compare a group that experienced

a cause with a group that didn’t
(Or compare the same group before and after the cause)
Examples: effect of a natural disaster, effect of a policy change
Natural experiment(Not exactly an experiment because the experimenter didn’t manipulate the cause, but the cause occurred)Compare a

Слайд 10Correlational study
Nonexperimental because nothing is manipulated
Measure some variables and see

if there’s a mathematical relationship between them
Results can be consistent

with causality, but they can’t prove causality

Correlational studyNonexperimental because nothing is manipulatedMeasure some variables and see if there’s a mathematical relationship between themResults

Слайд 11Even randomized experiments aren’t perfect
Experimental conditions are usually artificial
They’re

conducted in one particular time and place – might not

generalize to other times or places
But we usually want to generalize the findings to other times and places

Cronbach: we usually want to generalize to other UTOS – units, treatments, observations (outcomes), and settings
Even randomized experiments aren’t perfect Experimental conditions are usually artificialThey’re conducted in one particular time and place

Слайд 12Populations

Populations

Слайд 13Population vs. Sample
Population
Described using population parameters
Usually represented by Greek letters
Sample
Described

using sample
statistics
Usually represented by Roman letters

Population vs. SamplePopulationDescribed using population parametersUsually represented by Greek lettersSampleDescribed using sample statisticsUsually represented by Roman letters

Слайд 14Types of Data (Variables)
Categorical
Nominal
-mutually exclusive

-no natural order
(qualitative)
Ordinal
-mutually exclusive

-ordered

Numeric
Discrete
-countable
-ordered
-integer value
-magnitude

of value important
Continuous
-countable

-takes any value

-magnitude of value important
Dichotomous

Types of Data (Variables)Categorical Nominal-mutually exclusive-no natural order(qualitative)Ordinal-mutually exclusive-orderedNumericDiscrete-countable-ordered-integer value-magnitude of value importantContinuous-countable-takes any value-magnitude of value

Слайд 15Types of Data (Variables)
Categorical
Nominal
-mutually exclusive

-no natural order
(qualitative)
Ordinal
-mutually exclusive

-ordered

Continuous
-countable

-takes any

value

-magnitude of value important
Dichotomous

Types of Data (Variables)Categorical Nominal-mutually exclusive-no natural order(qualitative)Ordinal-mutually exclusive-orderedContinuous-countable-takes any value-magnitude of value importantDichotomous

Слайд 16Histograms
Know how to interpret a histogram, i.e., normal, skewed left

(left tail), skewed right (right tail), and most importantly, infer

from it the appropriate descriptive statistics and analytical method, e.g., mean vs median, parametric vs. non-parametric
HistogramsKnow how to interpret a histogram, i.e., normal, skewed left (left tail), skewed right (right tail), and

Слайд 17Measures of Central Tendency
Mean: what’s commonly called “average”



Median (m):

middle-most observation of ordered data
n odd: m = the (n

+ 1)/2-th largest observation
n even: m = average of the (n/2)-th and (n/2 + 1)-th largest observations
Mode: most frequently occurring observation(s)

Measures of Central Tendency Mean: what’s commonly called “average”Median (m): middle-most observation of ordered datan odd: m

Слайд 18Measures of Variability (Dispersion)

Measures of Variability (Dispersion)

Слайд 20CD4 count (Numerical data)
One sample meanCD4
Mean change in CD4 levels

(paired samples)
Difference in mean CD4 between two groups
Independent t-test
Paired

t-test

One-sample tests

(Q.1: Is the mean CD4 level for HIV+ patients less than 400?)

(Q.2: Is treatment with AZT effective in raising CD4 levels of HIV+ patients?)

(Q.3: Are mean CD4 levels different between HIV+ and HIV– patients?)

CD4 count (Numerical data)One sample meanCD4Mean change in CD4 levels (paired samples) Difference in mean CD4 between

Слайд 21Independent vs. paired (dependent) samples

Independent vs. paired (dependent) samples

Слайд 22SPSS Output

SPSS Output

Слайд 23SPSS Output

SPSS Output

Слайд 27What is correlation?
Correlation captures the extent to which two variables

have a linear relationship.
Correlation coefficients are descriptive statistics that

describe the degree or strength of the linear relationship between two variables.
To calculate correlations we need pairs of numbers.
What is correlation?	Correlation captures the extent to which two variables have a linear relationship. Correlation coefficients are

Слайд 28r = 0.939
r = 0.939
SPSS output

r = 0.939r = 0.939SPSS output

Слайд 29SPSS output
rs = 0.764
rs = 0.764

SPSS outputrs = 0.764rs = 0.764

Слайд 30Simple linear regression
Purpose: to model the change in one

variable (Y, the “dependent variable”) as the other variable (X,

the “independent variable”) changes.
Assumptions
Independence: For any particular value of X, the Y-values are statistically independent of each other.
Homoscedasticity: For any particular value of X, the Y-values have the same variance.
Normality: For any particular value of X, the Y-values have a normal distribution.

Simple linear regression Purpose: to model the change in one variable (Y, the “dependent variable”) as the

Слайд 31Procedure for linear regression
Make a scatterplot of Y vs. X

to determine if data are linear and homoscedastic.
If the

scatterplot looks reasonable, then assume the simple linear regression model:


where  is the intercept,  is the slope, and  represents individual differences (“errors”) from the true population regression line:

Procedure for linear regressionMake a scatterplot of Y vs. X to determine if data are linear and

Слайд 32a = –5.996
b = 1.978
SPSS output: Coefficients
For simple linear regression,


this will be = r.

a = –5.996b = 1.978SPSS output: CoefficientsFor simple linear regression, this will be = r.

Слайд 33Multilevel Structured Data

Multilevel data frequently encountered in social sciences

research refer to data which contain multilevel (hierarchical or nested)

structure.
Multilevel structure indicates that data to be analyzed were obtained from units (e.g., individual) which are nested within higher level units (e.g., groups or clusters).

Multilevel Structured Data Multilevel data frequently encountered in social sciences research refer to data which contain multilevel

Слайд 34Example of Multilevel Data in Prevention Research

In school-based substance

use prevention research, schools are usually the units of assignment

to experimental conditions (program or control).
Data are then collected from both student (micro) and school (macro) levels
student (micro) and
school (macro) levels
to evaluate program effect.
Example of Multilevel Data in Prevention Research In school-based substance use prevention research, schools are usually the

Слайд 35Missing Data

Data are missing on some variables for some

observations.
Three goals of missing data handling
Minimize bias
Maximize

use of available information
Get good estimates of uncertainty (get accurate estimates of standard error, CI, p value)
Not a goal: imputed values “close” to real values
Missing Data Data are missing on some variables for some observations. Three goals of missing data handling

Слайд 36Missing Data: Methods to Deal with Missing
Listwise Deletion: Delete

cases with any missing on the variables being analyzed.
Missing

replacement by imputation:
Mean replacement:
using variable mean or group mean
will not affect mean, but reduce variance
Regression approach
predicting the missing value on one variable with scores on other variables
Multiple imputation
Sensitivity analysis
complete cases vs. missing replacement
Missing Data: Methods to Deal with Missing Listwise Deletion: Delete cases with any missing on the variables

Слайд 37Methods Section Outline

Participants and Procedures
Measures
Data Analysis

Methods Section OutlineParticipants and ProceduresMeasuresData Analysis

Слайд 38Participants and Procedures

Participants and Procedures

Слайд 39Data Analysis

Data Analysis

Слайд 40Q/A Session

Q/A Session

Слайд 41Arthur Galimov e-mail: galimov@usc.edu IG: ar_galimov

Arthur Galimov e-mail: galimov@usc.edu IG: ar_galimov

Обратная связь

Если не удалось найти и скачать доклад-презентацию, Вы можете заказать его на нашем сайте. Мы постараемся найти нужный Вам материал и отправим по электронной почте. Не стесняйтесь обращаться к нам, если у вас возникли вопросы или пожелания:

Email: Нажмите что бы посмотреть 

Что такое TheSlide.ru?

Это сайт презентации, докладов, проектов в PowerPoint. Здесь удобно  хранить и делиться своими презентациями с другими пользователями.


Для правообладателей

Яндекс.Метрика