The Research Article
Vartanian: SW 541
Due: April 15, 2009

Write a 14-15 page research article based on either the PSID given to you in class or the National Educational Longitudinal Survey (NELS).   The article should examine an important topic and hopefully one that has not been previously researched (though, for this class article, this need not be the case).  Choose at least 12-15 independent variables for the article and try to include more than a single dependent variable for what you’re examining. For example, you may be interested in issues related to work or how financially sound individuals are given their family backgrounds.  You may look at a number dependent variables including hours of work, whether or not the individual is living below the poverty line, and income or family income relative to the poverty line (a continuous variable that takes into account family size).  You may choose to measure the adolescent effects of welfare receipt, hours that the parents worked, and types of occupation of the head of household.

I would like you to use some of the different skills you’ve learned in the class to examine your topic.  For example, I would like you to run interactions between independent variables, run quadratic or cube functions, use log dependent variables, use Chow tests or likelihood ratio tests to determine if groups are different or adding variables adds anything to your model.  Run regressions with by adding sets of variables to see how your main independent variable changes when adding control variables.  You will thus have several columns of results as you add more and more variables.  The adding of variables should make some logical sense – don’t simply add them and give no reason for why you’re adding them. 

The following is a guide to how to write the article.

1. Statement of the problem.  What are you studying and why is it important?

2. Literature review.  What have others found to be important in the study of your problem.  The problem you are examining is your dependent variable.  For example, have others found adolescent poverty to be an important indicator of adult poverty.  If they have then list the authors that have found poverty to be important.  Perhaps in other studies, poverty was not found to be important.  List the authors that have not found poverty to be important.

Do not make an annotated bibliography in this section.  That is, do not list the articles and give descriptions of each of the articles.  Categorize the articles by subject area, such as poverty, and briefly discuss the findings of these studies.  If several of the studies have the same findings, discuss them together (within the same sentence). 

Remember that you are trying to explain your dependent variable as best you can.  You therefore do not want to only focus on the one or a couple of independent variables that you believe are most important in your study.  You should also focus on those variables that others have found to be important in researching your dependent variable, and include what they have found for these other independent/control variables.  

Limit the number of web sources you use to a few.  I would like you to examine the journals for what others have found regarding your dependent variable.  Also, limit citing sources that come from other sources. This means, do not say, "Johnson (1999) as cited in Vartanian (2006)".  Go to the original if you want to cite it. 

Have at least 10-12 sources for this article.

3. Theory.  Discuss the theoretical foundations for what you’re examining.  For example, if you’re studying persistent poverty, you may wish to examine culture of poverty theories or theories related to lack of economic opportunities.  I often examine alternative theories to determine which theory is best supported by the data and statistical analyses.  Explain where the theories come from?

4. Data sources and hypotheses.  Explain the data you'll be using for your study.  Discuss where the data comes from and the types of variables included in the data (in a general way).  Include the number of observations, perhaps the number of observations in the control and experimental groups (if there are such groups), the year or years the data were collected and the population from which the sample was taken from. For what populations can you generalize your results?

Hypotheses.  Given your problem statement, your literature review and your theories, what hypotheses follow?  Relate your hypotheses, your dependent and independent variables, to previous literature and theory.  Why are you hypothesizing what you're hypothesizing?  Perhaps you're hypothesizing the positive or negative relationship between variables because no one else has examined these relationships and these relationships are important for some theoretical reason.  You should come up with a logical reason why the relationships you're examining are important.  Or perhaps you're reexamining the relationship between variables -- others have examined this relationship but you have a different data source and will see if this relationship holds up for this different data.  Remember that an unbiased b coefficients doesn’t mean that you don’t need to replicate studies.  To find the population B coefficient, replication is necessary. You would like to verify that what appears to be going on in one sample is also going on in your sample -- you will replicate to verify (I believe that Ronald Reagan used to say something like this
about the Soviet Union).

Explain the variables you’ll be controlling in your statistical models and the importance of controlling these variables.  Cite other studies that have also used these variables as control variables.

5. Describe the types of statistical tests you’ll be using to examine the relationship between your independent and dependent variables.  In other words, you’ll indicate that you’re using a logistic regression, ordinary least squares, or other type of statistical models.

6. Results.  In this section, first report on the descriptive statistics for the study.  What are the means and standard deviations for the dependent, independent and control variables?  Create a table with this information and describe what you have found in the text of the article.  Next, report on the outcomes of your statistical tests -- either the OLS multivariate regression or the logistic regression, or whatever other type of statistical analysis you're using.  Was there a relationship between the independent and dependent variables or not?  Was the relationship in the correct direction?  Did your tests give support to your hypotheses?  Or must you accept the null hypothesis (or, in other words, you cannot give support to your hypotheses)?  Was the effect small in spite of being significant?  Exactly what is the relationship between the independent and dependent variables?  You’ll be able to determine this relationship by examining the b coefficients from the regression results. Next, I would you to present a table of predicted outcomes -- either probabilities if you are using logistic regression analysis or other types of predictions, using OLS or some other statistical analysis.  Talk to me about the type of predictions you should use for your paper and the SAS programming you will need to get these predictions.  If you are using indexed dependent variables, you will need to talk to me about getting predictions for these types of variables.  Generally, you will also need to choose an non-indexed variable also for your predicted outcomes.  Again, you will present at least three tables in the results section – one for the means and standard deviations,a second for the regression results, and a third for the predicted results.  Many of you will be presenting more than one regression table if you are also examining fully interactive models, or examining the effects of your independent variables on different groups.  I encourage you to be as creative as you can be in examining your models. 

In your tables, use no more than 3 decimal places for b, SE, and probability estimates.  Try to limit the number of tables to 5 or 6.  In your regression models, include all variables you've used in the regression so that I can more easily see what you're doing.  Use astericks to indicate statistical significance and put a note at the bottom of the table that indicates what the astericks mean.  Label each table explicity so that I know what kind of  regression you're using.  In the label, indicate the dependent variable.  Give each table a number (Table 1, Table 2).  You must write about each table that you present and include the tables you are discussing in the order that you discuss them.  Thus, don't discuss Table 3 first.  In your descriptive statistics, include all variables you've used in your regressions, including all independnet variables (included excluded categories for dummy variables) and dependent variables.  If you are doing analyses for separate groups (such as gender or race groups), include columns for means and standard deviations for these groups for each of your independent and dependent variables. 

7. Conclusion.  Go over what you were looking for in your analyses and what you found.  Do the two match up?  What is the significance (not statistical) of your findings?

Remember that you need to have transition statements between each of these different sections. Relate one section to the next, as you would in any paper.  Note that you may wish to order your paper somewhat differently than the way I’ve described it above.  Look for examples of how to write a research paper from the articles you’re reading this semester.  Follow their lead. Examine how they put together their literature review, how they report their results, etc, and their ordering of topic areas.

What to hand in:

A. The paper, with a title on a title page, and an abstract.  The paper should have 1" margins and use 12 point, Times New Roman font.  Papers over
    20 pages long will not be accepted.  Include a bibiography but these pages will not count as part of the 20 page maximum.
B. The SAS/SPSS/STATA output
C.  All programming that you've done to get your output.