![]() |
| ||||
Revised: July, 2003
I. Using the PSID Data Center
A. The Nature of the Exercise
Administrative data has shown that the Food Stamp caseload has declined substantially since 1994. In this tutorial we will examine changes in Food Stamp participation using the PSID. We will also examine characteristics of families who were still receiving Food Stamps in 1999. It has been claimed that families on Food Stamps in the late 1990s, after the caseloads had fallen tremendously, would have more barriers than families who were on aid in the mid-1990s. Specifically, we will estimate the share of food stamp participants in 1999 who are working and the share who have a health problem that limits the amount or type of work they can do. Furthermore, we will examine the extent to which Food Stamp recipients hold bank accounts. It has become more common to distribute food stamps in the form of Electric Benefit Transfers (EBT) instead of paper coupons. Presumably recipients who have bank accounts are able to more easily accommodate the shift to EBT.
The goal of this tutorial is to demonstrate that many of these questions can be answered very quickly -- within 60-120 minutes -- even for someone who has never used the PSID. Now let's begin the tutorial and see just how easy it is!
B. Getting Started
The food stamp data are derived from responses to the Computer Assisted Telephone Interviewing (CATI) questions. These questions -- questions F8 and F14 -- can be viewed in the PSID Documentation Center. We strongly encourage all users to examine the questions before analyzing the data. For example, the question asked in 1999 about participation in 1997 is:
F8. Did [you or anyone else in your family living there/you] use government food stamps at any time in 1997? That is, two years ago.
| 1 | YES |
| 5 | NO |
| 8 | DK |
| 9 | RF |
C. Using the Data Center
Now you need to go to the PSID Data Center to create your own customized data set. The data for this tutorial are selected from the 1995 and 1999 Public Release files. As shown in the screen shot, you want to select 'PSID Individual', 'PSID Family' and 'Family Weights' groups and press 'continue'. Then the years 2001 - 1969 will appear in a selection box, 'PSID Individual data by Years', 'PSID Family Data' and 'Family weights'. We are studying the 1995 and 1999 Food stamp usage, so you want to select both 1995 and 1999 years in the selection boxes. Then click 'continue'. This will bring up six scrollable variable selection boxes by Data Group, two for Individual, the other two for Main Family and the last two for Family Weights. Later on you will want to see the values of the codes of the variables that you are choosing; you can easily review these codes by looking at the on-line CATI documentation already described above in Section B of this tutorial.
Screen Shot 1: Selecting Data Groups

Screen Shot 2: Selecting Data Categories and Years

To create our file we want to scroll through the selection screens below. You need to select the following individual data to limit the responses to heads living in the family unit from the 1999 list. (A technical note: to select non-adjacent variables in the variable list box, hold down the 'control' key and skip those variables you do not want to select by scrolling to the desired variable and highlighting (in dark blue) with the mouse/cursor.)
ER33502 SEQUENCE NUMBER 99(codes 1-20
identify individuals who lived in the family at the time of the 1999 interview)
ER33503 RELATION TO HEAD 99(code values 10 identifies heads)
From 1995 Individual data level, again you should select ER33202 SEQUENCE NUMBER 95 and ER33203 RELATION TO HEAD 95.
The other variables needed are from the following family level variables 1999 list.
ER13001 RELEASE NUMBER
ER13010 AGE OF HEAD
ER13011 SEX OF HEAD
In order to examine differences in food stamp participation by the number of children in the family, we need ER13013 (# CHILDREN IN FU). We will also need ER13021 (HEAD MARITAL STATUS). Then scroll down and select ER13205 (B1 1ST MENTION), which is the variable indicating the head's employment status. Now scroll to Section F, where we need the following family level variables:
ER14240 F8 WTR USED FOOD STAMPS IN 1997
ER14270 F14 WTR RECEIVED FOOD STAMPS IN 1999
Next, select ER15019 (W27 WTR CK/SAVINGS/CD, ETC). This is a variable determining whether or not the family has money in bank accounts. Then, scroll to Section H (for health) and choose ER15449 (H2 LIMIT TYPE/AMT WRK H), a measure of the health status of the head of the family:
From the 1995 list, you need to select the following family level variables:
ER5001 RELEASE NUMBER
ER6058 F8 REC FOOD STAMPS PYR
Finally, scroll to 'Family Weights' and select FCWT99 (1999 PRELIMINARY CORE/IMMIGR FAMILY WEIG) and FCWT95 (1995 PRELIMINARY CORE FAMILY WEIGHT). The PSID is not a random sample of the US population, so these weights are used to make the sample representative -- more on this issue later.
After you have submitted your selections you will see a "PSID/CDS data cart contents" as illustrated in Screen Shot 4, which will list Individual level, Main Family Level and Family Weight Level variables that we chose above. Note that a few variables -- 1999 interview number, 1995 interview number, 1968 interview number, and person number for 1968 -- will be automatically selected even if you did not choose them. These variables are often needed for linking records.
Screen Shot 3: Selecting Variables

Screen Shot 4: Shopping Cart

Now we want to focus on the group needed for our purposes. We only want information about the head of the family and the head must be living with the family. To include only those cases where the individual resides with the family, you need to type the following statements into the subsetting box illustrated below: (ER33202<21) and (ER33502<21). (To simplify the analyses, we required that the head be in the family in 1995 AND 1999; for some of the questions we answer below the user might want to instead compare all PSID families in 1995 with all PSID families in 1999. If this interests you, give it a try!) In order to only include information about the head, also add (ER33203=10) and (ER33503=10). Additionally, you should add '((ER6058=1) or (ER6058=5)) and ((ER14240=1) or (ER14240=5)) and ((ER14270=1) or (ER14270=5))' which will remove from our sample any respondents who would have missing values (=8,9) for the food stamp questions (This only removes 7 people but users may devise their own alternative way to handle missing values represented by 8 and 9.). Finally, add (FCWT95>0) and (FCWT99>0) so you only include heads with positive family weights. The format for the subsetting statements is important and there is on-line help if you need it. A typo in this box will return the message 'Internal Server Error' when you go to create your analysis file.
Your final subsetting criteria should be:
(ER33202<21) and (ER33502<21) and (ER33203=10) and (ER33503=10) and ((ER6058=1) or (ER6058=5)) and ((ER14240=1) or (ER14240=5)) and ((ER14270=1) or (ER14270=5)) and (FCWT95>0) and (FCWT99>0)
We are getting close to having the data pulled together, so stay with us! The next step consists of selecting the format of the data that will be created. Since the analysis will use the software Excel, you should select the dBase choice, as illustrated below. If you want, you can 'unclick' any activated Data Definition Statement boxes, but this step is not required.
You will receive a File Documentation message like the one below that reports the size of the Data File (DBF) and the Variable Labels file. PC users should right click (other users may be required to use alternatives to right click) on the blue text of dBase Data File (DBF) and select open; the file will be transferred to you. You should do the same for the Variable Labels. Then you need to save the dBase Data File as an Excel file. Excel will work with a dBase file directly, but doesn't save your work. To save your work, save the file as an Excel Workbook rather than a DBF file. Instructions for using Excel come next!
Screen Shot 5: Censoring and Output Data File Options

Screen Shot 6: Job Completion/Download Message

II. Using Excel (2000) with your customized PSID data set
You should have an Excel file with the variable numbers arrayed across the top row and the variable values running from row 2 to row 4445, i.e., there should be 4,444 observations in the dataset. Note that had we selected 'All Individuals' in the 'Data Merge Options' section above, we also would have 4,444 observations, since we are selecting only individuals who were family heads in the 'Subsetting Criteria' box (ER33203 = 10 and ER33503 = 10). We now follow a series of steps that will help answer the several questions about food stamp participation in the late 1990s.
|
A |
B |
C |
D |
E |
F |
G |
H |
I |
J |
K |
| ER30001 | ER30002 | ER33201 | ER33202 | ER33203 | ER5001 | ER6058 | FCWT95 | ER33501 | ER33502 | ER33503 |
| L | M | N | O | P | Q | R | S | T | U | V |
| ER13001 | ER13010 | ER13011 | ER13013 | ER13021 | ER13205 | ER14240 | ER14270 | ER15019 | ER15449 | FCWT99 |
First, let's calculate the "exit rate", which we define as the proportion of the families who used food stamps in 1997 that no longer receive them in 1999. To do this, in cell Z3 enter the command [=IF(X3=1,V3,"")]. This will return the weight for those families who were using food stamps in 1997 and a blank for those who were not. You should fill down to cell Z4446 and enter the command [=1-(SUMPRODUCT(Y3:Y4446,Z3:Z4446))/(SUM(Z3:Z4446))] in cell W4453. This uses the weights of those using food stamps in 1997 and the recoded responses from 1999 to give the inverse of the percentage of families who were using food stamps in 1997 that were still receiving them in 1999. This gives an answer of 37.9%, which means just over one-third all families receiving food stamps in 1997 were not receiving assistance in 1999.
We also want to calculate the "entry rate"; i.e., the proportion of families who did not use food stamps in 1997 that were receiving them in 1999. To get this value, enter the command [=IF(X3=0,V3,"")] in cell AA3. Now fill down to AA4446 and enter the command [=(SUMPRODUCT(Y3:Y4446,AA3:AA4446))/(SUM(AA3:AA4446))] in cell W4454. Where we find that just 1.1% of all families not receiving food stamps in 1997 were in fact receiving assistance in 1999.
Let's tackle each of these questions in turn. First, we will find the share who were employed. Responses to question B1 should be in column Q. These responses must be recoded before we can use our formula. Name column AD 'Work Variable Recode' and enter the command [=IF(Q3=1,1,0)] in cell AD3 and 'fill down'. This will return a '1' for those that are employed (Q=1) and a zero for everyone else. Now we can create our sub-sample. In cell AE3 enter the formula [=IF(Y3=1,V3,"")], naming column AE 'Weight-Receiving Food Stamp 99' and 'fill down'. This limits the analysis to families who received food stamps in 1999 (Y3=1) and returns their weight in column AE. Now we can use our recoded responses to find what we want. Enter the formula [=(SUMPRODUCT(AD3:AD4446,AE3:AE4446))/(SUM(AE3:AE4446))] in cell W4457 and we see that 30.1% of those who received food stamps in 1999 were working. To find the same thing for those that did not receive food stamps, enter the formula from AE3 into AF3 but change 'Y3=1' to 'Y3=0'. Fill down and use our formula in cell W4458 to find that 68.2% of non-participants were working.
We can answer the other two questions using the same formulas with minor changes. To find the share of food stamp recipients with bank accounts, change 'Q3=1' to 'T3=1' in the recode formula (responses to question W27 should be in column T). To do this, enter the command in cell AG3 [=IF(T3=1,1,0)], naming column AG 'Bank Variable Record'. Now fill down and use the formula [=(SUMPRODUCT(AG3:AG4446,AE3:AE4446))/(SUM(AE3:AE4446))] in cell W4459 and [=(SUMPRODUCT(AG3:AG4446,AF3:AF4446))/(SUM(AF3:AF4446))] in cell W4460. Just over two-fifths of food stamp recipients have a bank account, while over 85 percent of non-recipients have an account. To find the shares with limiting conditions, follow the same procedure, changing 'Q3=1' to 'U3=1' in the recode formula (responses to question H2 should be in column U) To do this, enter the command in cell AH3 [=IF(U3=1,1,0)], naming column AH 'Limit Condition Variable Record'. Now fill down and use the formula [=(SUMPRODUCT(AH3:AH4446,AE3:AE4446))/(SUM(AE3:AE4446))] in cell W4461 and [=(SUMPRODUCT(AH3:AH4446,AF3:AF4446))/(SUM(AF3:AF4446))] in cell W4462. We find that food stamp recipients are much more likely to have a health problem that limits the amount or kind of work that they can do.
Institute for Social Research | University of Michigan | Privacy | Conditions of Use