Data Entry

In previous years, I collected information from undergraduates in class. Here it is represented numerically:

Age Sex Reg  Maths Dist

 37   0     2   3   27
 29   1     5   3    2
 22   0     7   2   -9
 20   1     1   2    1
 20   1     1   2    6
 20   1     1   3   24
 21   1     2   2    0
 21   1     2   2    0
 20   1     7   2    0
 22   0     1   3    1
 20   0     2   3   25
 20   1     3   3    1
 20   1     5   2    5
 20   1     3   3   75
 21   0     3   3   20
 21   1     1   3    5
 21   1     1   3    7
 21   0     1   3    3
 19   0     1   2   40
 19   1     2   3    5

Sex is coded with

Reg is region of birth and is coded with

  1. Munster
  2. Leinster
  3. Connacht-ulster
  4. NI
  5. GB
  6. EU
  7. US
  8. Other

Dist is distance of home from UL, in miles.

Maths is last maths done, coded with

  1. Post LC
  2. LC(h)
  3. LC(o)
  4. JC(h)
  5. JC(o)
  6. Lower

What to do

Enter this data into Stata. Either enter the data into the data editor, or copy the numbers into a file and use an infile command.

Use DEScribe, SUmmarize, List and TABulate to examine the data.

Be clear about which of TABulate and SUmmarize is appropriate for the different variables. Tabulation is usually helpful where there are relatively few distinct values, and summaries such as means and standard deviations are only meaningful for interval and ratio variables.


Brendan Halpin
Department of Sociology, University of Limerick
F1-007, x 3147; brendan.halpin@ul.ie