*Copyright @ www.mycsg.in;

What does PROC TRANSPOSE do

`proc transpose` reshapes data by converting values from rows into columns or from columns into rows
It is commonly used when the structure of a dataset must be changed before reporting, summarisation, or merging
A frequent use case is converting repeated values within a subject into separate variables on one row
Another common use case is turning multiple variables into one stacked variable for later analysis

Create a simple example dataset

We create a long-style dataset named `scores_long` where each student has multiple rows
The variable `test` identifies the type of score, and `score` contains the value to be transposed
This layout is a good starting point for understanding how rows can be converted into columns

data scores_long;
   input student $ test $ score;
datalines;
Alice Math 88
Alice Science 91
Alice English 84
John Math 79
John Science 85
John English 82
Jane Math 93
Jane Science 89
Jane English 95
;
run;

`scores_long` contains one row per student per test
Each student therefore appears multiple times in the dataset
Inspect the dataset and confirm that the same student name repeats across different test values

Basic transpose without BY grouping

In the simplest case, `proc transpose` takes values from one variable and writes them as separate observations in the transposed output
This example transposes the `age` and `height` variables from a single-observation dataset so the learner can see the default output structure
The output dataset usually contains `_NAME_` to store the original variable name and `COL1` to store the transposed value

data one_student;
   set sashelp.class(obs=1 keep=name age height);
run;
 
proc transpose data=one_student out=one_student_t;
   var age height;
run;

The variables listed on the `var` statement become separate observations in `one_student_t`
`_NAME_` identifies whether the row came from `AGE` or `HEIGHT`
`COL1` stores the corresponding value

Transpose rows to columns within each student

Most practical transpose tasks use a `BY` variable so each group is transposed separately
We sort the data by `student` before using `by student;` because BY group processing requires matching sort order
`id test;` tells SAS to use the values of `test` as output column names
`var score;` tells SAS which values should populate those new columns

proc sort data=scores_long out=scores_long_sort;
   by student;
run;
 
proc transpose data=scores_long_sort out=scores_wide;
   by student;
   id test;
   var score;
run;

`scores_wide` now contains one row per student
The values of `test` such as `Math`, `Science`, and `English` become output columns
The corresponding values from `score` fill those columns for each student
Verify that each student now appears only once in the transposed dataset

Transpose columns to rows for all students

`proc transpose` can also be used to stack multiple variables into one value column
In this example, `age`, `height`, and `weight` are converted into repeated rows per student
This is useful when wide data must be converted into a long structure

proc transpose data=sashelp.class out=class_long name=source_variable;
   by name;
   var age height weight;
run;

The output dataset contains one row per student per original analysis variable
`source_variable` identifies whether the value came from `age`, `height`, or `weight`
`COL1` stores the corresponding value

Understand the NAME option

The `name=` option lets us rename the default `_NAME_` variable created by `proc transpose`
This is useful when the default variable name is not descriptive enough for the output dataset

proc transpose data=one_student out=one_student_t2 name=source_variable;
   var age height;
run;

The output now uses `source_variable` instead of `_NAME_`
This makes the meaning of that column easier to understand when reviewing the dataset

Key points to remember

`proc transpose` reshapes datasets by converting rows to columns or columns to rows
`var` identifies the values to transpose
`by` groups observations and produces one transposed result per group
`id` uses the values of a variable as output column names
When a BY statement is used, the input dataset should be sorted by the same BY variables first

*Copyright @ www.mycsg.in;

What does PROC TRANSPOSE do

Create a simple example dataset

SAS Log

Dataset View

Basic transpose without BY grouping

SAS Log

Dataset View

Transpose rows to columns within each student

SAS Log

Dataset View

Transpose columns to rows for all students

SAS Log

Dataset View

Understand the NAME option

SAS Log

Dataset View

Key points to remember