The TRANSREG Procedure

Example 101.5 Metric Conjoint Analysis of Tire Data

This example, which is more detailed than the previous one, uses PROC TRANSREG to perform a metric conjoint analysis of tire preference data. Conjoint analysis can be used to decompose preference ratings of products or services into components based on qualitative product attributes. For each level of each attribute of interest, a numerical part-worth utility value is computed. The sum of the part-worth utilities for each product is an estimate of the utility for that product. The goal is to compute part-worth utilities such that the product utilities are as similar as possible to the original ratings. Metric conjoint analysis, as shown in this example, fits an ordinary linear model directly to data assumed to be measured on an interval scale. Nonmetric conjoint analysis, as shown in Example 101.4, finds an optimal monotonic transformation of original data before fitting an ordinary linear model to the transformed data.

This example has three parts. In the first part, an experimental design is created. In the second part, a DATA step creates descriptions of the stimuli for the experiment. The third part of the example performs the conjoint analyses.

The stimuli for the experiment are 18 hypothetical tires. The stimuli represent different brands (Goodstone, Pirogi, Machismo),[37] prices ($69.99, $74.99, $79.99), expected tread life (50,000, 60,000, 70,000 miles), and road hazard insurance plans (Yes, No).

For a conjoint study such as this, you need to create an experimental design with 3 three-level factors, 1 two-level factor, and 18 combinations or runs. The easiest way to get this design is with the %MktEx autocall macro. The %MktEx macro requires you to specify the number of levels of each of the four factors, followed by N=18, the number of runs. Specifying a random number seed, while not strictly necessary, helps ensure that the design is reproducible. The %MktLab macro assigns the actual factor names instead of the default names x1, x2, and so on, and it assigns formats to the factor levels. The %MktEval macro helps you evaluate the design. It shows how correlated or independent the factors are, how often each factor level appears in the design, how often each pair occurs for every factor pair, and how often each product profile or run occurs in the design. See Kuhfeld (2010) for more information about experimental design and conjoint analysis; look for the latest Conjoint Analysis report. The following statements create, evaluate, and display the design:

title 'Tire Study, Experimental Design';

proc format;
   value BrandF
              1 = 'Goodstone'
              2 = 'Pirogi   '
              3 = 'Machismo ';
   value PriceF
              1 = '$69.99'
              2 = '$74.99'
              3 = '$79.99';
   value LifeF
              1 = '50,000'
              2 = '60,000'
              3 = '70,000';
   value HazardF
              1 = 'Yes'
              2 = 'No ';
run;

%mktex(3 3 3 2, n=18, seed=448)

%mktlab(vars=Brand Price Life Hazard, out=sasuser.TireDesign,
        statements=format Brand BrandF9. Price PriceF9.
                   Life LifeF6. Hazard HazardF3.)

%mkteval;

proc print data=sasuser.TireDesign;
run;

The %MktEx macro (Kuhfeld, 2010) output displayed in Output 101.5.1 shows you that the design is 100% efficient, which means it is orthogonal and balanced. The %MktEval macro output displayed in Output 101.5.2 shows you that all of the factors are uncorrelated or orthogonal, the design is balanced (each level occurs once), and every pair of factor levels occurs equally often (again showing that the design is orthogonal). The n-way frequencies show that each product profile occurs once (there are no duplicates). The design is shown in Output 101.5.3. The design is automatically randomized (the profiles were sorted into a random order and the original levels are randomly reassigned). Orthogonality, balance, randomization, and other design concepts are discussed in detail in Kuhfeld (2010), in the Experimental Design, Efficiency, Coding, and Choice Designs report.

Output 101.5.1: Tire Study, Design Efficiency

Tire Study, Experimental Design

Algorithm Search History

                                   Current          Best                        
           Design    Row,Col  D-Efficiency  D-Efficiency  Notes                 
           ----------------------------------------------------------           
                1      Start      100.0000      100.0000  Tab                   
                1        End      100.0000                                      
                                                                                

Tire Study, Experimental Design

                              The OPTEX Procedure                               

                            Class Level Information                             

                             Class  Levels  Values                              

                             x1       3     1 2 3                               
                             x2       3     1 2 3                               
                             x3       3     1 2 3                               
                             x4       2     1 2                                 

Tire Study, Experimental Design

Design Number D-Efficiency A-Efficiency G-Efficiency Average Prediction
Standard Error
1 100.0000 100.0000 100.0000 0.6667


Output 101.5.2: Tire Study, Design Evaluation

Tire Study, Experimental Design
Canonical Correlations Between the Factors
There are 0 Canonical Correlations Greater Than 0.316

  Brand Price Life Hazard
Brand 1 0 0 0
Price 0 1 0 0
Life 0 0 1 0
Hazard 0 0 0 1



Tire Study, Experimental Design
Summary of Frequencies
There are 0 Canonical Correlations Greater Than 0.316

    Frequencies
  Brand 6 6 6
  Price 6 6 6
  Life 6 6 6
  Hazard 9 9
  Brand Price 2 2 2 2 2 2 2 2 2
  Brand Life 2 2 2 2 2 2 2 2 2
  Brand Hazard 3 3 3 3 3 3
  Price Life 2 2 2 2 2 2 2 2 2
  Price Hazard 3 3 3 3 3 3
  Life Hazard 3 3 3 3 3 3
  N-Way 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1


Output 101.5.3: Tire Study, Design

Tire Study, Experimental Design

Obs Brand Price Life Hazard
1 Pirogi $79.99 50,000 No
2 Machismo $79.99 60,000 No
3 Machismo $74.99 70,000 Yes
4 Machismo $74.99 50,000 No
5 Goodstone $74.99 60,000 Yes
6 Pirogi $69.99 60,000 Yes
7 Goodstone $69.99 50,000 Yes
8 Machismo $69.99 50,000 Yes
9 Pirogi $74.99 60,000 Yes
10 Pirogi $74.99 50,000 No
11 Goodstone $79.99 60,000 No
12 Goodstone $69.99 70,000 No
13 Pirogi $79.99 70,000 Yes
14 Goodstone $74.99 70,000 No
15 Machismo $69.99 60,000 No
16 Machismo $79.99 70,000 Yes
17 Pirogi $69.99 70,000 No
18 Goodstone $79.99 50,000 Yes


The %MktEx macro requires SAS/STAT, SAS/QC, and SAS/IML software. Alternatively, you can make a design for this experiment using the %MktDes macro, which requires only SAS/STAT and SAS/QC software. The %MktDes macro contains a small subset of the functionality of the %MktEx macro. It can be used as follows:

%mktdes(factors=Brand=3 Price=3 Life=3 Hazard=2, n=18)

The results of this step are not shown or used.

Next, the questionnaires are printed and given to the subjects, who are asked to rate the tires.

The following statements produce Output 101.5.4:

data _null_;
   title;
   set sasuser.TireDesign;
   file print;
   if mod(_n_,4) eq 1 then do;
      put _page_;
      put +55 'Subject ________';
   end;
   length hazardstring $ 7.;
   if put(hazard, hazardf3.) = 'Yes'
      then hazardstring = 'with';
      else hazardstring = 'without';

   s = 3 + (_n_ >= 10);
   put // _n_ +(-1) ') For your next tire purchase, '
          'how likely are you to buy this product?'
       // +s Brand 'brand tires at ' Price +(-1) ','
       /  +s 'with a ' Life 'tread life guarantee, '
       /  +s 'and ' hazardstring 'road hazard insurance.'
       // +s 'Definitely Would                 Definitely Would'
       /  +s 'Not Purchase                             Purchase'
       // +s '1     2     3     4     5     6     7     8     9 ';
run;

This output in Output 101.5.4 is abbreviated in the interest of conserving space; the statements actually produce stimuli for all combinations.

Output 101.5.4: Conjoint Analysis, Stimuli Descriptions

                                                       Subject ________         


1) For your next tire purchase, how likely are you to buy this product?         

   Pirogi brand tires at $79.99,                                                
   with a 50,000 tread life guarantee,                                          
   and without road hazard insurance.                                           

   Definitely Would                 Definitely Would                            
   Not Purchase                             Purchase                            

   1     2     3     4     5     6     7     8     9                            


2) For your next tire purchase, how likely are you to buy this product?         

   Machismo brand tires at $79.99,                                              
   with a 60,000 tread life guarantee,                                          
   and without road hazard insurance.                                           

   Definitely Would                 Definitely Would                            
   Not Purchase                             Purchase                            

   1     2     3     4     5     6     7     8     9                            


3) For your next tire purchase, how likely are you to buy this product?         

   Machismo brand tires at $74.99,                                              
   with a 70,000 tread life guarantee,                                          
   and with road hazard insurance.                                              

   Definitely Would                 Definitely Would                            
   Not Purchase                             Purchase                            

   1     2     3     4     5     6     7     8     9                            


4) For your next tire purchase, how likely are you to buy this product?         

   Machismo brand tires at $74.99,                                              
   with a 50,000 tread life guarantee,                                          
   and without road hazard insurance.                                           

   Definitely Would                 Definitely Would                            
   Not Purchase                             Purchase                            

   1     2     3     4     5     6     7     8     9                            


The third part of the example performs the conjoint analyses. The DATA step reads the data. Only the ratings are entered, one row per subject. Real conjoint studies have many more subjects than five. The TRANSPOSE procedure transposes this ($5 \times 18$) data set into an ($18 \times 5$) data set that can be merged with the factor level data set sasuser.TireDesign. The next DATA step does the merge. The PRINT procedure displays the input data set.

PROC TRANSREG fits the five individual conjoint models, one for each subject. The UTILITIES a-option displays the conjoint analysis results. The SHORT a-option suppresses the iteration histories, OUTTEST=UTILS creates an output data set with all of the conjoint results, and the SEPARATORS= option requests that the labels constructed for each category contain two blanks between the variable name and the level value. The ODS SELECT statement is used to limit the displayed output. The MODEL statement specifies IDENTITY for the ratings, which specifies a metric conjoint analysis—the ratings are not transformed. The variables Brand, Price, Life, and Hazard are designated as CLASS variables, and the part-worth utilities are constrained to sum to zero within each factor.

The following statements produce Output 101.5.5:

title 'Tire Study, Data Entry, Preprocessing';

data Results;
   input (c1-c18) (1.);
   datalines;
233279766526376493
124467885349168274
262189456534275794
184396375364187754
133379775526267493
;
* Create an Object by Subject Data Matrix;
proc transpose data=Results out=Results(drop=_name_) prefix=Subj;
run;

* Merge the Factor Levels with the Data Matrix;
data Both;
   merge sasuser.TireDesign Results;
run;

proc print;
   title2 'Data Set for Conjoint Analysis';
run;

title 'Tire Study, Individual Conjoint Analyses';

* Fit Each Subject Individually;
proc transreg data=Both utilities short outtest=utils separators='  ';
   ods select TestsNote FitStatistics Utilities;
   model identity(Subj1-Subj5) =
         class(Brand Price Life Hazard / zero=sum);
run;

The output contains two tables per subject, one with overall fit statistics and one with the conjoint analysis results.

Output 101.5.5: Conjoint Analysis

Tire Study, Data Entry, Preprocessing
Data Set for Conjoint Analysis

Obs Brand Price Life Hazard Subj1 Subj2 Subj3 Subj4 Subj5
1 Pirogi $79.99 50,000 No 2 1 2 1 1
2 Machismo $79.99 60,000 No 3 2 6 8 3
3 Machismo $74.99 70,000 Yes 3 4 2 4 3
4 Machismo $74.99 50,000 No 2 4 1 3 3
5 Goodstone $74.99 60,000 Yes 7 6 8 9 7
6 Pirogi $69.99 60,000 Yes 9 7 9 6 9
7 Goodstone $69.99 50,000 Yes 7 8 4 3 7
8 Machismo $69.99 50,000 Yes 6 8 5 7 7
9 Pirogi $74.99 60,000 Yes 6 5 6 5 5
10 Pirogi $74.99 50,000 No 5 3 5 3 5
11 Goodstone $79.99 60,000 No 2 4 3 6 2
12 Goodstone $69.99 70,000 No 6 9 4 4 6
13 Pirogi $79.99 70,000 Yes 3 1 2 1 2
14 Goodstone $74.99 70,000 No 7 6 7 8 6
15 Machismo $69.99 60,000 No 6 8 5 7 7
16 Machismo $79.99 70,000 Yes 4 2 7 7 4
17 Pirogi $69.99 70,000 No 9 7 9 5 9
18 Goodstone $79.99 50,000 Yes 3 4 4 4 3


Tire Study, Individual Conjoint Analyses

The TRANSREG Procedure


The TRANSREG Procedure Hypothesis Tests for Identity(Subj1)

Root MSE 1.34164 R-Square 0.8043
Dependent Mean 5.00000 Adj R-Sq 0.6674
Coeff Var 26.83282    

Utilities Table Based on the Usual Degrees of Freedom
Label Utility Standard Error Importance
(% Utility
Range)
Variable
Intercept 5.0000 0.31623   Intercept
Brand Goodstone 0.3333 0.44721 20.833 Class.BrandGoodstone
Brand Pirogi 0.6667 0.44721   Class.BrandPirogi
Brand Machismo -1.0000 0.44721   Class.BrandMachismo
Price $69.99 2.1667 0.44721 54.167 Class.Price_69_99
Price $74.99 0.0000 0.44721   Class.Price_74_99
Price $79.99 -2.1667 0.44721   Class.Price_79_99
Life 50,000 -0.8333 0.44721 16.667 Class.Life50_000
Life 60,000 0.5000 0.44721   Class.Life60_000
Life 70,000 0.3333 0.44721   Class.Life70_000
Hazard Yes 0.3333 0.31623 8.333 Class.HazardYes
Hazard No -0.3333 0.31623   Class.HazardNo

Tire Study, Individual Conjoint Analyses

The TRANSREG Procedure


The TRANSREG Procedure Hypothesis Tests for Identity(Subj2)

Root MSE 0.56765 R-Square 0.9710
Dependent Mean 4.94444 Adj R-Sq 0.9506
Coeff Var 11.48049    

Utilities Table Based on the Usual Degrees of Freedom
Label Utility Standard Error Importance
(% Utility
Range)
Variable
Intercept 4.9444 0.13380   Intercept
Brand Goodstone 1.2222 0.18922 25.658 Class.BrandGoodstone
Brand Pirogi -0.9444 0.18922   Class.BrandPirogi
Brand Machismo -0.2778 0.18922   Class.BrandMachismo
Price $69.99 2.8889 0.18922 65.132 Class.Price_69_99
Price $74.99 -0.2778 0.18922   Class.Price_74_99
Price $79.99 -2.6111 0.18922   Class.Price_79_99
Life 50,000 -0.2778 0.18922 7.895 Class.Life50_000
Life 60,000 0.3889 0.18922   Class.Life60_000
Life 70,000 -0.1111 0.18922   Class.Life70_000
Hazard Yes 0.0556 0.13380 1.316 Class.HazardYes
Hazard No -0.0556 0.13380   Class.HazardNo

Tire Study, Individual Conjoint Analyses

The TRANSREG Procedure


The TRANSREG Procedure Hypothesis Tests for Identity(Subj3)

Root MSE 2.48104 R-Square 0.3902
Dependent Mean 4.94444 Adj R-Sq -0.0367
Coeff Var 50.17832    

Utilities Table Based on the Usual Degrees of Freedom
Label Utility Standard Error Importance
(% Utility
Range)
Variable
Intercept 4.9444 0.58479   Intercept
Brand Goodstone 0.0556 0.82701 18.261 Class.BrandGoodstone
Brand Pirogi 0.5556 0.82701   Class.BrandPirogi
Brand Machismo -0.6111 0.82701   Class.BrandMachismo
Price $69.99 1.0556 0.82701 31.304 Class.Price_69_99
Price $74.99 -0.1111 0.82701   Class.Price_74_99
Price $79.99 -0.9444 0.82701   Class.Price_79_99
Life 50,000 -1.4444 0.82701 41.739 Class.Life50_000
Life 60,000 1.2222 0.82701   Class.Life60_000
Life 70,000 0.2222 0.82701   Class.Life70_000
Hazard Yes 0.2778 0.58479 8.696 Class.HazardYes
Hazard No -0.2778 0.58479   Class.HazardNo

Tire Study, Individual Conjoint Analyses

The TRANSREG Procedure


The TRANSREG Procedure Hypothesis Tests for Identity(Subj4)

Root MSE 1.90321 R-Square 0.6185
Dependent Mean 5.05556 Adj R-Sq 0.3514
Coeff Var 37.64598    

Utilities Table Based on the Usual Degrees of Freedom
Label Utility Standard Error Importance
(% Utility
Range)
Variable
Intercept 5.0556 0.44859   Intercept
Brand Goodstone 0.6111 0.63440 36.885 Class.BrandGoodstone
Brand Pirogi -1.5556 0.63440   Class.BrandPirogi
Brand Machismo 0.9444 0.63440   Class.BrandMachismo
Price $69.99 0.2778 0.63440 12.295 Class.Price_69_99
Price $74.99 0.2778 0.63440   Class.Price_74_99
Price $79.99 -0.5556 0.63440   Class.Price_79_99
Life 50,000 -1.5556 0.63440 49.180 Class.Life50_000
Life 60,000 1.7778 0.63440   Class.Life60_000
Life 70,000 -0.2222 0.63440   Class.Life70_000
Hazard Yes 0.0556 0.44859 1.639 Class.HazardYes
Hazard No -0.0556 0.44859   Class.HazardNo

Tire Study, Individual Conjoint Analyses

The TRANSREG Procedure


The TRANSREG Procedure Hypothesis Tests for Identity(Subj5)

Root MSE 1.36219 R-Square 0.8162
Dependent Mean 4.94444 Adj R-Sq 0.6875
Coeff Var 27.54987    

Utilities Table Based on the Usual Degrees of Freedom
Label Utility Standard Error Importance
(% Utility
Range)
Variable
Intercept 4.9444 0.32107   Intercept
Brand Goodstone 0.2222 0.45406 9.023 Class.BrandGoodstone
Brand Pirogi 0.2222 0.45406   Class.BrandPirogi
Brand Machismo -0.4444 0.45406   Class.BrandMachismo
Price $69.99 2.5556 0.45406 67.669 Class.Price_69_99
Price $74.99 -0.1111 0.45406   Class.Price_74_99
Price $79.99 -2.4444 0.45406   Class.Price_79_99
Life 50,000 -0.6111 0.45406 15.789 Class.Life50_000
Life 60,000 0.5556 0.45406   Class.Life60_000
Life 70,000 0.0556 0.45406   Class.Life70_000
Hazard Yes 0.2778 0.32107 7.519 Class.HazardYes
Hazard No -0.2778 0.32107   Class.HazardNo

The next steps summarize the results. Three tables are displayed, showing the following: all of the importance values, the average importance, and the part-worth utilities. The first DATA step selects the importance information from the UTILS data set. The final assignment statement stores just the variable name from the label, relying on the fact that the separator is two blanks. PROC TRANSPOSE creates the data set of importances, one row per subject, and PROC PRINT displays the results. The MEANS procedure displays the average importance of each attribute across the subjects. The next DATA step selects the part-worth utilities information from the UTILS data set. PROC TRANSPOSE creates the data set of utilities, one row per subject, and PROC PRINT displays the results. The following statements produce Output 101.5.6:

title 'Tire Study Results';

* Gather the Importance Values;
data Importance;
   set utils(keep=_depvar_ Importance Label);
   if n(Importance);
   label = substr(label, 1, index(label, '  '));
run;

proc transpose out=Importance2(drop=_:);
   by _depvar_;
   id Label;
run;

proc print;
   title2 'Importance Values';
run;

proc means;
   title2 'Average Importance';
run;

* Gather the Part-Worth Utilities;
data Utilities;
   set utils(keep=_depvar_ Coefficient Label);
   if n(Coefficient);
run;

proc transpose out=Utilities2(drop=_:);
   by _depvar_;
   id Label;
   idlabel Label;
run;

proc print label;
   title2 'Utilities';
run;

Output 101.5.6: Summary of Conjoint Analysis Results

Tire Study Results
Importance Values

Obs Brand Price Life Hazard
1 20.8333 54.1667 16.6667 8.33333
2 25.6579 65.1316 7.8947 1.31579
3 18.2609 31.3043 41.7391 8.69565
4 36.8852 12.2951 49.1803 1.63934
5 9.0226 67.6692 15.7895 7.51880


Tire Study Results
Average Importance

The MEANS Procedure

Variable N Mean Std Dev Minimum Maximum
Brand
Price
Life
Hazard
5
5
5
5
22.1319800
46.1133697
26.2540671
5.5005832
10.2301014
23.7391251
18.0547195
3.6989117
9.0225564
12.2950820
7.8947368
1.3157895
36.8852459
67.6691729
49.1803279
8.6956522

Tire Study Results
Utilities

Obs Intercept Brand Goodstone Brand Pirogi Brand Machismo Price $69.99 Price $74.99 Price $79.99 Life 50,000 Life 60,000 Life 70,000 Hazard Yes Hazard No
1 5.00000 0.33333 0.66667 -1.00000 2.16667 0.00000 -2.16667 -0.83333 0.50000 0.33333 0.33333 -0.33333
2 4.94444 1.22222 -0.94444 -0.27778 2.88889 -0.27778 -2.61111 -0.27778 0.38889 -0.11111 0.05556 -0.05556
3 4.94444 0.05556 0.55556 -0.61111 1.05556 -0.11111 -0.94444 -1.44444 1.22222 0.22222 0.27778 -0.27778
4 5.05556 0.61111 -1.55556 0.94444 0.27778 0.27778 -0.55556 -1.55556 1.77778 -0.22222 0.05556 -0.05556
5 4.94444 0.22222 0.22222 -0.44444 2.55556 -0.11111 -2.44444 -0.61111 0.55556 0.05556 0.27778 -0.27778

Based on the importance values, price is the most important attribute for some of the respondents, but expected tread life is most important for others. On the average, price is most important, followed by expected tread life and brand. Road hazard insurance is less important. Each of the brands is preferred by some of the respondents. All respondents preferred a lower price over a higher price, a longer tread life, and road hazard insurance.



[37] In real conjoint experiments, real brand names would be used.