Summary of Recent Unpublished Research
John Rust, Yale University
April, 2001

The purpose of this note is to describe, selectively, a number of unpublished working papers that are not listed on my vita.

Before I describe these projects, I would like to say a few words about how I classify myself. The economics profession likes to narrowly classify people by field, but I resist such classification. I do econometrics but I do not consider myself an econometrician, I use economic theory but I am not a theorist, and I do empirical work but most people would not classify me as an empirical economist. As for field of interest, it also hard to classify me: I have worked on dynamic models of labor supply but I am not a labor economist, I have worked on social security and retirement but I am not a public economist, and I have written theoretical papers on durable goods, planned obsolescence, and market microstructure, but I am not an IO economist. A substantial part of my research involves theoretical and applied computation (I am editorial board of the computer science journal Complexity, a coeditor of the Handbook of Computational Economics, and was part of a group of economists who founded the Society for Computational Economics), yet I do not classify myself as a ``computational economist''. If fact, I am interested in all of these areas, and use a broad range of theoretical, econometric, and computational tools in my research. I think the best description of what I do is ``applied micro'' with a special focus on dynamic economic modeling and computational methods. I may pay a price of ``falling between the cracks'' and not being regarded as the preminent figure in any particular narrowly defined field, but I feel this cost is more than offset by the ``arbitrage profits'' afforded by the academic freedom of being able to cross field and disciplinary boundaries and work on promising ideas without worrying about whether the project is in my ``area''. My main constraint in taking on projects is that I am fairly practically oriented. Although I love it when I can develop, learn, and use new mathematical or computational tools in my research, I think it is important not to let the love of tools drive the type of research I do. Thus, I almost always work on projects that are likely to provide useful insights about the world in the here and now. In particular, the type of empirical work and economic modeling that I do will rarely be regarded as merely descriptive, or as theory for ``theory's sake''. I think good economic models should give us insights and quantitative guidance to help society design better institutions and policies. On the other hand, I do not regard myself as a ``policy guy'' and I try not to let my political/moral views direct the type of research I do. Thus, my interest is to develop tools to help others make better decisions and policies, and in the process, contribute to economic science.

1.
Econometrics monograph: I have been slowly accumulating material for a monograph entitled Stochastic Decision Processes: Theory, Computation, and Empirical Applications. This monograph will summarize the work I have been doing in this area over the last 15 years. In the appendix I repeat material I wrote on a summary of my research in this area when I came to Yale in 1996 which further expands on my contributions in this area, which has been the main unifying element connecting the different applied research topics I have worked on. The monograph will be based on a chapter I wrote for the volume 4 of the Handbook of Econometrics ``Structural Estimation of Markov Decision Processes'' and a chapter I wrote for the Handbook of Computational Economics ``Numerical Dynamic Programming in Economics'' and will include a number of empirical applications including the work on speculation in the steel market and modeling social insurance at the end of the life cycle that I describe below. It will also be linked to a web site that will freely distribute computer code for the various algorithms and estimation methods I describe in the monograph, including the recent advances in random multigrid algorithms that I introduced in my 1997 Econometrica paper `` Using Randomization to Break the Curse of Dimensionality'' and the parametric policy iteration methods that I describe in my current research on computational methods below. Although I intended to complete the manuscript during my sabbatical at Yale last year, several new research opportunities crossed my path and I felt it was better to delay the monograph in order to get these projects underway. Also, there have been important recent developments on estimation and solution of dynamic programming models (including new results from my work on the ``steel project'' that I describe below) that I felt it was worthwhile to delay the monograph so I would have time to fully develop these topics and include them in the monograph. I currently expect that the manuscript will be completed and ready to send to a publisher by the end of 2002.

2.
The ``Steel Project'': This project represents an unexpected detour in my main line of research, which has focused on computational methods, development of better econometric methods for estimation of stochastic decision processes, and empirical applications of these methods to retirement and Social Security (described in item 3 below). The ``detour'' resulted from a random fortuitous encounter with a businessman in a bar in New Haven. This businessman, John H., is the general manager of the New England division of a large steel company that is owned by a single very wealthy individual who owns 12 similar plants located around the U.S. The New England plant that John manages is known as a ``steel service center'': it does not produce steel, but rather serves as a ``middle man'' between steel producers and steel consumers. The firm purchases large quantities of steel in the wholesale market and stores it for later resale to retail customers at a markup. John H. has given me access to a monthly data feed on all of the transactions made by his company, including the transaction dates, prices, and quantities of each of the 2200+ steel products it buys and sells and the identities of each of its customers and suppliers. This is a unique dataset: to my knowledge there is no comparable dataset on the operations of on ongoing business operation, and in particular, there is no other high frequency data set where pricing and inventory decisions can be examined at this level of detail. It was clear to me that analysis of these data could lead to important new contributions to inventory theory, theory of speculation, the theory of price discrimination, and the overall industrial organization of markets. For this reason I decided to actively pursue this opportunity, even at the cost of delaying my monograph and my work on Social Security. However ex post I feel this decision was a good one, and has already been justified by a number of different scientific contributions (both empirical and theoretical) that have emerged from our analysis of these data.

This is a particularly interesting period to be studying the steel market. The aftermath of the Asian crisis of 1997 lead to a collapse in worldwide demand for steel during 1998 when financially strapped countries such ast Brazil, Russia and Japan started pouring huge quantities of steel onto the world market at record low prices. This resulted in record high levels of steel imports, and ultimately to protectionist response in the U.S. when large antidumping duties were levied on Japan and Brazil and quotas imposed on Russia in early 1999. With efficient mini mills such as Nucor continuing to drive down the cost of steel, and the continuing impact of the attempts by steel service centers to liquidate the large volumes of steel acquired in 1998, 1999 and 2000, we are still seeing a continued reduction in steel prices in the U.S. However as the high inventories are sold, the high cost of energy may start to raise the cost of production of steel in the U.S. but when steel prices will start to recover is uncertain due to the effects of the U.S. recession. Already we are seeing many of the older and weaker steel producers such as LTV file for bankruptcy, which is likely to result in a more concentrated and less competitive production sector. All of these events are being factored into the calculations of steel speculators as well as producers and consumers of steel. A final important development is the advent of online trading of commodities over the world wide web. The steel market has been somewhat behind the times in this area, but over the next few years we expect some dramatic changes in the structure of trading of steel. For all of these reasons I think this is a particularly opportune period of time to be studying the steel market, and to have the privilege of getting such detailed data on the operations of one of the service centers that collectively intermediate nearly 50% of all trade in the steel market.

I realized that the large number of different topics that would arise from this project makes it essential to have the help of coauthors and graduate students. I enlisted my colleague George Hall to work on the project since he has expertise in macroeconomics and inventory theory (areas I know little about), and George has also shown an interest in more micro-oriented studies of paticular firms and industries (George's thesis focused on production and inventory decisions by automobile producers). Working with George has been one of the most positive research experiences of my career and most of the key results could not have been obtained without his help. George and I did an initial exploratory analysis of the steel data that was published in the 1999 Carnegie-Rochester series on Public Policy, ``An Empirical Model of Inventory Investment by Durable Commodity Intermediaries''. We also applied for and were awarded a 3 year $300,000 National Science Foundation grant on ``Empirical Models of Inventory Investment and Price Discrimination by Durable Commodity Intermediaries''. Currently the ``steel project'' has grown to include 2 Yale graduate students ( Rupa Athreya and Hiu Man Chan), and 3 additional Yale faculty members (Herbert Scarf, Truman Bewley, and William Brainard). We have made numerous visits to the New England steel plant to discuss their operations with the general manager and the owner, and have been invited by them to participate in steel trade seminars that the general manager organizes. We are currently in the process of writing a jointly authored paper, ``Intermediation in the Steel Market'' that summarizes the work done to date. I expect that this paper will eventually evolve into a monograph on the steel market.

Our initial descriptive empirical work on the steel microdata lead us to the conclusion that the standard versions of the LQ-inventory model and the classical (S,s) inventory model would not be able to explain the patterns we observe in our daily level microdata. The LQ model does not account for the binding constraint that purchases of steel cannot be negative. In the microdata purchases of steel are zero for most days and only occasionally do we observe large positive purchases. Also, most versions of the LQ model posit that the firm's objective is to minimize the deviations about an exogenously specified inventory/sales ratio. Our analysis suggests that the optimal trading strategy of a speculator would never imply a fixed time-invariant inventory/sales ratio: if the speculator is trying to profit from ``buying low and selling high'' then inventory/sales ratios should be high when wholesale prices are low and low when wholesale prices are high. We also rejected the simple version of the (S,s) inventory model which was developed in part by our Yale colleague Herbert Scarf. In the simplest version of the (S,s) theory the firm has a desired or target inventory level S and an order threshold s such that whenever current inventory is lower than s the firm orders an amount sufficient to bring inventories up to the desired level S. A cursory visual inspection of the inventory time series reveals that the standard (S,s) model is incapable of explaining the data. Instead of returning inventories to the same level S whenever the firm orders, we observe widely varying levels of inventory following new purchases of steel at different times of the year (reflecting seaonal effects on demand) and at different steel prices (reflecting the firm's speculative motive). Also, we observe that the firm makes new orders for steel at widely different levels of inventory: sometimes no orders are placed until there is a stockout (which would suggest s=0) but other times we observe new orders even though the firm has millions of pounds in current inventories for a given type of steel.

So we proceeded in a very natural way by formulating a new theory of the firm's behavior that is based on a dynamic programming model that we believed best approximated the actual decision problem that this firm was attempting to solve. We describe the firm's activities as ``speculation'' since it does face serious uninsurable risks that movements in the steel market may result in losses if delays in selling the steel it bought in the wholesale market is sold ex post at lower prices in the retail market. This is exactly what occasionally happens for some of the steel products that John H's firm deals in.

Initially we used a ``calibration'' approach of making reasonable guesses for some of the unknown parameters of the firm's profit function and John H's beliefs about the dynamics of wholesale prices for different types of steel. We found that upon solving this model numerically, we were able to duplicate the basic stylized facts about the firm's behavior, including ``multiple regimes'' consisting of long periods of very low inventory holdings (including occasional stockouts) followed by extended periods of extremely high inventory holdings. In either regime purchases on any given day are usually zero but occasionally we observe positive purchases. There is a huge variations in the size of purchases when they occur, corresponding to a common but counterintuitive finding in the inventory literature about the relative volatility of production (purchases) and sales (if we regard purchases as being this firm's ``production'' then the observation that ``purchases are more volatile than sales'' corresponds to the common observation that ``production is more volatile than sales''). In order to solve the firm's optimal pricing, trading, and inventory holding problem numerically we encountered a number of very challenging computational hurdles since this is an infinite horizon dynamic programming problem with at least 2 continuous state variables (wholesale price pt and current inventory holdings qt) and two continuous decision variables (the firm's optimal retail price prt and the optimal level of new orders of steel inventory qot). Furthermore we needed to account for the constraint that $q^o_t \ge 0$ which is almost always a binding one due to the fact that the firm only occasionally places large positive orders and on most days orders nothing. My work on an improved algorithm to solve this problem lead me to develop and implement a very promising new solution method I call ``parametric policy iteration'' that I will describe below in more detail.

What we realized from tabulations of the numerical DP solutions that the firm's optimal trading strategy could be described as a generalized (S,s) rule. That is, there exist functions S(p,x) amd s(p,x) with $S(p,x) \ge s(p,x)$ such that if the firm's current inventory q is less than s(p,x) it is optimal for it to order an amount qo(p,q,x)=S(p,x)-q to bring its inventory level up to the desired level S(p,x). If q < s(p,x) then it is optimal not to order, i.e. qo(p,q,x)=0. These functions depend on the current wholesale price p and a vector x of other variables that affect the firm's beliefs about future sales and costs including ``macro shocks'' (e.g. industrial production indices and indices indicating overall steel prices, etc.) and current interest rates (which affects the firm's interest opportunity costs for holding steel). We also observed that when the fixed costs of placing an order K is zero we had S(p,x)=s(p,x), otherwise whenever s(p,x) > 0 we found that S(p,x) is strictly greater than s(p,x). However when we presented our work at a seminar at CREST in Paris, France, our discussant, Guy Laroque (an expert in inventory theory and commodity pricing) claimed that our numerical results could not be right and that the optimal trading strategy would not be of the generalized (S,s) form. However in a subsequent theoretical paper, ``The (S,s) Rule is an Optimal Trading Strategy in a Class of Commodity Price Speculation Problems'', we were able to prove that the generalized (S,s) rule suggested by our computational results is in fact an optimal trading strategy in a fairly wide class of speculative trading dynamic inventory problems. We proved the result by providing a sufficient condition under which the value function of the firm's dynamic programming problem has the property of K-concavity (a function f(q) is K-concave if for all $q,z,b \in R$ satisfying $0 \le q-b \le q \le q+z \le \overline q$ we have: $ f(q+z) - K \le f(q) + z[ f(q) - f(q-b)]/b$). Following original work by Herbert Scarf (who was the first to define the analogous property of K-convexity and use it to prove the optimality of the (S,s) rule in the simpler context of minimizing costs of holding inventory in a problem without any Markovian cost shocks) we proved that if the value function V(p,q,x) is K-concave in q for any fixed values of (p,x) then the generalized (S,s) rule is an optimal trading strategy for a commodity price speculator. Our new contribution was a fairly general sufficient condition for the K-concavity property to hold: we call it the no expected loss condition. A strong form of this condition is that the retail price on day t always exceeds the expected discounted procurement price on day t+1, $p^r_t \ge \beta
E\{p_{t+1}\vert p_t,x_t\}$, where $\beta \in (0,1)$ is the speculator's discount factor. This is an extremely natural and intuitive condition that we would expect to be automatically satisfied by profit maximizing speculators: it states that no speculator would sell a unit of inventory for an amount less than the expected discounted cost of replacing that unit on the next trading day. We believe this represents an important new theoretical contribution that links formerly separate liteatures on the theory of speculation and optimal inventory theory. This paper was very well received when we presented it last year at the MIT Operations Research seminar and one of the editors of the journal Operations Research invited us to submit it for consideration for publication. We are currently extending the paper to allow endogenous determination of the retail price prt in which case we believe we can derive the no expected loss condition as a consequence of expected profit maximization and prove our results for an even wider class of problems than has ever been considered in the literature. We expect to be submitting the paper to Operations Research shortly.

Having solved important computational and theoretical problems that allows us to characterize and numerically compute optimal dynamic speculative trading strategies, and having found that our initial calibrations of these trading strategies were very promisingly realistic in the sense of displaying the key qualitative and quantitative characteristics of the actual time series patterns of trading, prices, and inventories that we observe in John H's steel data, our next step was to develop the econometric methods for estimating and testing the model more comprehensively. There are two key ``unknowns'' that we need to estimate econometrically: 1) John H.'s beliefs about the law of motion for steel prices in the wholesale market (which we represent as a Markov transition probability $g(p_{t+1},x_{t+1}\vert p_t,x_t,\theta_1)$ governing the vector Markov process $\{p_t,x_t\}$ where pt is the wholesale price of a particular steel product and xt is a vector of macro varaiables such as industrial production indices, unemployment rates, interest rates, etc. that are correlated with and useful for forecasting future steel prices), 2) John H's beliefs about the demand for steel at his New England plant (which we represent by a conditional probability distribution $H(q^r_t\vert p^r_t,p_t,x_t,\theta_2)$ which gives the conditional probability distribution of retail sales qrt on trading day tconditional on the retail sales price prt, the wholesale price ptand the other macro variables, xt). Our objective is to find values of the unknown parameters $\theta=(\theta_1,\theta_2)$ characterizing the key objects determining John H's trading strategy for steel by finding values for which the implied optimal trading strategy (computed from our dynamic programming model) ``best fits'' the observed sequence of purchases, sales, retail and wholesale prices, and inventory levels for each of the 2200+ steel products that John H's firm trades. Note that the conditional distribution H constitutes a stochastic downward sloping demand function for steel, in the sense that the conditional expectation of demand for steel, $E\{q^r_t\vert p^r_t,p_t,x_t\}=\int q^r_t H(dq^r_t\vert p^r_t,p_t,x_t)$ is a downward sloping function of the retail price prt that John H sets, reflecting his local market power due to informational problems and search costs and the relatively high transportation costs for steel. This demand is complicated by having a mass point at 0, reflecting the fact that on given business day there is a positive probability of zero retail sales. The straightforward way to approach the problem is to estimate the unknown parameters $\theta$ by the method of maximum likelihood. However this presents several major problems. First, deriving the likelihood function is quite complicated due to the fact that the stochastic process for purchases, sales and inventory levels has discrete and continuous components. For example as noted above, on any given day there is positive probability that purchases and sales will be zero, but also positive probability of a stockout, i.e. of receiving a retail demand that would exceed current stock on hand, resulting in inventory level of qt+1=0 on the next trading day. The second problem is that the likelihood function governing the full data, $\{p^r_t,p_t,q^s_t,q^o_t,x_t\}$ (where $q^s_t=\min[q_t+q^o_t,q^r_t]$ is actual retail sales, the smaller of retail demand qrt and stock on hand after orders are made, qt+qot) depends on the solution to the full dynamic programming problem. Thus it is not possible to express the likelihood function in closed form, but rather a ``nested fixed point'' algorithm similar to the algorithm I pioneered in my earlier work described in the appendix must be employed. The nested fixed point is the contraction fixed point representation of the infinite horizon dynamic prorgramming problem. Our computational advances allow us to solve this problem rapidly, but it still must be solved many hundreds of times for the various trial values of $\theta$ that are encountered in the search for the the maximum likelihood estimates $\hat\theta$.

However the final problem confronting us was the most daunting: John H's firm only records the wholesale prices pt and retail prices prt on the days that the firm purchases and sells steel, respectively. We have described this as an endogenous sampling problem since failing to account for it can lead to serious biases in the estimation procedure. It is easy to see why this problem arises: since the firm's objective is to buy low and sell high, if we form an average of the prices pt only on the days where the firm buys steel, we will get an under-estimate of the true long run average price of steel. Similarly, since the firm is trying to sell high, if we form an average of the retail prices using only the prices prt on the days the firm sells steel, it will overestimate the average retail price that John H. could ordinarily expect to sell at. Of the two problems, the problem of endogenous sampling of wholesale prices is far more serious since purchases occur much less frequently than sales. A natural solution to this problem would be to augment John H's microdata with market data on the wholesale prices of steel on the days John H's firm did not buy steel. Unfortunately, this is not possible since there is no market for steel similar to ones that exist for other commodities such as wheat, aluminum ingot, gold, or silver. Instead the steel market operates entirely as a ``telephone market'' where transaction prices between retail customers and steel middle men, and between steel middle men and steel producers are privately negotiated and not recorded in any centralized exchange (below I describe a theoretical paper that we recently completed that attempts to explain why an exchange market for steel does not exist). In any event the econometric problem of endogenous sampling defied an easy solution. The straightforward approach is to form a marginal likelihood function that ``integrates out'' the missing wholesale price information on the days that we do not observe John H's firm purchasing steel. However out of the over 1000 business days we have data on the firm's transactions, we observe purchases on individual steel products on only a small subset of days, say 40 to 60 days for most steel products. Thus, the implied dimensionality of integration to to full maximum likelihood ( 1000 - 60=940) is extremely large, and the region of integration is irregular since it involves integrating in a region of the state space where it is not optimal to order, i.e. for (p,q,x) state combinations where $q > s(p,x,\theta)$ where $s(p,x,\theta)$ is the optimal order threshold corresponding to a dynamic program with parameter $\theta$ described previously. While I have been able to provide sufficient conditions under which such ``ultra high dimensional'' numerical integration is both possible and can be done at essentially uni-dimensional rates when the ``effective dimension'' of the problem is low (this work is forthcoming in a recently accepted Econometrica paper with computer scientists Joseph Traub and Henryk Wozniakowski), it is hard to verify analytically that our likelihood statisfies the regularity conditions in our forthcoming Econometrica paper. While George Hall and I plan to try to implement direct maximum likelihood estimation using these ``low discrepancy'' quasi monte carlo methods, we discovered an alternative much simpler estimation method that we call ``simulated minimum distance estimation'' (SMD). We have verified that this new estimation procedure not only is computationally feasible, but it also works (both theoretically in terms of its asymptotic properties, and in practice in monte carlo studies of the actual numerical performance of the estimator, where we generated artificial data from the model and verified that the SMD estimator converges to values very close to our choices of the ``true'' parameter values $\theta^*$. This work is described in our paper, ``Econometric Methods for Endogenously Sampled Time Series: The Case of Commodity Price Speculation in the Steel Market''.

The basic idea underlying the SMD estimator is elegant and almost beautifully simple. Using the actual data on prices, purchases, inventories, etc. we form a set of ``sample moments'' such as the mean purchase price of steel, the average markup over the historical average purchase price and the current retail price, the average size of inventory purchases, the average inventory holdings, and so forth. Note that some of these moments, such as the average purchase price of steel, will be biased and inconsistent estimators of the true ergodic population moments (i.e. the ``long run average'' purchase price of steel in the wholesale market) due to the endogenous sampling problem described above. However the beauty of the estimator is that we can simply ignore these problems and still obtain a consistent and asymptotically normal estimator for the structural parameters of interest $\theta$. The idea is that once we solve the DP problem and uncover the optimal dynamic trading strategy for any trial value of the unknown parameters $\theta$, we can simulate the model and censor and endogenously sample the simulated data in exactly the same way as the actual data are censored and endogenously sampled, i.e. we throw away the simulated wholesale prices of steel on the days that the simulated trading strategy does not make purchases of steel. Thus some of the simulated moments will also be biased and inconsistent estimates of the corresponding ergodic moments of the process, but the key point is that both the actual sample moments and the simulated moments will be biased in symmetrical ways so that an estimator that is formed by choosing values $\hat\theta$ that minimizes a quadratic form in the difference between the simulated and sample momemnts can be shown to be consistent and asymptotically normally distributed. The estimator is similar in many respects to the ``method of simulated momemts'' (MSM) estimator pioneered by the recent Nobel prize winner Daniel McFadden (my thesis advisor at MIT), and the asymptotic properties of the SMD estimator is similar to the MSM estimator (although the former is derived in a time series context and the latter in a cross sectional context). In particular, the SMD estimator can be shown to be consistent and asymptotically normal using only a single time series realization of the trading process to form simulated moments. The cost to doing only one simulation of the time series of trading behavior (as opposed to averaging the simulated moments from S independent time series realizations) is that the covariance matrix for the $\theta$ parameters is multiplied by a factor (1+1/S). Thus, when S=1 the penalty to doing only a single simulation is to double the variances of the parameter estimates compared to a situation where $S=\infty$ which would essentially be equivalent to doing exact numerical integration to form the moments implied by the DP solution.

We believe that this paper allows us to overcome the remaining important obstacle to estimating our structural model of optimal dynamic speculative trading in the steel market. While other researchers have advocated similar simulation estimation procedures in other contexts (in addition to the MSM estimator, a group of French econometricians Gourieroux, Monfort have proposed a related method that they call ``Indirect Inference'' and a number of different econometrician in the U.S. (Gallant, Tauchen, J.F. Richard, Anthony Smith, Duffie and Singleton, Ingram and Lee and others) have proposed related methods under the names ``efficient method of moments'' and ``simulated method of moments'' and other names. I believe our new contribution to the econometrics literature is that leading time series researchers such as Robert Engle and others who work on financial time series data do not seem to have been aware of the endogenous sampling problem (even though it appears to be endemic to a variety of irregularly and endogenously sampled financial and commodity price series), and our use of the simulated minimum distance principle to solve the endogenous sampling problem appears to be new. We now realize, following the adage of J.F. Richard, ``If you can simulate it, you can estimate it.'' I believe the SMD estimator can be used in a variety of other contexts to allow us to estimate much more realistic econometric models than we have been previously able to estimate. This includes in microeconometric contexts estimation of much richer models of reporting bias and non-response. I am embarking on a new project in this direction with my Yale colleague Michael Keane, and applying the method in the estimation of my work on disability and retirement using the Health and Retirement Survey data where issues of non-reporting or partial (bracketed) reporting of items such as assets, consumption, labor supply, health status and so forth have created severe challenges for previous estimation approaches. I believe that the SMD estimation principle will allow us to estimate much richer and more realistic models than have been possible to estimate in the past.

In the meantime, once we complete the estimation of our dynamic structural model of speculation in the steel market, we plan to encode the estimated model as an ``expert system'' on a laptop computer that we will deliver to John H. We believe this will provide the most demanding ``specification test'' of the validity of the model: John H will be able to enter information he receives on prices in the wholesale market and his current inventory levels, and the trading strategy will return a recommendation about whether or not to purchase, and if so, how much. It can also provide recommended retail prices for each type of steel product that John deals in. We can have the computerized trading strategy run a simulated inventory portfolio and make simulated trades and we can compute the profits of the computerized strategy and compare them to the profits actually earned by John H and his sales force. If John H feels this model is realistic, and finds it to be a useful decision support tool, then we will consider the model to have passed the most demanding and practical ``specification test''. In the process of refining our model and our assumptions about the right functional forms for the law of motion for wholesale prices and the firm's stochastic demand function, we plan to use visual ``elicitation tools'' programmed on the laptop computer we will give to John H. For example, we can have the computer display projected future wholesale prices and 95% confidence intervals around these projections and ask John H if he thinks these projections are reasonable. We can do similar elicitation experiments to refine our specification of John's beliefs about the quantity demanded (i.e. his stochastic demand function). We believe this opportunity to work directly with the subjects we are attempting to model could be of independent interest in the economic and psychology literatures.

The latest, most recent research result to emerge from the steel project is a paper entitled ``Middle Men versus Market Makers: A Theory of Competitive Exchange''. The paper is currently pending at the Journal of Political Economy. The goal of this paper is to explain why it is that there is no trading of steel in an exchange run by market makers, and while almost all steel trade is conducted by middle men such as John H's firm in an informal ``telephone market''. Building on a 1996 Review of Economic Studies paper by Daniel Spulber, ``Market Making by Price Setting Firms'', George Hall and I constructed a dynamic model of search equilibrium in which buyers and sellers of steel have two trading options: 1) they can purchase steel from a market maker at publicly posted bid and ask prices, or 2) they can search for better prices from a steel middle man in the telephone or dealer market. Using this model, we ask under what conditions would we predict that it would be profitable for a market maker to enter in a market where the initial equilibrium has only middle men and no market makers? We prove that entry by a market maker is profitable if the market maker's transactions cost is lower than the transactions cost of the least efficient middle man in the equilibrium without market makers. In this case, entry by a market maker reduces bid/ask spreads in the dealer market, increases total quantity traded, and results in a strict Pareto improvement for a positive mass of buyers and sellers (i.e. it makes a positive fraction of buyers and sellers strictly better off compared to the initial equilibrium with only middle men and no market makers). The entry of the market maker drives out the less efficient middle men and reduces profits of the middle men who remain. Conversely, entry by competitive middle men can substantially reduce the bid ask spreads that a monopolist market maker would quote in the absence of competition from a dealer market. Thus, while a market maker that quotes publicly posted prices is arguably a more efficient type of exchange institution, we find that as long as there are sufficiently low barriers to entry that enable sufficiently efficient middle men to enter the market, the co-existence of a dealer maker with atomistic middle men and an exchange run by one or more market makers is a remarkably a robust outcome. The middle men serve as a ``competitive fringe'' that limits the market power of a monopolist market maker. Each type of trading institution competes with each other, and the beneficiaries of this competition are the buyers and sellers of the asset or commodity. We believe this model casts light on the current organization of the steel industry and provides new insights into the structure of trade in financial markets and the rapid emergence of the ``business to business'' (B2B) exchanges which are likely to transform the structure of trade in many different commodity and asset markets.

Note that in our model the market microstructure -- the division of trade between middle and market makers -- is determined endogenously. While we show that the equilibrium is unique, there are three regimes depending on values of three key parameters:

1.
The level of search costs, $\delta$

2.
The market maker's per unit transactions cost, c

3.
The most efficient middle man's transaction cost, kl

For fixed $\delta$, the three regimes depend on the values of kl and c and can be characterized as follows:

1.
If c is sufficiently large relative to kl, it is not profitable for a market maker to enter, so middle men handle all trade.

2.
If kl is sufficiently large relative to c, the market maker is able to drive all middle men out of business, so the market maker handles all trade.

3.
For intermediate values of kl and c middle men and market makers coexist. The division of trade depends on the relative values of c and kl.

We can view the steel market (with no market makers) as a case where c is high relative to kl. We can view the NYSE (where there are almost no middle men) as a case where kl is high relative to c. Actually, there has been recent entry of market makers into the steel market (e-STEEL.com and metalsite.com) but our case studies of these recent entrants suggests that they have adopted a business model that is unlikely to be successful (indeed recent evidence suggests that e-STEEL.com is failing and has sold itself to U.S. Steel under unfavorable terms to stay solvent). Our papers suggests that lack of market transparency is the reason why e-STEEL.com has done poorly. Our theoretical analysis suggests that in order to gather a large share of the trading volume and become profitable, the key is to post publicly observable bid and ask prices. e-STEEL.com does not do this. However the fact that a particular entrant such as e-STEEL.com has not been successful still doesn't solve the puzzle of why a ``real'' market maker hasn't entered the steel market. Our paper addresses this question, but offers the prediction that a market maker with a better business plan will eventually be successful in entering and transforming trade in the market for steel. Our empirical analysis of John H's transaction data shows very significant variations in prices charged to different customers for individual products in the same period of time. Although much of the variation in prices can be accounted for by location of customer relative to competitors (spatial price discrimination) and by size of purchase and past ``loyalty'' of customer (quantity discounts), nevertheless, over 30% of the variation in prices remains unexplained. We interpret this residual price variation as evidence of informational price discrimination. This suggests that there are are significant search costs and informational imperfections in the steel market. There should also be large potential efficiency gains from introducing better market institutions that can help reduce search costs. and the high level of price dispersion in the steel market. This line of reasoning also suggests that it should be profitable for a market maker to enter the steel market and post publicly observable bid/ask prices. The profits earned by the market maker is the reward for creating a better exchange institution: it comes out of the efficiency gains obtained by reducing search costs and price dispersion, and driving out inefficient middle men. Indeed, there are over 5,000 locations of ``steel service centers'' and over 88,000 people employed at these locations. We believe that with an efficient market, the number of locations and number of employees can be drastically reduced and the cost savings resulting from these reductions could be passed onto consumers and producers of steel as suggested by our theoretical model.

The next step in the steel project will be to econometrically estimate and test a detailed dynamic model of the inventory and pricing decisions of middle men in the steel market (see Hall and Rust, 1999). (It is likely that this model could be extended to provide a dynamic model of specialist behavior in financial markets.) Once this model is complete we plan to build and estimate realistic dynamic models of producers and consumers of steel. These models will allow us to develop a more realistic dynamic spatial equilibrium model of the steel market. If it is possible to develop such a model we could use it to assess the profitability and welfare gains from the entry of a market maker. However we believe this first, simple model provides some intuition and guidance about what to look for and expect in a more realistic, but inherently more complex analysis.

2.
Dynamic programming models of social insurance at the end of the life cycle. This research continues and extends my previous work on modeling retirement behavior and social insurance at the end of the life cycle initiated in my 1997 Econometrica article with Christopher Phelan, ``How Social Security and Medicare Affect Retirement Behavior in a World of Incomplete Markets''. That paper estimated the unknown parametes of individuals' utility functions for consumption versus leisure and demonstrated how Social Security interacts with incomplete markets for medical insurance and annuities and creates strong disincentives for certain ``health insurance constrained'' individuals to continue working at firms that offer employer-provided health insurance until they are 65 years old and eligible for Medicare, and thereafter the ``earnings test'' provision of Social Security creates strong disincentives to continued work. Overall, the model showed that many of the apparently artifactual features of opbserved retirement behavior, including pronounced peaks in retirement rates at age 62 and 65 can be explained by careful modeling of the strong incentives Social Security creates in interaction with patterns of incompleteness in markets for private insurance. Although the model is simplified in many respects, it remains one of the most ambitious and detailed attempts to model the impact of Social Security on retirement behavior and analyze its impact on individual welfare. The appendix describes this model in a bit greater detail.

My current plan is to relax a number of restrictive aspects of the Rust-Phelan model so as to produce an even more realistic and useful model for analyzing the welfare and distributional impacts of changes in Social Security policy. I am currently collaborating in this research with my former Yale colleague Moshe Buchinsky (currently at Brown University) and my former Phd advisee Hugo Benitez-Silva who is currently at SUNY-Stony Brook. We have written a proposal to the NIH, ``Dynamic Structural Models of Retirement and Disability'', that describes the motivation for our research in more detail. But in basic terms we propose to extend the Rust-Phelan model in the following dimensions:

1.
Estimate the DP model using all available waves of the Health and Retirement Survey (HRS). (The Rust-Phelan model was estimated on the older and now out of date Retirement History Survey based on the 1905-1912 birth cohorts whowere followed over the decade 1969 to 1979).

2.
Relax the assumption that consumption equals savings in the Rust-Phelan model and allow individuals to accumulate and decumulate assets in an optimal fashion.

3.
Expand the treatment of Social Security to include the Social Security Disability Insurance program (SSDI) and the Social Security Supplemental Security Income program (SSI).

My work on retirement behavior has attracted attention in the policy community and I have been chosen to serve on the Social Security Advisory Board's 1999 Technical Panel that reviews the assumptions underlying the Social Security's long term projections of the Social Security Trust Fund, and the Panel on Retirement Income Modeling of the Committee on National Statistics of the National Academy of Sciences. I have also served as an advisor to the Social Security Administration on the development of their MINT modeling project (Modeling in the Near Term), and I wrote a report entitled, ``Strategies for Incorporating Risk in Models of Social Insurance'' as part of a contract to the Urban Institute to advise SSA on longer term modeling strategies. I am currently an academic advisor to both the Social Security Administration and the Congressional Budget Office. In my role as an advisor I have become keenly aware of the deficiencies of current generation models that are being used in Washington. I believe that the dynamic programming models that I and others are developing could provide much better guidance, especially for some of the more radical reforms to the Social Security program that are currently being contemplated such as indidividual accounts and overhauling the structure of the disability insurance program.

I will be heavily involved in a long term ``demonstration project'' being conducted by the Social Security Administration that arose from the 1999 Ticket to Work and Work Incentives Act. This far reaching legislation gives Social Security the authority to conduct controlled experiments with current SSDI and SSI beneficiaries, changing the rules facing randomly selected individuals in the ``treatment group'' in order to determine how they respond to incentives designed to encourage them to leave the SSDI and SSI roles. The most prominent incentive is known as the ``1 for 2 offset'' which means that disabled individuals who return to work will be able to keep 1 dollar of SSDI or SSI benefits for every 2 dollars earned above some fixed set-aside amount. This is a more generous provision than the current law allows, since after a 9 month trial work period SSDI and SSI beneficiarieswill lose 100% of their benefits if they keep on working past the trial work period.

I believe my role as an academic advisor to SSA will result in unique scientific opportunities to test the predictions of the types of dynamic programming models that I have been developing in rigorous out-of-sample experimental tests. If the DP models provide accurate predictions of the behavior responses of the individuals in the treatment groups, these models will have much more credibility for use in policy analysis in Washington. (Currently simpler models are being used, and while certain agencies are using increasingly sophisticated models such as the Stock-Wise ``option value model'' that is similar in certain respects to a dynamic programming model, a fair amount of investment in human capital is required at the key agencies in order for these models to be useful. Unfortunately the current attitude in Washington is that most econometric models are incomprehensible ``black boxes'' and for this reason Congress typically mandates that policy evaluation be done via experimental methods because these are thought to be more trustworthy than predictions of econometric models). I believe that the dynamic programming models that I am developing will result in accurate predictions of the behavioral responses of the 1 for 2 offset policy and a variety of other policies, and that my position as an advisor to SSA will provide me a unique opportunity to show how these models can be used and to convince policy makers that the models are not untrustworthy black boxes. The key policy makers already recognize some decisive shortcomings of experimental methods, particularly the expense of these methods and the long time it takes to get answers. Furthermore there are some questions for which experimental methods cannot generate reliable answers at a reasonable cost. An example is the issue of ``induced entry'' resulting from the implementation of the 1 for 2 offset policy. Policymakers recognize that the 1 for 2 offset means that the DI program is more generous than it currently is, and that on the margin this will increase the incentives to apply for DI benefits. The Office of the Actuary of the Social Security Administration forecasts that even a small induced entry effect could overwhelm any cost savings from ``induced exits'' due to the 1 for 2 offset policy. However it seems reasonable to expect that the increase in application probabilities resulting from the 1 for 2 offset policy are likely to be small, and thus it would take a huge experimental sample size to produce a statistically reliable estimate of the magnitude of induced entry. On the other hand the dynamic programming models I have been developing will readily generate predictions of the magnitude (and budgetary impact) of the induced entry effect. If these models gain credibility, they could be far cheaper and faster than using experimental methods to analyze specific policy options. It seems to me that the ideal solution is to combine experimental and structural econometric methods: a few carefully chosen experiments can be designed to determine whether existing econometric methods (including DP models) are capable of providing accurate predictions of the behavior changes in the treatment group, and if the models are accurate, they can then be used to evaluate a much wider class of alternative policies.

It has been clear to economists for quite a while that the structural approach to policy analysis is a superior one, at least if the models are well identified and are accurate representations of how people actually behave. There might be some skepticism that a rational dynamic programming model could provide an accurate behavioral model for the disabled, some of which might be severely mentally handicapped. We plan to estimate a model with different ``types'' and we could include myopic or irrational individuals as some of these types, letting the data tell us what fraction of the disabled population they appear to be. The standard econometric approach to policy evaluation presumes that individuals are rational: if not, then it is less clear how to predict the welfare and behavioral impacts of policy changes. Our work will begin with the presumption that individuals are rational, but will not presume that everyone is fully rational. If we find out that the assumption of 100% rational agents is strongly rejected we will have to modify our approach to policy evaluation, using the standard approach only for the subset of rational types and then adopting some other approach for the irrational types.

Presuming that the predominant majority of individuals will be found to be best approximated as rational decision makers, I have embarked on a new theoretically oriented research project with V.V. Chari of the University of Minnesota and Hugo Hopenhayn of the University of Rochester. We are attempting to characterize and compute optimal social insurance policies. An optimal social insurance policy can be characterized mathematically as the solution to a certain class of recursively formulated mechanism design problems in which the government chooses a policy that minimizes the expected discounted cost of providing social insurance benefits to indviduals subject to the constraint that these individuals are at least as well off as under the status quo. This latter constraint can be regarded as a ``political feasibility'' consraint: individuals would not object to replacing the current disability policy with an alternative policy if they are no worse off under the new proposed policy. The expected discounted cost savings of the optimal social insurance policy enables us to quantify the degree of inefficiency in the current system. For example, in previous research, Hopenhayn and Nicolini showed that an optimal unemployment insurance policy could deliver the same expected welfare to citizenss but at 30% lower cost. During the first week of April, Hopenhayn, Chari and I met at Yale and obtained some first general characterizations of the form of optimal disability insurance policies. For example under fairly general conditions that extend the classic static results of Diamond and Mirrlees to a dynamic context with costly monitoring, we can prove that an optimal disability insurance plan involves a benefit stream that increases steadily with the age of first entry into the program, but decreases over time once an individual enters the DI roles. The latter provides incentives for individuals who recover from a disability to voluntarily leave the roles, and the former discourages individuals who are not disabled from applying for DI benefits at a young age.

In addition to the theoretical work I have been involved in empirical work to analyze the current structure of the U.S. Social Security disability application and appeal process. This is best viewed as a multistage game between individual applicants and the government since an individual has the option to appeal a denial of an application for DI benefits to increasingly higher judicial level, starting out with Administrative Law Judges (ALJ) and proceeding to the Social Security Appeals Board in Washington, then Federal Court and in rare cases, all the way to the U.S. Supreme Court. The initial application is considered by disability claim workers located in one of 54 state operated Disability Determination Services (DDS) located across the U.S. In our first descriptive empirical paper ``An Empirical Analysis of the Social Security Disability Application, Appeal and Award Process'' (published in the 1999 Labour Economics), we showed there are apparent gains from playing the ``appeals game'': since by appealing to higher levels one can increase the award rate from 50% as the initial DDS level to a 75% ``ultimate award rate'' when we also include applicants who are awarded benefits on appeal. However our empirical analysis revealed the long delays at higher appeal stages. During this long period of appeal the applicant must not be observed to earn more than the ``substantial gainful activity'' (SGA) level which is currently $700 per month. Otherwise this is regarded by SSA as prima facie evidence that the person is not disabled and result in a disqualification for benefits. From the perspective of our strategic, game theoretic analysis of the applications and appeals process, the delays constitute a ``type dependent application fee'' that helps SSA induce self-selection and improve its ability to direct benefits to the most disabled individuals. The reason is that if a person is not truly disabled, they face a substantial opportunity cost in terms of lost wages from signalling that they are disabled by not working during the entire application and appeal process. If a person is truly disabled this delay involves a welfare cost to the extent the person faces liquidity constraints (if they are ultimately awarded benefits, the benefits are awarded retroactively to the point the person is found to have become disabled), but not an opportunity cost since the person is truly unable to work. Thus, playing the ``appeals game'' is much more costly for those who are not truly disabled, and this induces self-selection: those who appeal an initial denial are much more likely to be truly disabled.

In order to analyze the classification errors involved in the disability application and appeal process, we needed a measure of ``true disability''. Unfortunately it is very hard to determine an objective standard for judging ``true disability.'' The SSA definition of disability is ``The inability to engage in any substantial gainful activity by reason of any medically determinable mental or physical impairment which can be expected to result in death or which has lasted or can be expected to last for a continuous period of at least 12 months''. The HRS survey asks respondents whether they have ``a health condition or limitation that prevents them from working entirely.'' The HRS is a privately administered survey (run by the University of Michigan Survey Research Center) and respondents are given strong guarantees of the confidentiality of their responses. Thus, we argue that their self-reported disability status should be treated as a measure of ``true disability'' since we have evidence that individuals have no incentive to exaggerate or over-report disability in the HRS survey, and an individual is in the best position to know whether or not they are able to work or not. In a paper entitled ``How Large is the Bias in Self-Reported Disability Status?'' (this paper is currently in revise and resubmit status at the Review of Economic Studies) we provide strong econometric evidence that self-reported disability status is an unbiased measure of ``true disability status'', to the extent the latter is measured by SSA's ultimate award decision (i.e. including reversals of initial rejections by the DDS for those who successfully appealed). We implement a battery of statistical tests for the unbiasedness of individual reports of disability status and are unable to reject the hypothesis that self-reported disability status and the SSA's ultimate award decision have different conditional probability distributions. However we can decisively reject the hypothesis that self-reported disability is an unbiased indicator of the DDS initial award decision. Thus our analysis suggests that the DDS is too severe in judging disability status and errs on the side of rejecting DI applicants. However since truly disabled individuals are more likely to appeal and have their initial rejections reversed at the ALJ appeal stage, which suggests that contrary to the claims of the GAO and other analyses which have suggested ALJs are too lenient by reversing many initial denials by the DDS, our results suggest the opposite: namely that the DDS are too strict and the ALJ reversals improve the overall quality of screening by letting more of the truly disabled DI applicants onto the roles without allowing a substantially larger number of non-disabled individuals to also be successful in obtaining benefits.

We are currently in the process of using these results to write a paper analyzing the multistage DI application and appeal process in a paper entitled, ``How Large are the Classification Errors in the Social Security Disability Application and Appeal Process?''. Overall, we find that approximately 20% of current DI beneficiaries are not disabled and by their own admission capable of working, but that approximately 50% of rejected DI applicants are truly disabled but are disuaded from pursuing an appeal due to the long delays, legal costs, and uncertainties in the appeal process, and due to the possibility that some of them can obtain support from other family members such as a working spouse or children who can help suppport them. We also find out that a large number of disabled people (including disabled females) never even bother to apply for DI or SSI benefits, perhaps for many of the same reasons. These are substantial classification errors and represent part of the costs and inefficiencies in the current disability application and award process. My analysis of the problems in the current system will be closely connected with my theoretical work with Hopenhayn and Chari, since this theoretical work will be focused on ways to design improved screening and monitoring systems to increase the overall efficiency of the DI system and result in better targetting of benefits to those who truly need them.

In summary, I believe that my research on retirement and the disability screening process could not only result in a new structural econometric approach to the analysis of social insurance, it could also result in practical insights and policy innovations that might actually have a chance of being implemented at SSA. However I have no particular political stand on what the ``right'' disability system might look like: instead I approach the problem from a scientific perspective and try tolet an objective analysis determine how we view the current system and think about the pros and cons of alternative systems. I will be summarizing a lot of my recent work in this area at an invited plenary presentation at the 2001 Australasian Meetings of the Econometric Society in New Zealand on ``The Econometrics of Social Insurance.'' I will also be presenting this work at special meetings at the New Zealand Treasury following the Econometric Society meetings.

4.
Research on Computational Complexity A major focus of my research has been on overcoming the ``curse of dimensionality'' of dynamic programming. This problem prevents us from solving the type of detailed, realistic models that we would like to be able to solve in any of the applied projects I have been working on, including the steel project and the Social Security disability and retirement project. I have a 1997 Econometrica paper, ``Using Randomization to Break the Curse of Dimensionality'' that proves that it is possible to circumvent this problem in certain cases, but the use of randomized methods has certain undesirable characteristics: the computed solutions using the ``random multigrid algorithm'' that I introduced in this paper can exhibit ``bumpiness'' resulting from the use of monte carlo integration methods. I have a forthcoming paper with Traub and Wozniakowski, ``Is There a Curse of Dimensionality for Contraction Fixed Points in the Worst Case?'' that proves that it is possible to break the curse of dimensionality using deterministic algorithms.

As I noted above, I havve also had very good success with an even faster algorithm I call ``parametric policy iteration''. In this algorithm I approximate the value function in each stage of the standard Howard policy iteration algorithm as a linear combination of certain well selected ``basis functions'' that have the property that a small number K of these basis functions can be used to approximate the true value function very closely. Then for any given policy, it is possible to compute an approximate value function for that policy as a simple least squares problem involving only K unknown coefficients. This results in a vast reduction of computational effort compared to traditional policy iteration in which the state space of a continuous dyanmic program is discretized into N points and the approximate value functions at each policy iteration stage require the solution of N linear equations in N unknowns. In practice, in multidimensional continuous state DP problems N must be on the order of several hundred thousand (even using my random multigrid algorithm) in order to obtain good approximations to the true value function whereas in the examples we have looked at, the parametric policy iteration algorithm produces excellent approximations when K as small as 5 to 10. I have not yet been able to provide sufficient conditions under which the parametric policy iteration algorithm can be gauranteed to converge, but I have two papers that provide numerical comparisons of these different algorithms and have given us a great deal of insight into what sorts of problems these algorithms can be effective in solving and also problems for which the method might not always converge. The paper ``A Comparison of Policy Iteration Methods for Solving Continuous-State, Infinite-Horizon Markovian Decision Problems Using Random, Quasi-random, and Deterministic Discretizations'' provides an analysis of the traditional policy iteration algorithm using alternative discretizations of the continuous state space of some illustrative dynamic programming problems. These test problems have closed form solutions which enbale me to compute the exact approximation errors for each of the methods considered. A subsequent multi-authored paper, ``A Comparison of Discrete and Parametric Approximation Methods for Continuous-State Dynamic Programming Problems in Economics'' (coauthored with my Yale colleague George Hall, Giorgio Pauletto, and two Yale grad students, Hugo Benitez-Silva and Günter Hitsch, compares the discretization methods with the parametric policy iteration method and with other ``smooth approximation'' methods including a number of different algorithms advocated by Kenneth Judd.

In related work, I am collaborating with Vassilis Hajivassiliou of the London School of Economics to evaluate the performance of simulation estimators based on ``low discrepancy sequences''. Our initial monte carlo tests indicate that low discrepancy sequences such as Sobol and Tezuka points are just as easy to compute as simulation estimators based on the traditional psuedo random number generators (such as the linear congruential number generator) but the mean square estimation errors that result from using the low discrepancy sequences are 50% lower. The reason is that standard monte carlo methods inject unnecessary additional ``noise'' into the econometric procedure. The low discrepany sequences exhibit less noise, more accurate quasi monte carlo integrals, and as a result, lower mean squared estimation error. I believe that the new simulation methods we develop and test in this paper will have broad application in applied econometrics.

5.
Analysis of Auctions I am involved in a multidisciplinary research project with faculty in the computer science, mathematics, and School of Management at Yale to develop a commodity market for cpu cycles. That is, we are attempting to develop computerized ``agents'' that will trade small blocks of cpu cycles in a microsecond basis, representing rentals of time slots on computers owned by individuals and researchers located anywhere on the internet. As part of this project, we are trying to develop algorithms that display ``intelligence'' in determining how to decompose a large computation into smaller units and to place the component computations out for bid in the market in order to meet the general objectives of the human ``principal'' on whose behalf the computation is being performed. For example, the principal might be rich but time constrained and be willing to pay any amount to get a computational result in a certain period of time, whereas others may be less impatient and more cash constrained. For the latter, the computerized agent would seek to bid for cycles in off peak times when computing prices are very low, whereas in the former case the computerized agent would attempt to spawn many concurrent subtask and place these tasks out for bid on multiple fast processors in order to solve the problem as rapidly as possible in parallel. We are right now in the processs of submitting a proposal to the National Science Foundation to fund this research. A copy of our preproposal that I drafted as Principal Investigator is available online at the CPU Auctions web site I created for this project, where we have posted our preproposal entitled ``Agent Based Distributed Computing''.

In earlier work in the 1990s I used funds from an Alfred Sloan Fellowship as prize money in a computerized double auction tournament. With subsequent NSF funding with co-PI Vernon Smith at the University of Arizona, we used the internet software we developed from the tournament to conduct a series of human vs. computer experiments in which human subjects played the roles of buyers or sellers in a dynamic trading game facing computerized opponents drawn from the computerized algorithms that were submitted to our computerized double auction experiment held at the Santa Fe Institute (this latter tournament is describe in a book I coedited with Daniel Friedman, Double Auction Markets: Theory, Institutions, and Laboratory Evidence 1993, Redwood City, CA: Addison Wesley, and also in a 1994 article in the Journal of Economic Dynamics and Control entitled ``Characterizing Effective Trading Strategies: Insights from a Computerized Double Auction Tournament''. Due to the press of other research projects, I have never fully analyzed the huge amount of experimental data on human-computer trading interactions. In research that I am planning to do with my former University of Wisconsin colleague Mahmoud El Gamal (now at Rice University), we plan to use dynamic programming and learning algorithms to see if it is possible to model the human subjects' behavior as ``best replies'' to their beliefs about the fixed strategies used by their computerized opponents, or to see if traders adopt other simpler hueristics or ``rules of thumb''. Ultimately, our objective is to provide a more comprehensive theory of trading under uncertainty in the context of simplified experimental environments. To my knowledge no economist have ever been able to solve for dynamic equilibrium trading strategies in the dynamic double auction game. Our research should take us a step closer to understanding how humans learn to trade effectively in such enviroments, and what features of their trading behavior and the institutional structure of the trading game lead to the highly efficient, near competitive equilibrium outcomes that havve been consistently observed in the experimental literature.

6.
Other Research Projects I have a number of other miscellaneous research projects that I have initiated but remain incomplete. One is joint with Mita Das at the Indian Statistical Institute, to estimate and test a dynamic model of investment and disinvestment in cement kilns by cement manufacturers. This project could ultimately be as fruitful as the steel project, however the cement companies have been much less cooperative in providing us data on their operations, perhaps in part because certain cement firms have been under threat of antitrust suits by the U.S. government. The higher quality of the steel data lead me to put this project on the ``back burner'' but I ultimately intend to complete it since it could lead to a better model of investment in lumpy, ``putty clay'' investment goods such as cement kilns than exists in the current literature on investment. I have also written several papers with Geoffrey Rothwell on nuclear power. I intend to return to this project at some point so we an complete a monograph on the nuclear power industry, in part summarizing our existing work and undertaking new work in forecasting the future of nuclear power in an environment where fossil fuels are becoming increasing non-competitive due to the greenhouse effect and other environmental problems. A final future, speculative project is one I have been discussing with my wife Deborah Minehart: to use random graph theory to try to construct a better, more realistic model of endogenous technological progress, treating ideas as separate objects or ``nodes'' on a directed graph where connections between ideas can lead to entirely new products and technologies. I would like to develop a model that could be empirically tested using information in patent and scientific article citation databases. Ultimately the goal is to produce a better model of technological change, and to understand how basic research can be appropriately ``valued'' for the links it ultimately makes to more applied ideas that result in new products and services. A final project that I would like to pursue is to obtain a deeper understanding of decentralization and how natural organisms such as the immune system and the brain and human societies are able to coordinate the actions of hundreds of millions of agents to acheive collective goals. I believe that this is a deep problem whose solution could lead to vastly more effective algorithsm for decentralized parallel processing. My initial attempts to come to grips with these issues are reflected in an unpublished paper, ``Dealing with the Complexity of Economic Calculations''.

Overall, I have enough unfinished research to keep myself busy for the rest of my career. My chief self-criticism is that I tend to get overcommitted with too many different projects and end up getting spread too thin. I want to be able to devote a sufficient amount of time to each of my projects to move them to the ``critical'' point where new discoveries are likely to occur. I have been successful in solving some hard problems already: I now feel that with much of this work completed and behind me that I will be able to do a better job of focusing on these existing unfinished research projects and monographs. My main problem will be to force myself to resist being drawn into tempting new research opportunities that continually cross my path.

Appendix: Research Summary Prepared for Yale University in 1996

Nearly all of the research that I have done after receiving my PhD in Economics at MIT in 1983 has focused on developing econometric and computational methods for understanding and predicting human decision-making over time and under uncertainty, an area I refer to as empirical models of stochastic decision processes (SDP's). Although mathematicians and statisticians have provided us with an elegant but abstract normative theory of optimal sequential decision making under uncertainty (known as statistical decision theory or stochastic control), my interest has been on the positive application of this theory to develop improved empirical models of human decision making. I have been able to solve a number of difficult problems associated with the practical implementation of this abstract theory, resulting in the development of a set of computational and econometric tools for building, estimating, and simulating fairly detailed and realistic empirical models of a wide range of economic phenomena. A considerable body of empirical research conducted by myself and others over the last decade has demonstrated that the SDP framework leads to improved understanding and more accurate quantitative predictions of individual behavior. My primary contributions have been in the following three areas:

1. development of econometric methods for structural estimation of the primitives of an SDP, namely, the individual's underlying preferences and beliefs. These estimation methods are used in conjunction with diagnostic goodness of fit tests to help us judge whether SDP models provide accurate descriptions of human behavior.

2. development of faster algorithms to allow us to compute approximate solutions to increasingly detailed and realistic dynamic choice models. The speed and accuracy of these algorithms are critical to our ability to do empirical work in this framework due to the fact SDP model is repeatedly re-solved many times for varying parameter values in a subroutine of an outer maximum likelihood estimation algorithm.

3. successful empirical applications of SDP models with particular focus on dynamic models of retirement behavior. The estimated retirement model generates accurate predictions of individual retirement behavior and provides simple explanations of several puzzling features of retirement behavior in the U.S. that previous models had been unable to explain.

Section 1 of this summary begins with a general discussion of the problems that motivated my research on SDP's, with special focus on the use of this framework to improve our understanding of retirement behavior. Section 2 summarizes some of my recent work in computational economics, including a new algorithm that succeeds in breaking the ``curse of dimensionality'' of computing approximate solutions to an important sub-class of SDP problems. Section 3 describes joint research with John Miller of Carnegie Mellon and Richard Palmer of Duke University on the use of artificial intelligence methods and automata trading in double auction markets. Section 4 concludes with an outline of some of my future research plans.

1. Empirical Models of Human Decision Making using Stochastic Decision Processes

Initially most economic theories and behavioral models (including the classical general equilibrium model of Arrow and Debreu) abstracted from a realistic treatment of time and uncertainty. However research by economists, mathematicians, and statisticians in the 1950's and 1960's lead to the development of the theory of dynamic programming, (DP), a very powerful tool for building an immense variety of models of optimal behavior that explicitly incorporates time and uncertainty. The ``behavior'' implied by a DP model is given by an optimal decision rule $d_t=\delta(s_t)$, i.e. a function specifying the individual's optimal decision dt in state stat time t (the precise sense in which this rule is optimal will be explained shortly). Starting in the late 1960's and early 1970's the theory of dynamic programming lead to a virtual revolution in economic theory, providing a major impetus for the development of the theory of rational expectations and dynamic game theory -- areas that now constitute the core of modern economic theory.

Before we could assess the empirical validity of these new dynamic economic theories, we needed to develope a completely different econometric methodology, which I refer to as dynamic structural econometrics. Traditional ``reduced-form'' statistical and econometric methods (including regression analysis) can be viewed as a way of summarizing the stochastic process governing the observable variables, $\{s_t,d_t\}$ where st is the state and dt is the action taken by a decision maker at time t. Traditional estimation methods posit a specific functional form for this stochastic process and estimate its unknown parameters. Dynamic structural econometric methods treat $\{s_t,d_t\}$ as a controlled stochastic process whose law of motion cannot be specified a priori but must be derived from the solution to the DP problem. The object of inference in dynamic structural econometrics is not the stochastic process $\{s_t,d_t\}$, but rather the individual's underlying preferences and beliefs. In a DP problem an individual's preferences are represented by a utility function u(st,dt) specifying the psychological or monetary reward received by an individual or firm who is in state st and takes action dt at time t, and also by a subjective discount factor $0 < \beta < 1$ at which the individual trades off current versus future utility. Beliefs are specified by a transition probability p(st+1|st,dt)representing the individual's subjective beliefs about next period's state given the current state and action. Given any particular specification for beliefs and preferences, $(\beta,u,p)$, the method of dynamic programming can be used to derive the optimal decision rule $\delta$ and the controlled stochastic process $\{s_t,d_t\}$ implied by $(\beta,u,p)$as the solution to the following optimization problem

\begin{displaymath}V(s) = \max_\delta E_\delta\left\{ \sum_{t=0}^T \beta^t
u(s_t,d_t)\big\vert s_0=s\right\}, \eqno(1)\end{displaymath}

where V is the value function representing the maximum expected disounted utility of an individual in state s at time t=0 and $E_\delta$ denotes mathematical expectation over the stochastic process $\{s_t,d_t\}$ induced by the law of motion p and the decision rule $\delta$. Dynamic structural econometrics can be viewed as solving the following inverse problem: given a stochastic process $\{s_t,d_t\}$ governing the states and decisions of a decision maker, is there a set beliefs and preferences $(\beta,u,p)$for which $\{s_t,d_t\}$ coincides with the solution to a DP problem, and if so, can we infer $(\beta,u,p)$ from observations of $\{s_t,d_t\}$? In practice we solve this inverse or ``revealed preference'' problem by parameterizing $(\beta,u,p)$ in terms of a vector $\theta$ of unknown parameters, so that different values of $\theta$ index different sets of beliefs and preferences. Let $(\beta_\theta,u_\theta,p_\theta)$ denote the preferences and beliefs corresponding to the vector $\theta$, and let $\delta_\theta$ and $V_\theta$ denote the corresponding optimal decision rule and value function. The goal of dynamic structural estimation is to compute an estimate $\hat\theta$ so that the implied optimal decision rule $d=\delta_{\hat\theta}(s)$ ``best fits'' our observations of $\{s_t,d_t\}$ in a sense I will make precise below.

One of the first contributors to this new econometric methodology was Thomas Sargent in his paper on dynamic labor demand in the 1978 Journal of Political Economy. This work and subsequent joint work by Sargent and Lars Hansen in the early 1980's yielded a well-developed estimation theory for the class of linear-quadratic DP problems, i.e. problems where the individual's utility $u_\theta(s,d)$ is assumed to be a quadratic function of their state s and action d and the individual's beliefs $p_\theta(s_{t+1}\vert s_t,d_t)$ are specified by linear stochastic difference equations. Subsequent work by Hansen and Kenneth Singleton in 1982 developed a method for structural estimation of a much wide class of models, namely DP problems whose optimal decision rules satisfy a stochastic Euler equation, a functional equation characterizing the first order necessary condition for the optimal decision rule $d=\delta_\theta(s)$. Since the derivation of this first order condition requires differentiation with respect to the decision variable d, the method only applies to continuous decision processes (CDP's), i.e. SDP's where the individual has a continuum of possible actions in each state s. In particular, these methods cannot be used for structural estimation of discrete decision processes (DDP's), i.e. SDP's where the individual has only a finite number of possible actions in each state s. An example of a DDP is an optimal search or stopping problem where there are two possible actions: dt=0 (reject current job offer and continue searching), and dt=1 (accept current job offer and stop searching). Motivated by prior work by my thesis adviser Dan McFadden on structural estimation of static discrete choice models, in 1984 I developed a nested fixed point maximum likelihood estimation algorithm (NFXP) that could be used to estimate preferences and beliefs of a wide class of DDP problems. The first empirical application of this approach was subsequently published in Econometrica in 1987. I established the asymptotic statistical properties of of the NFXP algorithm and the associated maximum likelihood estimator of $\theta$ (i.e. consistency and asymptotic normality) in my paper in the 1988 SIAM Journal on Control and Optimization.

The idea behind the NFXP algorithm is conceptually quite simple: one uses fast algorithms that repeatedly re-solve the DP problem (1) for different trial parameter values $\theta$ until the predictions of the DP model ``best fit'' the data in the sense of maximizing the likelihood of the observed states and decisions of a sample of individuals. The NFXP approach was initially treated with a great deal of skepticism in the economics profession since it was widely believed that the nested numerical solution of the inner DP problem would be computationally intractable and round-off and approximation error in the numerical solution would lead to instabilities in the outer maximum likelihood algorithm. However I was able to develop a fast, numerically stable DP solution algorithm that enabled me to estimate small to medium sized problems on personal computers, and large scale problems on supercomputers. The implementation of the NFXP algorithm is complicated by the fact that my estimation framework requires the individual's state s to be partitioned into two components, $s=(x,\epsilon)$ where x is observed by the individual and the econometrician and $\epsilon$ is observed only by the individual. This decomposition reflects the practical reality that no data set will ever be able to completely measure the full state of an individual. It is also motivated by the fact that the optimal decision rule $d=\delta_\theta(x,\epsilon)$is a deterministic function of $(x,\epsilon)$, which implies that in principle we could perfectly predict an individual's behavior if we knew $\delta_\theta$ and were able to observe $\epsilon$ in addition to x. Of course, no theory is sufficiently powerful to be able to perfectly predict an individual's behavior, and the assumption of fully observed state leads to a statistically degenerate econometric model (i.e. there may be no value of $\theta$ that can perfectly predict an individual's actual behavior). I was able to avoid this degeneracy problem by integrating out the unobserved state variable $\epsilon$ to obtain a conditional choice probability $P_\theta(d\vert x)$:

\begin{displaymath}P_\theta(d\vert x) = \int I\left\{d=\delta_\theta(x,\epsilon)\right\}
q_\theta(d\epsilon\vert x), \eqno(2)\end{displaymath}

where $I\{\cdot\}$ denotes the indicator function (1 if $d=\delta_\theta(x,\epsilon)$ and 0 otherwise), and $q_\theta(\epsilon\vert x)$ denotes the conditional density of the unobserved state variable $\epsilon$ given the value x of the observed state variable. The choice probability $P_\theta(d\vert x)$ implies that there is a positive probability (perhaps very small) that the individual will choose any decision d from the set D(s) of feasible decisions in state s. While this approach solves the problem of statistical degeneracy and yields a well-defined likelihood function $L(\theta)$ given by:

\begin{displaymath}L(\theta) = \prod_{t=1}^T P_\theta(d_t\vert x_t)
p_\theta(x_t\vert x_{t-1},d_{t-1}), \eqno(3)\end{displaymath}

it creates additional computational problems since the unobserved state variable $\epsilon$ will typically be a vector of continuous state variables. This leads to difficult numerical integration problems in order to compute the value function $V_\theta(s)=V_\theta(x,\epsilon)$ and the conditional choice probability $P_\theta(d\vert x)$. Borrowing from the previous work of my thesis adviser Dan McFadden, I showed that if $q_\theta(\epsilon\vert x)$ is a multivariate extreme value distribution, and if the individual's utility function takes the form

\begin{displaymath}u_\theta(s,d)=u_\theta(x,\epsilon,d) = u_\theta(x,d) + \epsilon(d) \eqno(4)\end{displaymath}

where $\epsilon=\{\epsilon(d)\vert d\in D(s)\}$ is a vector with as many components as there are alternatives in the individual's choice set D(s), then one obtains a very simple closed-form expression for the choice probabilities $P_\theta(d\vert x)$:

\begin{displaymath}P_\theta(d\vert x) = { \exp\{ v_\theta(x,d)\} \over
\sum_{d'\in D(s)} \exp\{ v_\theta(x,d')\}} \eqno(5)\end{displaymath}

where the function $v_\theta$ is the unique solution to the functional equation:

\begin{displaymath}v_\theta(x,d) = u_\theta(x,d) + \beta \int \log\left[ \sum_{d...
...exp\{ v_\theta(x',d')\}\right] p_\theta(dx'\vert x,d). \eqno(6)\end{displaymath}

Equation (5) is a dynamic generalization of the classical multinomial logit model that includes the well known static logit model as a special case when $\beta = 0$. The function $v_\theta(x,d)$ can be regarded as an intertemporal generalization of the static utility function $u_\theta(x,d)$ of the static logit model and is related to the value function $V_\theta(x,\epsilon)$ given in equation (1) via the identity:

\begin{displaymath}V_\theta(x,\epsilon) = \max_{d \in D(s)} \left[ v_\theta(x,d) + \epsilon(d)\right].
\eqno(7) \end{displaymath}

A similar approach to estimation has been adopted in independent contributions by other econometricians including Gotz and McCall's 1979 paper on exit decisions of air force pilots, Miller's 1984 work on occupational choice, Pakes's 1987 paper on patent renewal, and Wolpin's 1984 paper on fertility decision of Malayasian households. With the exception of Miller's method (which is restricted to the relatively narrow class of multi-armed bandit problems), the approach adopted by these other authors does not easily extend to non-binary decision problems whereas the methodology I developed applies to DDP problems with an arbitrary finite number of alternatives. Due in part to the relative simplicity of my estimation framework and the computational advantages of the closed-form multinomial logit expressions for the choice probabilities, nearly all applied work on estimation of non-binary DDP problems to date has adopted my estimation framework, estimating the structural parameters using versions of my NFXP algorithm. A recent example is Donna Boswell's 1994 use of the NFXP algorithm to estimate an SDP model of the effect of employer health care policies on worker absenteeism.

Having developed the econometric tools for estimating discrete dynamic programming models, my subsequent research turned to the more interesting economic question: do human decision makers actually behave according to the theory of dynamic programming? My first empirical application of the method was the paper ``Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher'' which appeared in the 1987 Econometrica. Zurcher was the head of maintenance at the Madison Metro Bus Company. One of Zurcher's regular duties was to decide which buses in his fleet should have their engines replaced with a new or overhauled engine, and which buses should have their current engines left in place with only routine engine maintenance being performed. I constructed a DP model that predicted the optimal time to replace an engine on a bus. The decision variable is binary: dt=0(keep the current engine), dt=1 (replace the current engine). The observed state variable xt was the cumulative mileage on the bus since last engine replacement. The optimal replacement policy was chosen to minimize the expected discounted costs of operating the bus. I found that the DP model did a remarkably good job of predicting Zurcher's replacement decisions, and was able to provide simple explanations for several puzzling features in the data. One of puzzles was the fact that Zurcher replaced the engines in a group of newer GMC buses an average of 57,000 miles earlier than the engines on older GMC buses despite the fact that the cost of replacment engines for the new GMC buses were 25% higher. The DP model ``explained'' this fact by imputing a faster rate of deterioration to the newer GMC engines. Follow-up discussions with Zurcher revealed that this was indeed the case and was precisely the reason why engines on the newer buses were replaced more frequently. The DP model also yielded predictions of the ``replacement demand function'', i.e. the number of annual replacement bus engines installed as a function of the cost of a replacement engine. I experimented with traditional econometric techniques for estimating this demand function including regressing the annual number of engine replacements on the cost of replacement engines. I found that these simpler econometric methods lead to inaccurate estimates of the replacement demand function for two reasons: 1) I had less than 10 years of time series data on this group of GMC buses so the regressions had very few degrees of freedom, 2) there had been little or no variation in the real cost of replacement engines for various models of GMC buses. When I ignored model year differences and pooled the older GMC buses with the newer GMC buses (whose engines are 25% more expensive) the regression model predicted an upward sloping replacement demand function.

The Zurcher paper succeeded in convincing many skeptics that the NFXP algorithm is computationally tractable and that at least for certain well-structured problems the dynamic programming framework can be quite successful in predicting the behavior of the ordinary ``man on the street''.1 In recognition of this contribution to our understanding of dynamic decision making, the Econometric Society awarded me the Ragnar Frisch Medal in 1992 (for leading empirical paper published in Econometrica in the preceding 5 years).

After I finished my initial work on the Zurcher paper in 1986, I became interested in using the DP framework to model retirement behavior. The challenges involved in developing an accurate and realistic empirical models that could capture the complex range of observed retirement patterns required much more of my research time than I initially anticipated, consuming nearly 50% of my available research time over the last 8 years. I decided to concentrate my research in this area for several reasons. Retirement is an important national and international policy issue in view increasing longevity, the continued trend towards early retirement, the impending wave of retirements of aging baby boomers, and the shaky financial condition of the Social Security trust funds. It seemed clear that the SDP framework could lead to improved economic models of retirement behavior and that these models could be of practical use to help governments and firms design improved Social Security systems and pension policies. However estimation and testing of these models is extremely data-intensive, requiring thousands of detailed longitudinal observations of individual retirement trajectories. Fortunately, an excellent panel data set had already been collected by the Social Security Administration -- the Retirement History Survey (RHS) -- representing more than 500 meagbytes of information on income, assets, labor supply, health and family status, and Social Security benefits for 11,000 men and women aged 58-63 interviewed in biennial surveys from 1969 to 1979.

Although the data-processing and computational challenges involved in estimating a comprehensive unified SDP model of retirement behavior were quite daunting, I believed the large up-front investment required was justified in view of several compelling advantages of using the SDP framework to model retirement behavior. From a policy perspective, the most important advantage of the SDP framework is that it allows us accurately model the impact of the complicated details of private pensions and Social Security rules. These rules lead to uncertain time, state, and decision-dependent benefit streams that depend on factors such as the person's age, earnings history, health, and labor force participation decisions. It turned out that accurate modelling the Social Security rules within the SDP framework has enabled me to solve a number of empirical puzzles about retirement behavior that previous models have been unable to explain. One of these puzzles is the large peaks in retirements at ages 62 and 65 -- the ages of eligibility for early and normal Social Security retirement benefits. There are several reasons why previous models have failed to explain these puzzles, the most obvious being their unrealistic treatment of time and uncertainty and their failure to model the pattern of incomplete insurance markets facing various individuals. The limitations create further problems in the models' ability to accurately capture the incentive effects of Social Security and pensions. For example, most of the existing retirement models assume that individuals pre-commit to a fixed retirement age at some arbitrary ``planning date'' such as age 55 and treat retirement as an absorbing state (i.e. one cannot ``un-retire'' after retiring). These models also assume that individuals have perfect ability to borrow against future income and pension benefits, and have either ignored uncertainty about future health status and earnings potential or assumed that all individuals have access to fairly priced health and disability insurance. I realized that between the NFXP algorithm and the RHS data, I had the technology and data necessary to estimate a SDP model that could circumvent all of these limitations. The result was a model that does a much better job of capturing the constraints created by incomplete markets and the uncertainties older individuals face as they approach retirement.

The details involved in carrying out this task turned out to be much more challenging and time consuming than I had originally anticipated. However I believe the results obtained from this research, summarized in my recent paper co-authored with Chris Phelan ``How Social Security and Medicare Affect Retirement Behavior in a World of Incomplete Markets'', justify the time and effort that I devoted to it. In my opinion it represents my best empirical work to date, and I hope it will ultimately be regarded as a significant contribution to the literature on retirement behavior. Our DP model succeeds in providing relatively simple explanations for many of the puzzling aspects of retirement behavior discussed above. The DP model permits a fairly realistic treatment of the sequential nature of the retirement process and individual subjective uncertainties about future mortality, marital status, health status, employment status, income levels, and health expenditures. The model places no restrictions on labor supply paths (i.e. retirement is not assumed to be an absorbing state), and separates the labor supply decision from the ``retirement decision'', which we identify with the decision to apply for Social Security benefits. The DP model delivers a rich set of predictions about the dynamics of retirement behavior, and comparisons of actual vs. predicted behavior show that the DP model is able to account for wide variety of phenomena observed in the data, including the pronounced peaks in the distribution of retirement ages at 62 and 65 (the ages of early and normal eligibility for Social Security benefits, respectively). The peak at 62 is a result of borrowing constraints that prevent individuals with relatively little tangible net worth from retiring prior to the age of first eligibility for early retirement benefits. The peak at age 65 is a result of incomplete annuities markets and the fact that Social Security benefit formula is actuarially unfair for retirements after age 65. However this only accounts for part of the large peak in retirements that occur precisely at age 65. The remainder of the peak is explained by another form of market incompleteness - incomplete health insurance - and the fact that Medicare insurance is only available to individuals over 65 who have applied for Social Security retirement benefits. Although the overall effect of Social Security is to create strong disincentives to continued labor force participation, it creates strong incentives for certain individuals to remain employed up until their $65^{\rm th}$birthday. We identify a significant fraction of ``health insurance constrained'' individuals who have no form of retiree health insurance other than Medicare, and who can only obtain fairly priced private health insurance via their employer's group health plan. The combination of significant individual risk aversion and a long tailed (Pareto) distribution of health care expenditures implies that these individuals impute a significant ``security value'' to remaining employed until they are eligible for Medicare coverage at 65. Overall, our findings suggest that the so-called ``age 65 retirement puzzle'' and several other puzzling aspects of retirement behavior can be viewed as artifacts of particular details of the Social Security rules whose incentive effects can be quite strong for lower income individuals and those who do not have access to fairly priced loans, annuities, and health insurance.

In order to obtain these findings, I had to overcome a number of difficult computational, econometric, data and measurement problems. The solutions to many of these problems represent independent contributions in their own right. The computational challenge was to develop an efficient algorithm to solve the DP model sufficiently rapidly to enable us to estimate on a limited computer budget. The DP model that we ultimately estimated contains 7 state variables and 2 control variables, and is to our knowledge the largest and most realistic model of its kind that has ever been estimated in this literature. I developed a highly efficient solution algorithm that was able to solve the DP problem in less than .5 cpu seconds on a Cray supercomputer. A paper describing this algorithm, ``A Dynamic Programming Model of Retirement Behavior'' won 3rd prize in the 1989 IBM Supercomputing Competition. The computational problems turned out to be minor in comparison to the econometric problems of obtaining accurate estimates of individuals' beliefs, $p(x_{t+1}\vert x_t,d_t,\theta)$. Individual subjective beliefs are obviously rather slipperly objects to estimate, so strong identifying assumptions had to be imposed. In this case I imposed the assumption of rational expectations, i.e. that subjective beliefs about mortality, earnings, etc. correspond to the objective probability distributions of realized outcomes for mortality, income, etc. for individuals in a homogenous population. However even given the assumption of rational expectations and the relatively large number of observations in the RHS data set it is still impossible to estimate p directly. The reason is that p still contains far too many unknowns than we have data to estimate. To see this note that even given our relatively coarse discretization of the state and decision variables, (with x assuming 14,400 possible values, d assuming 6 possible values in a total of 23 possible biennial time periods from age 56 to 102) the array p of conditional probability distributions representing an individual's beliefs contains a total of 1.9 million probability distributions over the 14,400 possible values of xt+1 -- amounting to over $2.86\times 10^{10}$ unknown probability values to be estimated! It is clearly impossible to reliably estimate all these probabilities from my cleaned RHS sample of approximately 7500 person/year transitions. I solved this problem by developing a method for decomposing p into a product of ``sub-transition'' densities for individual components of xt and imposing certain ``exclusion restrictions'' on the conditioning elements entering the sub-transition densities. This decomposition enabled me to obtain accurate estimates of the overall transition probability p using the limited number of available observations. Examples of the sub-transition densities include separate models of mortality, marital status, health status and wage earnings of husband and spouse. I developed a unique method for embedding Social Security rules within the p matrix via a set ``Social Security transition matrices'' transforming wage earnings exclusive of Social Security benefits to wage earnings inclusive of Social Security. Once all of the separate sub-transition densities had been estimated, it was a relatively simple matter to build the massive overall array of beliefs p by simple matrix multliplication. However the large number of elements in p necessitated development of a specialized highly efficient algorithm to rapidly re-construct p avoiding ``unecessary'' multplications implied by the exclusion restrictions. The result was a computationally and econometrically tractable algorithm capable of solving the the DP problem extremely rapidly (in less than .5 cpu seconds on a Cray supercomputer and less than 30 cpu seconds on a fast workstation).

Solving the data and measurement problems turned out to be by far the most time consuming aspect of my research on retirement behavior. Creating a consistently defined set of observations on variables such health status, labor supply, earnings, and consumption from successive waves of the RHS survey presented many challenges. An example of these problems is determining how to deal with internal inconsistencies in respondents' reports about labor supply (e.g. a respondent who reports that he is working at the 1969 interview but in the 1971 interview reports having been continuously unemployed since the previous interview). Determining how to create good measurements of subjective variables such as health status created additional difficulties, especially since the RHS did not use a consistent set of questions on health status throughout all survey waves. Other state variables required imputations to determine individuals' opportunity sets. An example is the health insurance state variable, which indicates whether an individual is constrained in their ability to purchase private health insurance at a fair price. As discussed above, this turned out to be one of the key variables to understanding the incentive effects of Social Security and Medicare. Since the RHS did not directly ask respondents about whether they had access to fairly priced private health insurance, I had to infer this information from their responses to other survey questions. However the problem of obtaining accurate measurements of consumption expenditures ct was the most severe, and ultimately forced me to simplify the specification of the DP model to circumvent it. I experimented with several alternative methods for constructing measures of consumption from the information in the RHS. For example I attempted to solve for ct from the budget equation wt+1=wt+yt-ct where wt+1 and wt are the observed net worth and yt is the imputed income flow to the household between successive survey waves. Measurement errors in wt+1 and wtto a lesser extent imputation errors involved in constructing yt lead to implausibly erratic measurements of ct, including a disturbingly high incidence of negative measurements of ct. Even though the original version of my algorithm included ct as one of the decision variables in the DP model, I found that the large measurement errors in ct interfered with the ability of the DP model to fit the other variables in the data such as labor supply and income that I was able to measure much more accurately. As a result all of the versions of the DP model I have estimated to date make the simplifying assumption that ct=yt. Although this assumption has been criticized by certain economists, I believe it is a highly defensible assumption for my sample of predominantly blue collar workers in the RHS. These individuals have very little net worth outside of their housing equity, and rely nearly exclusively on their pension and Social Security benefits to finance retirement consumption. There is very little evidence in the RHS that individuals spend down their net worth as they age, and indeed the distribution of changes in net worth wt+1-wt has a large spike at 0, indicating that ct=yt is in fact a very good approximation to individuals' behavior in the RHS. Indeed if we were to treat all fluctuations in net worth as arising from random measurement error, statistical test would be unable to reject the hypothesis that ct=yt. Furthermore, Deaton's 1991 Econometrica article demonstrated that it is optimal to set ct=yt if there are borrowing constraints and income is highly serially correlated. Since observed income is in fact highly serially correlated the fact that ct=yt appears to be satisfied in the data may be a confirmation that these individuals are behaving optimally in the face of borrowing constraints, exactly in accordance with Deaton's theoretical results. Borrowing constraints are certainly a fact of life for most older individuals: in fact it is actually illegal to borrow against one's future Social Security benefits.

My painstaking analysis of the RHS data, reported in my 1990 paper ``Behavior of Male Workers at the End of the Life-Cycle: An Empirical Analysis of States and Controls'', has contributed to my reputation for serious concern for survey sampling and data measurement issues. In 1990 I was appointed to as a consultant to assist in the design of the National Institute on Aging's new Health and Retirement Survey (HRS), a successor the RHS designed to obtain much better information on health, working conditions, and pension plan characteristics than was available in the RHS, and in 1993 I was appointed to the Panel on Retirement Income Modelling at the Committee on National Statistics of the National Academy of Sciences. I have recently submitted a 3 year research proposal to the National Institute on Aging to fund further research into retirement behavior using the HRS data. The HRS will enable me to overcome several shortcomings of the DP model that stemmed from limitations in the RHS data. In my opinion the most important shortcoming was the exclusion of 40% of the sample who expected to receive private pension benefits. These individuals were excluded due to the fact that the RHS has very sketchy information on the benefit provisions of the more than 600,000 different types of pension plans available in the U.S. in the 1970's. The HRS collects much more detailed information on the benefit structures of an individual's main pension programs, and this information will allow me to re-estimate a DP model with an integrated treatment of pensions and Social Security. The HRS will also allow me to create improved measures of health status, and to model disability and unemployment as additional exit routes from labor force participation. While I will be devoting substantial effort to devising new ways to obtain improved measurements of consumption and net worth, I am not optimistic that the HRS data will enable me to obtain significantly better measurements of these items than I could obtain using the RHS. Our inability to obtain good measurements of consumption is bad news for those interested in modelling consumption/savings decisions since even the most basic ``stylized facts'' about trends in consumption, savings, and net worth at the end of the life-cycle are still subject of considerable professional dispute. Fortunately I do not believe that the problem of obtaining accurate measurements of consumption will seriously impinge on my ability to model the retirement decision, at least for the vast majority of individuals who have failed to accumulate substantial net worth as they approach retirement. This is certainly true for the vast majority of individuals in the HRS, where 88% of all older full-time workers have accumulated level net worth that is less than 5 times their current annual income, and the mean ratio of net worth to current income for full time workers is only 3.8. My future research will continue to concentrate on the analysis of variables such as labor supply that we can measure relatively accurately, and will resort to reasonable simplifying assumptions (such as ct=yt) for variables such as consumption that are dominated by measurement error.

In summary, my overall assessment of my research on empirical models of SDP's is that the bus replacement and retirement applications are unqualified successes that demonstrate that the NFXP algorithm is a practical econometric tool and that the behavior of the ``man on the street'' can be well-approximated by an optimal decision rule to a dynamic programming problem. However a cynic could argue that any behavior can always be ``rationalized'' as optimal for some choice of preferences and beliefs. Indeed in my recent paper ``Do People Behave According to Bellman's Principle of Optimality?'' I show that in a dynamic context the hypothesis of optimization per se has no empirical content if we are unwilling to place any a priori restrictions on individual's preferences or beliefs. In formal terms, I proved that given an abitrary function $d=\delta(s)$, we can always find a set of preferences and beliefs $(\beta,u,p)$ for which $\delta$ is an optimal decision rule. However it is important to emphasize that there is no guarantee that the preferences and beliefs that rationalize an arbitrary behavior pattern $d=\delta(s)$ will be regarded as ``reasonable''. This implies that in order to judge whether an SDP model provides a ``successful'' explanation of an individual's behavior we need to consider whether it does so via a simple, parsimonious, and intuitively plausible specification of preferences and beliefs. Although the issue of what constitutes a ``reasonable'' specification of preferences and beliefs is clearly subjective, most economists are capable of identifying an artificially concocted theory when they see it. A more practical test of the validity of an SDP model is how well it forecasts behavior, especially in out-of-sample predictive tests and in quasi-experimental settings where agents' beliefs or preferences can be altered in controllable and predictable ways. However I must ackowledge that the hypothesis of expected utility maximization underlying the SDP framework has been rather convincingly rejected by a number of fairly robust experimental tests (such as the famous ``Allais paradox''), so SDP models can at best be regarded as idealized approximations of human decision making processes. My view is that we should judge SDP models the same way we would judge any other idealized, approximate model: are these models of practical value in the sense of improving our ability to predict and understand human behavior? The answer to this question is almost certainly yes at least in terms of large and rapidly growing number of empirical applications of the SDP framework to problems ranging from business investment decisions (Das, 1992) to women's contraceptive decisions (Montgomery, 1988, Ahn, 1994). We also have a number of very concrete examples demonstrating that SDP models do in fact yield much more accurate out-of-sample predictions than alternative models, especially with respect to policy changes.

On a personal level, I have found my research in this area to be very rewarding even though doing empirical work in the SDP framework involves many difficulties and is quite labor intensive and time consuming. I have been fortunate to receive research support from agencies such as the National Science Foundation, the National Institute on Aging, the Sloan Foundation, IBM and the Bradley Foundation, and professional recognition of my research contributions including my 1988 Alfred Sloan Fellowship, an invitation to present a symposium paper on estimation of SDP's at the 1990 Sixth World Congress of the Econometric Society, the 1991 Lief Johansen Lectures at the University of Oslo, the 1992 Ragnar Frisch Medal, an invited lecture on ``Pensions and the Labor Market'' at the Tinbergen Institute in Rotterdam in 1993, an invited chapter for the forthcoming volume 4 of the Handbook of Econometrics, and my 1993 election as a Fellow in the Econometric Society and my appoitment to the Committee on National Statistics of the National Academy of Sciences.

2. Research in Computational Economics

I have had a long interest in computation arising out of my original development of the nested fixed point algorithm for estimating discrete dynamic programming models. In 1992 I was asked to co-edit the Handbook of Computational Economics with Hans Amman of the University of Amsterdam and David Kendrick of the University of Texas. I have written a chapter for this volume entitled ``Numerical Dynamic Programming in Economics''. The chapter summarizes the current state of the art in numerical methods for solving various types of dynamic programming problems.

One of the most frustrating aspects of computing numerical solutions to DP problems is the computational burden involved in solving increasingly realistic models. Although there is a well developed literature on numerical methods for solving DP problems, the usefulness of these methods is limited by a problem that Richard Bellman termed the curse of dimensionality: the amount of computer time required to solve a DP problem increases exponentially fast in any relevant measure of the ``size'' or realism of the DP problem. The curse of dimensionality sets very severe limits on the level of realism and detail that we can expect to incorporate in a DP model. Even though the speed of computer hardware is increasing at an exponential rate, the curse of dimensionality implies that size of problems we can solve with this faster hardware grows at a much less dramatic rate: in practice the size of problems that are solvable seems to grow only linearly as hardware improves. Solving DP models at the level of detail that we encounter in everyday life -- such as choosing which items to buy from the thousands of possibilities at a grocery store or choosing which of the tens to hundreds of possible commuting routes to get from home to work -- is far beyond the reach of current hardware and software. Since humans are able to do these tasks virtually effortlessly, the computational problems involved in solving these problems in the SDP framework cast doubt on whether humans really are behaving ``as if'' they were solving a dynamic programming problem. Put differently, if the world's best mathematicians and economists are only able to solve relatively ``simple'' DP problems (even with the assistance of very powerful supercomputers), can we seriously expect DP models to do a good job of predicting the behavior of the ordinary ``man on the street'' who faces each day vastly more complicated decision problems?

In a recent paper ``Using Randomization to Break the Curse of Dimensionality'' I present a simple monte carlo algorithm that succeeds in breaking the curse of dimensionality of solving DDP problem, i.e. dynamic programming problems where the decision maker has a finite number of possible alternative actions in each state. The cpu time required to solve a DDP problem using this algorithm increases only as polynomial rather than exponential function of the size of the problem using other solution methods. This result is of considerable theoretical significance since most computer scientists have previously believed that this class of DP problems are subject to an inherent curse of dimensionality that no deterministic or randomized algorithm could circumvent regardless of how cleverly it is designed. The practical significance of the result is that it offers the possibility that we can solve much larger and realistic DDP problems than we previously thought was possible. The algorithm will have immediate application to estimation of SDP models, and will be especially useful in relaxing the strong assumption that the unobservable state variables $\epsilon$ in the DDP problem are an IID extreme value process. The new algorithm will make it feasible to estimate DDP models with serially correlated unobservables with more general non-extreme value marginal distributions. The result is also significant from a behavioral perspective, since it leaves open the logical possibility that humans really do use approximately optimal strategies to solve the highly complex decision problems that they face on a daily basis. It also raises the interesting question whether we really use ``algorithms'' that involve implicit randomization, or whether our processing capabilities are more of a result of effective harnessing of massive parallism in our uncounted trillions of neural circuits.

3. Research on Artificial Intelligence and Automata Trading in Double Auction Markets

An outgrowth of my interest in computational methods for solving DP problems is my interest in heuristic methods for solving difficult SDP problems. The heuristic methods I am interested in include ``learning algorithms'' and methods from the artificial intelligence literature. When one looks closely at most practical decision making situations one realizes that it is unrealistic to assume that individuals have well-defined probabilistic beliefs about uncertain aspects of their environment -- at least it seems unrealistic that most people would have highly detailed subjective probability assessments of the huge range of possible outcomes that could happen to them.2 This is especially true in strategic environments where individuals need to make probability assessments about the strategies used by their opponents. A classical example of such an environment is the double auction market, where bidding activity of buyers and sellers leads to endogenous formation of trading prices for commodities (a classic example of a double auction market are the commodity trading pits at the Chicago Board of Trade). The complexities involved in inferring whether certain bids and asks are results of private information possessed by other traders or whether their bids primarily reflect predictable information about their trading strategies poses an extremely difficult learning problem. It is natural to try to model the learning problem in a Bayesian fashion, and treat the observed price trajectories in these markets as Bayesian Nash equilibrium outcomes of a dynamic game of incomplete information. Unfortunately this game is far too complex to be solved either analytically or numerically using the fastest available supercomputers. Another problem with the game-theoretic approach is that it depends criticially on the common knowledge assumption that each trader knows that each of his opponents are rational and knows the exact form of his opponents' equilibrium trading strategies. As discussed above, this is a highly dubious assumption in practice. The common knowledge assumption has also been widely criticized since it presumes a high degree of implicit coordination amongst the traders, begging the question of how decentralized coordination is achieved in the first place. Thus, economists have been able very little about one of the most fundamental economic problems: how are prices determined and why does supply equal demand?

In order to obtain new insights into this problem, I got involved in a project at the Santa Fe Institute in the fall of 1989 which lead to the sponsorship of the 1990 Santa Fe Double Auction Tournament, co-organized with John Miller and Richard Palmer (a physicist at Duke Univeristy). Our computerized tournament attracted a collection of 30 heterogenous computer programs playing the role of buyers and sellers vying for a pool of $10,000 prize money (from my Alfred Sloan Fellowship award). The idea was to have contestants encode their trading intuition or favoriate artificial intelligence procedure into a computerized trading program. The computerized strategies competed over a broad range of trading environments to provide a rigorous test of their relative trading ability, where ability can be measured unambiguously in terms of the profits each program earned (we structured the tournament so that each program had the same potential trading surplus with probability 1). Since we had access to the code, we could essentially ``observe'' the underlying trading strategies, something that is impossible to do in actual or even experimental double auction markets using human subjects. We found that the collection of computerized traders behaved very similar to human subjects in double auction experiments, with price trajectories converging to competitive equilibrium and allocations that were nearly 100% efficient. Surprisingly we found that a very simple ``wait in the background'' trading strategy emerged as the winner of the tournament, beating out many more complex algorithms using statistically based predictions of future transaction prices, explicit optimizing principles, and sophisticated learning principles. The strategy required remarkably little information beyond its private token values, current time, and the current bid and ask. Whether or not good human traders really behave according to a few simple decision rules is the subject of ongoing investigations, but the results do seem to confirm Nobel laureate Friederik von Hayek's conjecture about the ``economy of information needed to take the right action in a competitive market''. This project resulted in a publication in the 1994 Journal of Economic Dynamics and Control and an article in a book I co-edited with Daniel Friedman of University of California Santa Cruz, The Double Auction Market: Institutions, Theories and Evidence. The National Science Foundation funded subsequent joint research with Vernon Smith of the University of Arizona that lead to the creation of the Arizona Token Exchange which allowed us to use the computerized strategies we obtained from the 1990 Santa Fe tournament as computerized opponents to human traders trading worldwide via the Internet. Our goal was to see if humans can learn to exploit or at least form best responses to fixed collections of computerized strategies in the double auction environment under repeated play. We gathered observations on over 10,000 DA trading periods from over 90 subjects, providing the largest dataset on time-series observations of strategy-formation and trading behavior ever collected. We are currently analyzing this voluminous data (several hundred megabytes worth of information on tens of thousands of separate double auctions) and expect to have a paper ready for submission to a journal by the end of this year. We expect analysis of this data to yield insights into the way humans learn effective trading strategies, and to improve understanding of the forces driving markets to competitive equilibria. We also expect to formulate improved computerized trading strategies and more realistic computerized models of working DA markets.

4. Future Research Plans

In rough terms the first decade of my research career focused on the development of dynamic programming as a general tool for understanding human economic behavior, with special focus on its application to understanding retirement behavior. The next decade of my research career will focus on systematic application of this tool for prediction and design of improved policies by governments and firms, with special focus on firm pension plans and government Social Security policy as it affects retirement behavior.

The primary advantage of the dynamic structural approach to econometric forecasting and policy-making is that it allows us to to make much more detailed and accurate predictions of the effects of hypothetical policy changes than is possible using traditional reduced-form econometric methods. I intend to use the SDP framework to design more efficient policies in a number of areas such as improved pension plans (to help private firms encourage the retirement of less productive older workers while retaining and rewarding the most productive), improved government Social insurance schemes (to direct health insurance disability and retirement benefits to individuals who truly need these benefits at minimum cost to taxpayers and with minimum distortionary impacts on savings and labor supply decisions), and regulatory policies for nuclear power plants (helping public utilities strike an appropriate balance between public safety and cost-efficient power generation).

The DP approach to predicting the impacts of policy changes is computationally intensive but conceptually straightforward. Let $\pi$ denote a given policy: for example in the case of Social Security $\pi$ could be represented by a vector $\pi=(\pi_1,\ldots,\pi_k)$ where $\pi_1$ is the age of early retirement, $\pi_2$ is the age of normal retirement, $\pi_3$ is the Medicare co-insurance rate, and so forth. Let $\theta$ denote a vector of parameters characterizing individuals' preferences and beliefs as discussed in section 1. We assume that individuals know their own values of $\theta$ but that this vector is unknown to the econometrician and must be estimated. Thus, $u_\theta(s_t,d_t,\pi)$ represents the utility (i.e. psychological well-being) of an individual whose preferences are given by the vector $\theta$, who is facing a given policy $\pi$, and who is in state st and takes action dt at time t. Similarly, the conditional probability density $p_\theta(s_{t+1}\vert s_t,d_t,\pi)$ represents this individual's beliefs about his next period state st+1 conditional on their current state and action (st,dt). Note that policy $\pi$ can affect a person's current welfare (for example by the level of Social Security retirement benefit currently received) and also one's beliefs about the future (e.g. one's expectations of future Social Security benefits). Given panel data on a sample of individuals' states and actions $\{d_t^i,s_t^i\}$ of $i=1,\ldots,I$ individuals followed over time periods $t=1,\ldots,T$, under an existing policy $\pi_e$, we estimate $\theta$ by finding the value $\hat\theta$ such that the decision rule $d_t=\delta_{\hat\theta}(s_t,\pi_e)$ best fits the observed data $\{d_t^i,s_t^i\}$ using the NFXP algorithm described in section 1. Given $\hat\theta$ we can then predict individuals' behavior under an alternative hypothetical policy $\pi_h$ by re-solving the DP problem under the new policy resulting in a new optimal decision rule $d_t=\delta_{\hat\theta}(s_t,\pi_h)$. A more systematic and ambitious method of policy analysis is the method of ``computational mechanism design'' that I have been developing with my colleague Chris Phelan. This approach use the revelation principle to simplify the process of searching over the space of all possible policies $\pi$ to maximize a certain objective function subject to certain individual rationality and incentive constraints, where the latter constraints arise from the presence of private information. For example, in our paper ``U.S. Social Security Policy: A Dynamic Analysis of Incentives and Self-Selection'' we used the computational mechanism design approach to find new Social Security policies that minimize the cost of providing retiree benefits to taxpayers subject to the individual rationality constraints that retirees should not be any worse off under the new policy $\pi^*$ than under the existing Social Security policy $\pi_e$, and subject to the incentive constraints that individuals should not want to lie about their true health status (e.g. to claim they are disabled in order to gain early retirement benefits). We are currently using this method in a paper entitled ``Measuring the Inefficiency of the U.S. Social Security System'' which we presented at a recent conference on ``The Future of the Welfare State'' in Ebeltoft, Denmark in 1994.

My more immediate research plans are to begin work on three books: Computerized Trading, Stochastic Decision Processes: Theory, Computation, and Empirical Applications, and Dynamic Programming and Behavior at the End of the Life Cycle. I also will be finishing my editorial work for the Handbook of Computational Economics and be delivering a final manuscript to North Holland in early 1995. After this work is finished the major focus for my research will be continued empirical work in the area of retirement behavior with special focus on the incentive effects of government social insurance and tax/transfer schemes. In June 1994 I submitted a 3 year $360,000 proposal to the National Institute on Aging entitled ``Analysis of Dynamic Models of Retirement/Savings Behavior Using the HRS''. If awarded, this grant will allow me to devote 50% research time over the next three years in order to develop substantially more detailed and realistic models of retirement behavior than I was able to estimate using the RHS data set in the research described in section 1. Although my research using the RHS has lead to a number of important breakthroughs in terms of computational and econometric methodology, the RHS data set is now rather out of date. I believe that the forthcoming waves of the HRS data set will lead to a wealth of new insights and much more powerful analyses of the effects of various government policies such as Social Security and health insurance, disability insurance and pension policy. I foresee many research papers and many Phd dissertations stemming out of the HRS, so it will be the major focus for my research over the next decade as successive waves of the data are collected and released. I am currently supervising four students at Wisconsin who plan on using this data set in their dissertation work, including Maria Perozek who is my co-principal investigator on the NIH grant I submitted in June. Maria plans to use the HRS to study the effects of precautionary saving and family transfers in an environment of incomplete insurance and annuities markets. My own research in this area will be heavily influenced by my participation in the National Academy of Science's Panel on Retirement Income Modelling. The goal of the National Academy Panel will be to identify major issues in the retirement area and I will be co-authoring a monograph summarizing the panel's findings and recommendations for future research.

>I believe the time is ripe for work in this area since we now have both the data and the econometric methodology to make substantial progress towards producing results that are both intellectually interesting and relevant for practical policy making. The policy issues confronting Western nations are extremely important in view of the rapidly aging baby boom population, increasing longevity, and continued trend toward early retirement. In the coming year I will be organizing a consortium of researchers to submit a ``grand challenge'' proposal to the National Science Foundation to provide research support to help crack some of the important unsolved problems. Among the problems that I believe we can solve with concerted effort is to develop realistic empirical models of the joint life-cycle consumption/savings and labor/leisure decision accounting for the effects of social insurance schemes such as UI, DI, Social Security, income taxes, and the incomplete pattern of private pension and health insurance markets.

Another grand challenge problem is to make the theory of ``mechanism design'' practical for designing actual, improved social insurance institutions. My research with Chris Phelan on ``Social Security: A Dynamic Analysis of Incentives and Self-Selection'' is a small, but potentially important step in this direction. These questions are not only of major interest in the U.S. but in Europe as well, especially with increased international competition making it increasingly difficult to continue to support social welfare systems at their historic levels of generosity. Increased emphasis will be placed on incentive efficiency and developing new policies that target benefits to those truly in need. However a limitation of the computational mechanism design approach is the curse of dimensionality involved in solving for efficient policies under increasingly realistic specifications of the retirement model. Another limitation is that this approach ignores the general equilibrium repurcussions of changes in Social Security policy. For example, policies that lead to increased retirement ages imply that more older workers will be trying to remain in the labor force. The current model assumes that this extra supply of older workers will not affect wages and unemployment rates - a dubious assumption. In order to address these issues I have embarked on a research project with my former colleague Lars Ljungvist to embed the analysis of retirement in a general equilibrium context. Our research will also focus on the effects of taxation and social insurance policies at all stages of the life cycle, particularly with respect to unemployment and disability insurance policies and policies for retraining individuals in a world of increasingly rapid technological obsolescence of human capital. My hope is that by organizing a group of the best researchers in the U.S. and Europe who are interested in these sorts of topics we will be able to exchange estimation and solution technologies and conduct a concerted attack on the most difficult outstanding problems. My plan is to coordinate this via a series of conferences patterned after the conferences organized by David Wise for the NBER's Economics of Aging Project. I hope to organize funding from a variety of sources in order to finance research release time, graduate student theses, and conferences to present results and exchange ideas. During my lectures in Rotterdam I found out about a Dutch version of the HRS and a large group of economists and graduate students there who are interested in these issues. There is also substantial interest and potential government funding in Scandinavia, Germany, and England. In summary I believe there will be funding and data to support a research consortium in this area, and I believe we are on the threshold of developing the computational and econometric techniques to make major contributions to the analysis of social insurance and tax/transfer policies over the next 5 to 10 years.


John Rust
2001-04-10