stochastic dynamical model for stock-stock correlations · ravindra e. amritkar physical research...

9
Stochastic dynamical model for stock-stock correlations Wen-Jong Ma and Chin-Kun Hu * Institute of Physics, Academia Sinica, Nankang, Taipei 11529, Taiwan Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised manuscript received 1 January 2004; published 4 August 2004) We propose a model of coupled random walks for stock-stock correlations. The walks in the model are coupled via a mechanism that the displacement (price change) of each walk (stock) is activated by the price gradients over some underlying network. We assume that the network has two underlying structures, describing the correlations among the stocks of the whole market and among those within individual groups, respectively, each with a coupling parameter controlling the degree of correlation. The model provides the interpretation of the features displayed in the distribution of the eigenvalues for the correlation matrix of real market on the level of time sequences. We verify that such modeling indeed gives good fitting for the market data of US stocks. DOI: 10.1103/PhysRevE.70.026101 PACS number(s): 89.65.Gh, 05.40.Fb In recent decades, the analysis of many physical and so- cial systems has been based on the idea that randomness in the fluctuations is hallmarked by certain prototypes, such as eigenvalue distribution of a random matrix, and any devia- tion from the latter is a result of the presence of correlations. Such an idea has been used to study the level statistics [1] of locally activated states [2] for electrons in heavy atoms or in solids, atomic vibrations in spatially disordered systems, dis- tribution spectra of eigenvalues for the correlation matrices of stocks [3,4], internet traffic [5], and atmospheric fluctua- tions [6]. While recent advances [3–6] suggest a robust ap- proach to reveal cross correlations from the data of time sequences, a comprehensive address of the shared dynamics behind these different systems, however, is still lacking. In the present paper, we will address this problem by proposing a stochastic dynamic model for stock-stock correlations [3,4]. Our approach is useful for studying time evolution of other interacting many-body systems subject to random noises. The nature of fluctuations in financial markets [3,4,7–11] has been of interest to the traders as well as a variety of professionals for more than a century. If such fluctuations could be completely characterized by random walks, as first proposed by Bachelier in 1900 [9], making profit under con- trolled risks through the transactions in the markets would seem impractical. Correlations in such fluctuations have been demonstrated in recent studies [3,4] of the distribution spec- trum of eigenvalues of the cross correlation matrix for the price changes of stocks in real markets. The matrix measures the statistical overlap of the fluctuations [3,4,11,12] in the price changes (the returns) between pairs of stocks. The spectrum from market data [3,4] possesses a bulk of continu- ously distributed eigenvalues, which is similar to the proto- type predicted by the level statistics of the random matrix theory (RMT)[1,4,13] and may be considered as mainly contributed by the randomness. The effects of the correla- tions [3,4,11] manifest in the eigenvectors of those eigenval- ues isolated from the bulk, which include one large eigen- value l M , corresponding to the correlation among all stocks (market mode)[4] and several much smaller ones scattered in between l M and the bulk component. The patterns in the eigenvectors of these latter eigenvalues were related to the presence of groups of correlated stocks [4,11]. In addition, the bulk component also shows some important deviations from RMT predictions. The spectrum of eigenvalues and the corresponding eigenvectors are related to the cooperative behavior in the fluctuations of the stock prices, which is not visible in the local information of the individual correlation coefficients composing that matrix [14]. Based on former information, the connections between the deviated eigenvalues and the presence of correlated groups of stocks have been clarified [4] and rewarded with an ansatz [11] which was applied to modeling the real market data [15]. In the model [11,15], the return of each stock is linearly decomposed into two uncor- related fluctuating parts. The interdependence of stocks within a group is carried by the part which is synchronously shared by all stocks within that group. This formulation re- fines the conventional multifactor model [8] leading to blocked structures in its correlation matrix [11], which repro- duces those spectral features observed in market data [15]. It assumes that each of the isolated eigenvalues from the bulk is contributed by one block of submatrix, with its eigenvec- tor containing only one activated group. This puts a limita- tion on the model, not able to digest completely the informa- tion carried by the eigenvectors for those deviated eigenvalues in the market data, in which the shared activated stocks are present very often among different eigenmodes. Our paper presents a new formulation to include this fact, at the same time, retaining the realization of grouping in its simplest form. Our approach is a general formulation which describes the cooperative activities in the financial fluctua- tions beyond the statistics of matrices, on the level of time sequences. *Electronic address: [email protected] PHYSICAL REVIEW E 70, 026101 (2004) 1539-3755/2004/70(2)/026101(9)/$22.50 ©2004 The American Physical Society 70 026101-1

Upload: others

Post on 13-Jul-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Stochastic dynamical model for stock-stock correlations · Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised

Stochastic dynamical model for stock-stock correlations

Wen-Jong Ma and Chin-Kun Hu*Institute of Physics, Academia Sinica, Nankang, Taipei 11529, Taiwan

Ravindra E. AmritkarPhysical Research Laboratory, Navrangpura, Ahmedabad 380009, India

(Received 2 October 2003; revised manuscript received 1 January 2004; published 4 August 2004)

We propose a model ofcoupled random walksfor stock-stock correlations. The walks in the model arecoupled via a mechanism that the displacement(price change) of each walk(stock) is activated by the pricegradients over some underlying network. We assume that the network has two underlying structures, describingthe correlations among the stocks of the whole market and among those within individual groups, respectively,each with a coupling parameter controlling the degree of correlation. The model provides the interpretation ofthe features displayed in the distribution of the eigenvalues for the correlation matrix of real market on thelevel of time sequences. We verify that such modeling indeed gives good fitting for the market data of USstocks.

DOI: 10.1103/PhysRevE.70.026101 PACS number(s): 89.65.Gh, 05.40.Fb

In recent decades, the analysis of many physical and so-cial systems has been based on the idea that randomness inthe fluctuations is hallmarked by certain prototypes, such aseigenvalue distribution of a random matrix, and any devia-tion from the latter is a result of the presence of correlations.Such an idea has been used to study the level statistics[1] oflocally activated states[2] for electrons in heavy atoms or insolids, atomic vibrations in spatially disordered systems, dis-tribution spectra of eigenvalues for the correlation matricesof stocks[3,4], internet traffic[5], and atmospheric fluctua-tions [6]. While recent advances[3–6] suggest a robust ap-proach to reveal cross correlations from the data of timesequences, a comprehensive address of the shared dynamicsbehind these different systems, however, is still lacking. Inthe present paper, we will address this problem by proposinga stochastic dynamic model for stock-stock correlations[3,4]. Our approach is useful for studying time evolution ofother interacting many-body systems subject to randomnoises.

The nature of fluctuations in financial markets[3,4,7–11]has been of interest to the traders as well as a variety ofprofessionals for more than a century. If such fluctuationscould be completely characterized by random walks, as firstproposed by Bachelier in 1900[9], making profit under con-trolled risks through the transactions in the markets wouldseem impractical. Correlations in such fluctuations have beendemonstrated in recent studies[3,4] of the distribution spec-trum of eigenvalues of the cross correlation matrix for theprice changes of stocks in real markets. The matrix measuresthe statistical overlap of the fluctuations[3,4,11,12] in theprice changes(the returns) between pairs of stocks. Thespectrum from market data[3,4] possesses a bulk of continu-ously distributed eigenvalues, which is similar to the proto-type predicted by the level statistics of the random matrixtheory (RMT) [1,4,13] and may be considered as mainly

contributed by the randomness. The effects of the correla-tions [3,4,11] manifest in the eigenvectors of those eigenval-ues isolated from the bulk, which include one large eigen-valuelM, corresponding to the correlation among all stocks(market mode) [4] and several much smaller ones scatteredin betweenlM and the bulk component. The patterns in theeigenvectors of these latter eigenvalues were related to thepresence of groups of correlated stocks[4,11]. In addition,the bulk component also shows some important deviationsfrom RMT predictions.

The spectrum of eigenvalues and the correspondingeigenvectors are related to the cooperative behavior in thefluctuations of the stock prices, which is not visible in thelocal information of the individual correlation coefficientscomposing that matrix[14]. Based on former information,the connections between the deviated eigenvalues and thepresence of correlated groups of stocks have been clarified[4] and rewarded with an ansatz[11] which was applied tomodeling the real market data[15]. In the model[11,15], thereturn of each stock is linearly decomposed into two uncor-related fluctuating parts. The interdependence of stockswithin a group is carried by the part which is synchronouslyshared by all stocks within that group. This formulation re-fines the conventional multifactor model[8] leading toblocked structures in its correlation matrix[11], which repro-duces those spectral features observed in market data[15]. Itassumes that each of the isolated eigenvalues from the bulkis contributed by one block of submatrix, with its eigenvec-tor containing only one activated group. This puts a limita-tion on the model, not able to digest completely the informa-tion carried by the eigenvectors for those deviatedeigenvalues in the market data, in which the shared activatedstocks are present very often among different eigenmodes.Our paper presents a new formulation to include this fact, atthe same time, retaining the realization of grouping in itssimplest form. Our approach is a general formulation whichdescribes the cooperative activities in the financial fluctua-tions beyond the statistics of matrices, on the level of timesequences.*Electronic address: [email protected]

PHYSICAL REVIEW E 70, 026101(2004)

1539-3755/2004/70(2)/026101(9)/$22.50 ©2004 The American Physical Society70 026101-1

Page 2: Stochastic dynamical model for stock-stock correlations · Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised

We formulate a stochastic dynamic model, calledcoupledrandom walks, for the stock-stock correlations based on theassumption that the changes in the prices of the stocks aredue to the adjustment to eliminate the difference between theexpected price for each individual stock and that for a col-lection of related stocks. The correlations are imposed as theunderlying network over which the price-balancing pro-cesses are carried out. We assume two underlying structuresbuilt on the network. One includes the stocks of the wholemarket, over which the process underscores the presence ofthe market mode[4]. The other one realizes the formation ofgroups, which accounts for the presence of the other deviatedeigenvalues. The partitioning of groups and the strengths ofthe correlations over each of the two structures are deter-mined to reflect the degree and the extent of comovement forthe stocks, indicated by the location of the largest eigenvaluelM and by the information contained in the eigenvectors cor-responding to those deviated eigenvalues(see latter text).The comparison between the spectra of our model to be in-troduced below and the US stocks under such analysis ispresented in Fig. 1 which shows very good agreement.

I. STOCHASTIC DYNAMIC MODEL

Our model consists of a system ofN walks (correspond-ing to N stocks), and letxistd, i =1,2, . . . ,N, denote the po-sition (price) of the ith walk (stock) at timet. A random walkwithout correlations can be written as

xist + 1d = xistd + jistd,

wherejistd is a random noise. We introduce correlations byexpressing the position at timet+1 by

xist + 1d = s1 − eM − egdf i„xistd… +eM

Noj=1

N

f j„xjstd…

+eg

ngokPg

fk„xkstd…,

if i P g andg P G, seM + eg ø 1.0d, s1d

whereeM andeg are coupling constants(subscripts “M” and“g” denote “market” and “groupg,” respectively) andG is a

FIG. 1. (Color) The simulated eigenvalue distributionrsld with egÞ0 (thick solid line) andeg=0 (dashed line) with N=345 stocks andT=3250 steps, in comparison with the market data(open and filled circles, see text) and the prediction of the random matrix theory[13](dotted line). The inset shows large isolated eigenvalues, includinglM. The fit (thick solid line) in the figure is obtained witheM =0.794, 8major groups, each occupied by more than 10 stocks, and 17 small groups. The number of stocksng in each group and the correspondingcouplingseg are listed in Table I. The simulated data are obtained by averaging over an ensemble of 106 matrices.

MA, HU, AND AMRITKAR PHYSICAL REVIEW E 70, 026101(2004)

026101-2

Page 3: Stochastic dynamical model for stock-stock correlations · Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised

class of nonoverlapping subsets of the system ofN stocks.We choose the functionf i(xistd) as

f i„xistd… = xistd + jistd. s2d

The second term in Eq.(1) corresponds to a coupling to themarket mean field determined by all the walks(stocks). Thelast term corresponds to the coupling to the mean field of thegroup g to which the walki belongs and the summation isover all ng members of the group; for the stock which doesnot belong to any group, there is no such term. The type ofcoupling proposed in Eq.(1) is commonly used in coupledmaps where the functionf i is usually a nonlinear map[16].Though the model treats eachjistd as an uncorrelated noise,while comparing with the actual market data it may be con-sidered as an integrated effect of the random fluctuation be-tween the two discrete timest and t+1. In our numericalsimulations,j’s are taken as temporally and mutually uncor-related Gaussian random values with zero mean and variances2

kjistdj jst8dl = s2di jdtt8 si, j = 1, . . . ,Nd, s3d

where the averagingk¯l is over the statistical ensemble.Equation(1) describes a concise model covering various

cases, from the totally random situationseM =eg=0.0d to thecase where all stocks are fully correlatedseM =1.0,eg=0.0d.It can be written as an equation of continuity,

r st + 1d = xst + 1d − xstd = jstd + D„xstd + jstd… s4d

where r , x, and j are the column vectors containing thedisplacements(the returns), the positions and the noises ofthe N walks, respectively.D is a Laplace-type operator de-scribing the flows due to the presence of gradients over anunderlying network. The displacements of the walks are,therefore, governed by the mechanism over the underlyingnetwork tending to eliminate the ever-changing unbalancearisen from the incoming random pulses. In our mean-fieldformulation, we assume

D = DM + DG, s5d

where the operatorsDM andDG, defined as theN3N matri-ces with, respectively, matrix elements

sDMdi j =eM

NHs1 − Nd if i = j

1 otherwise

and

sDGdi j =5eg

ngs1 − ngd if i = j P g andg P G

eg

ngif i Þ j ; i, j P g andg P G

0 otherwise,

s6d

which couple to the collections of flows coming to theithsite, contributed by the whole system and by the individualgroups, respectively. The parameterseM and eg are then re-alized as the kinetic coefficients controlling the magnitudesof such flows.

The information of cross correlation is contained in thematrix C of cross correlation coefficientsCij , which is de-fined as the statistical overlap of the fluctuationsdr i =r i−kr ilT between the two stocksi and j , that is,

Cij =kdr idr jlT

sis j, s7d

wheresi2=ksdr id2lT. The averagek·lT is over a time series of

T events. Each eigenvector(eigenmode) of C describes onepossible way of activation for the stock fluctuations, with themagnitude of the corresponding eigenvalue measuring itscontribution to the fluctuations[17].

II. MARKET MODE

WheneM =1 andeg=0, all the entries of the matrixC are1. The matrix possesses one eigenvalue equal to the numberof stocks in the market and others are zero. As the correla-tions decrease, the single large eigenvalue remains isolatedfrom the bulk of dispersed eigenvalues. This roughly ex-plains the origin of the two major components found in thespectrum from the market data, the bulk and the marketmodelM. The fitting in Fig. 1 suggests the dominant role ofthe market coupling(eM close to 1, see the following text).

In the following, we derive the explicit expressions formatrix C and find analytically the dependence of the eigen-valuelM on eM, for the case with no group couplings. In thiscaseseg=0d, the returns in Eq.(4) can be written as[18],

r std = jst − 1d + DMs1 − eMdt−1−t0xst0d

+ DMos=t0

t−1

s1 − eMdt−1−sjssd. s8d

Here we assume the system, with all walks initially at thesame positionxis0d=0, has been statistically steady at timet0so that the data can be collected starting fromt= t0+1, for theevaluation of correlation matrixC. Denoting the N3Tmatrices J;sjst0d ,jst0+1d , . . . ,jst0+T−1dd and R;fr st0+1d ,r st0+2d , . . . ,r st0+Tdg for the sequences of noises andreturns, respectively; and introducing theT3T matrix P by

Pvw = Hs1 − eMdw−v if w ù v

0 otherwise,s9d

Eq. (8) for t= t0+1,t0+2, . . . ,t0+T can be summarized as

R = J + DMsX0 + JdP, s10d

where X0 is the N3T matrix carrying the information ofinitial conditions, with vectorxst0d in its first column andzeros elsewhere. With properly chosens2 for the randomnoises[Eq. (3)] to assure the unity variances in Eq.(7), thecorrelation matrix can be written as

C <1

TRRT s11d

in the largeT limit (see Appendix). From Eqs.(10) and(11),we can see that the cross correlation in our model is due tothe statistical overlap managed byboth the site-wise operator

STOCHASTIC DYNAMICAL MODEL FOR STOCK-STOCK… PHYSICAL REVIEW E 70, 026101(2004)

026101-3

Page 4: Stochastic dynamical model for stock-stock correlations · Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised

DM and the time-wise operatorP. The presence of suchcombination between the structural and the temporary effectsis true for the general caseDGÞ0 [19]. As a result, thestocks activated in the eigenmodes deviated from bulk arenot restricted to one group for each eigenmode(see the fol-lowing text for the numerical results). This is in contrast tothe model proposed in Ref.[11].

Now, we consider the ensemble averagekRitRjtl, whichdescribes the cross correlation between the two walksi and jat time t and the mean overT time steps of which gives theensemble averaged correlation coefficient[from Eq. (11)]

kCijl =1

To

t=t0+1

t0+T

kRitRjtl. s12d

The quantity can be evaluated analytically by using Eq.(3).For t@1, we have(see Appendix)

kRitRjtl = s2Fdi jh1 − aseMdj +aseMd

NG , s13d

where aseMd=eMs3−2eMd / s2−eMd changes monotonicallyfrom 0 to 1 aseM increases from 0 to 1. Note that the cor-relation between a pair of different walks isaseMd, diluted bythe size factor 1/N. From Eqs.(12) and (13), we obtain

kCijl = 5 aseMds1 − a„eMd…N + aseMd

if i Þ j

1 if i = j ,

s14d

for the mean of distribution for each entry over the ensembleof matrices. We can see from Eqs.(13) and (14) that, in thelarge size limit, the local correlations between a pair of dif-ferent walks become diminishing. On the other hand, weshow in the following that the divergence due to collectiveactivities is present for the global quantitylM.

Consider the matrixC* ;kCl defined by Eq.(14). Its larg-est eigenvaluelM

* is [20]

lM* =

N

fh1 − aseMdjN + aseMdg, s15d

which diverges ateM =1, asN→`. In the largeN limit, wecan write Eq.(15) as

lM* < f1 − aseMdg−1 =

2 − eM

2s1 − eMd2 , s16d

which diverges in a power lawlM* ~ s1−eMd−2, aseM→1. We

found that Eq.(15) describes also the market modelM forthe ensemble of matrices, foreM near 1. Figure 2 verifies insimulation that the presence of such divergence forlM, in apower law with exponent 2 aseM→1. The result is similar tocritical phenomena in that the divergent fluctuations emergefor a system approaching to the(critical) state with full cor-relations[20]. This critical state in our model is characterizedby a finite-size scaling relationshiplM

* se ,Nd= e−2LseN1/2d inthe critical region [21], where e;1−eM and Lsxd;x2/ s2x2+1d.

III. SPECTRA OF CROSS CORRELATION MATRIX

We now compare the results of our model with the spectraof the actual market data[22]. Figure 1 shows the simulatedeigenvalue distribution for the cross correlation matrix for asystem of N=345 coupled walks(corresponding to 345stocks) obtained by averaging overT=3250 steps. The figurealso shows the RMT prediction[23] and the results for 345US stocks obtained with 3250 values of the 30 min returns inthe year 1996 for 250 days[24] and 6.5 trading hours eachday. In Fig. 1, the open circles are for one data set startingfrom 9:30 a.m. while filled circles are ensemble average offive data sets starting at different times from 9:30 to10:00 a.m. We see from the figure that our model of coupledrandom walks gives an excellent fit for the actual marketdata [25] and accounts for the three important deviationsfrom the RMT predictions, namely, a very large eigenvalue(market mode), several eigenvalues larger than RMT upperlimit but much less than the market mode, and significantdeviations of the bulk from RMT prediction.

The fitting of the market data to the model includes thedividing of the stocks into groups and the estimation of thecoupling parameters. Without exploring the sophisticated al-gorithms for partitioning[26] and fitting[15], we investigatethe feasibility of a procedure of grouping based on the inte-grated information carried by the eigenmodes of the correla-tion matrix and the fluctuation properties of the time se-quences of each group. The grouping obtained in thisapproach may not be necessary identical with the division bythe market sectors or by the maximum-likelihood fitting[15,27]. After all, the grouping for the cooperative activitiessignaled by eigenmodes(see following paragraph) are af-fected by many dynamic factors, such as the trading behaviorof the stock owners, and may be different from those basedon static interpretations[14] of the correlation data.

The parameters of the model used for the fit in Fig. 1 aredetermined as follows. The couplingeM is estimated by Eq.(15) using the market modelM obtained from the correlationmatrix C and is amended with the determination of the restparameters. The number of groups and the number of stocksng in each groupg are determined by finding the comove-ments of the stocks in the eigenvectors corresponding to theK eigenvalues in between the bulk component and the mar-ket mode. The procedure is as follows. The stocks are con-sidered as in the same group in the mean-field scenario ifthey are activated correspondingly in allK eigenvectors. Wedefine a base noise level corresponding to the fluctuations inthe components of the eigenvector of the market mode. Inthe components of theK eigenvectors a component(stock) istreated as active if it has a magnitude larger than this basenoise level otherwise its contribution is neglected. To deter-mine whether two stocks are in the same group, we considertwo K-dimensional vectors composed of the correspondingcomponents in theK eigenmodes and take the absolute valueof the cosine of the angle between these two vectors. If thisabsolute value is larger than some critical valueb then thetwo stocks are coupled. After we get all the couples we clas-sify them into groups. Two groups are merged if they haveone or more overlapping elements and the direction of the

MA, HU, AND AMRITKAR PHYSICAL REVIEW E 70, 026101(2004)

026101-4

Page 5: Stochastic dynamical model for stock-stock correlations · Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised

overlapped part is within the deviation of each group in allthe K directions. The critical valueb<0.835 is determinedsuch that the final number of major groups after merging isK(or close toK). The couplingeg for each groupg can be

estimated by matching theeg-dependent properties betweenthe market and the model[28]. In Table I, we list the param-etershngj and hegj for the groups obtained in fitting to ourmodel.

TABLE I. Fitted grouping parameters.

Group indexg A* B* C* D* E* F* G* H* I J K L M

ng 70 62 30 29 26 17 14 12 8 7 6 5 4

eg3103 104 139 111 44.6 9.25 46.9 13.1 98.7 61.4 135 95.2 41.3 0.06

Group indexg N O P Q R S T U V W X Y

ng 4 3 3 2 2 2 2 2 2 2 2 2

eg3103 0.11 123 7.09 2.14 78.6 0.91 17.7 11.4 94.4 1.60 1.76 62.0

FIG. 2. (Color) The plot of the ensemble averaged valuelM and the widthDlM of the distribution oflM in the ensemble correspondingto the market modefor various values ofeMseg=0.0d for three different systems withN=s345,406,1000d walks, lengths T=s3250,1309,6448d, and over ensembles ofs106,400,50d cross correlation matrices, respectively.[These are the sameN andT values asthe pools of stocks analyzed in this study(Fig. 1), Refs.[3] and [4], respectively.] The corresponding curves for the analytical expression,Eq. (15), for finite N (thin broken, dotted, and dashed lines are forN=345, 406, and 1000, respectively) are plotted for comparison. Theseries of curve approaches in increasingN to that of Eq.(16) (solid line), which is a power law with exponent 2 foreM <1. The inset showsthe same data in log-log plot.

STOCHASTIC DYNAMICAL MODEL FOR STOCK-STOCK… PHYSICAL REVIEW E 70, 026101(2004)

026101-5

Page 6: Stochastic dynamical model for stock-stock correlations · Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised

In Fig. 3, we plot the eigenvectors of a few typical modesobtained for the model and for the 345 US stocks describedin Fig. 1. It shows the presence of several simultaneouslyactivated groups in each of the eigenmodes deviated frombulk [eigenvectors corresponding tol1, l2, andl3 in Figs.3(a) and 3(b)], for both the market data[Fig. 3(b)] and the

simulation [Fig. 3(a)]. Such situation, however, cannot bedirectly interpreted in a model based on the blocked-structured correlation matrixC [11,15,19]. In our model, thegroupings cause not only the deviation for the eigenvalues inbetween market mode and the continuous bulk component,but also modify the bulk leading to a shift and an extensionof the distribution at the lower edge(Fig. 1). The eigenvec-tors corresponding to the eigenvalues at lower edge possessvery different patterns compared with those(discretely) dis-tributed above the upper edge of the bulk. The differencesare easily demonstrated in our model.[Compare the eigen-vectors corresponding tol5 andl3 in Fig. 3(a).] The stocksin the latter modes are coherently activated in groups, whileonly individual stocks are activated with large amplitudes inthe former modes[29]. Such differences can also be recog-nized in the market data.[Compare those forl5 and l3 inFig. 3(b).]

It is worth to note that, in our model, the grouped activi-ties are realized as the perturbing parts to the market activi-ties, as a result of the large magnitude inlM as comparedwith the other eigenvalues. The ingredient of cooperativeactivities carried by the market mode(the eigenvector corre-sponding tolM) has been excluded in determining the group-ing parameters(see Table I). Such division can only be fea-sible via the decomposing of the fluctuations into collectivemodes. It is supported by the observation that, in marketmode[see Fig. 3(b)], the stocks prices of the whole marketchange coherently in the same direction(sign). The groupingobtained in our approach is quite different from that via aclustering procedure based merely on the static informationof cross correlation[14]. In Fig. 4, we compare the groupingobtained by our approach with the information of S&P 500sectors, and with the minimum-spanning tree based on theanalysis of correlation coefficients[14]. It is apparent that thestructure of grouping for the lower level collective activitiesobtained in our analysis is different from that based on staticinformation. In the latter case, the dominant market coopera-tive activities are not separated.

IV. FINAL COMMENTS

To conclude, we have proposed a dynamical model ofcoupled random walks for the evolution of stock prices,which properly accounts for the important deviations fromthe RMT predictions. The correlations are realized on thelevel of time sequences rather than on the statistics of corre-lation matrix, which deserves further investigation for widerapplications. In applying to market data, the coupling param-eters suggest the usage of the similar concept of(the inverseof) impedance which effectively describes systems of densefluid. To our knowledge, such kind of analysis has neverbeen carried out from the standpoint of the whole market[8].The kinetic contents of these parameters and the implicationfor the partitioning of the stocks and for the structure ofunderlying network are the relevant issues to explore.Though we have presented results only for the economicdata, we feel that our model and analysis will be useful inother problems as well where significant deviations from theRMT predictions are found[5,6].

FIG. 3. The components(scaled fluctuations of returns) for theeigenvectors of a few typical eigenvalues corresponding to ourmodel(a) and to the market data(b) as described in Fig. 1. They arethe eigenvectors of(top down) the market modelM, the next threelargest eigenvaluessl1,l2,l3d and two typical modes with theireigenvalues falling inside the bulksl4d and at the lower edge of thebulk componentsl5d, respectively. The stocks are numbered so thatthe stocks belonging to the same group are consecutively placedand the groups are arranged in the plots, from left to right, accord-ing to their sizes in the descending order. There are 27 stocks placedat the end that do not belong to any group. The vertical lines indi-cate the boundaries of the groups and the horizontal lines mark thezero levels.

MA, HU, AND AMRITKAR PHYSICAL REVIEW E 70, 026101(2004)

026101-6

Page 7: Stochastic dynamical model for stock-stock correlations · Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised

ACKNOWLEDGMENTS

We thank J. Voit and T. Lux for comments. W.J.M. ac-knowledges the helpful conversation with C. N. Chen, K. T.Leung, D. B. Saakian, T. Shimada, J. Skřivánek, and T. M.Wu. This work was supported in part by National ScienceCouncil of the Republic of China(Taiwan) under Grant No.NSC 91-2112-M-001-056.

APPENDIX: CORRELATION MATRIX

According to Eq.(7), the correlation matrix is

C = G−1/2sR − RVdsR − RVdTG−1/2, sA1d

where theN3N matrix G and theT3T matrix V are de-fined by

Gi j = H„sR − RVdsR − RVdT…ii if i = j

0 if i Þ jandVi j =

1

Tfor i, j = 1, . . . ,T, sA2d

FIG. 4. (Color) The 345 US stocks classified by our method and by the minimum spanning tree analysis on the correlation coefficientsin Ref. [14]. We label the stocks in the eight largest groups obtained by our method(see text and Table I) in different colors. The stocks arealso marked by different symbols according to their sectors in S&P 500 classification. In our approach, the grouping is realized as theperturbing part of the cooperative activities in the fluctuations to the dominant market activities(see text). In the minimum spanning treeanalysis, there is no such division.

STOCHASTIC DYNAMICAL MODEL FOR STOCK-STOCK… PHYSICAL REVIEW E 70, 026101(2004)

026101-7

Page 8: Stochastic dynamical model for stock-stock correlations · Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised

respectively.To find the distributions for the entries of the correlation

matrix C, we evaluate the ensemble averages ofRRT,RVRT, andRVVRT. It is useful to evaluatekRisRjtl first.We have[from Eq. (3)]

kRisRjtl = s2Fdi jdsts1 − eMd + di jAst + dsteM

N−

Ast

NG ,

sA3dwhere

Ast =eM

eM − 2fs1 − eMds+t + s1 − eMdus−tu+1g. sA4d

From Eq. (A3), we obtain Eq.(13) in the stationary timeregime and, accordingly,

ksRRTdi jl = s2TFSdi j„1 − aseMd… +aseMd

NDG sA5d

and

ksRVRTdi jl = ksRVVRTdi jl

= s2Fdi js1 − eMd + di j1

T ot=t0+1

t0+T

os=t0+1

t0+T

Ast +eM

N

−1

To

t=t0+1

t0+T

os=t0+1

t0+TAst

N G<

s2

N+ oS1

TD , sA6d

where we have used the equality

ot=1

u

os=1

u

Ast = F s1 − eMd2u+2 − s1 − eMd2

eMseM − 2d G − us1 − eMd.

Comparing Eqs.(A5) and(A6), we conclude that, with prop-erly chosens2 for the random noises[Eq. (3)] to assure theunity variances in Eq.(7), Eq. (11) is a good approximationin the largeT limit.

[1] M. L. Mehta, Random Matrices, 2nd ed. (Academic Press,New York, 1991).

[2] See, for example, V. V. Flambaumet al., Phys. Rev. A50, 267(1994); A. D. Mirlin, Phys. Rep.326, 259(2000); W.-J. Maetal., Physica A 321, 364 (2003); S. N. Taraskin and S. R. El-liott, Phys. Rev. B65, 052201(2002).

[3] L. Laloux, P. Cizeau, J.-P. Bouchaud, and M. Potters, Phys.Rev. Lett. 83, 1467(1999).

[4] V. Plerouet al., Phys. Rev. Lett.83, 1471(1999); V. Plerouetal., Phys. Rev. E65, 66126(2002).

[5] M. Barthélemy, B. Gondran, and E. Guichard, Phys. Rev. E66, 056110(2002).

[6] M. S. Santhanam and P. K. Patra, Phys. Rev. E64, 016102(2001).

[7] J. Voit, The Statistical Mechanics of Financial Markets(Springer, New York, 2001); R. N. Mantegna and H. E. Stan-ley, An Introduction to Econophysics: Correlations and Com-plexity in Finance(Cambridge University Press, Cambridge,2000); J.-P. Bouchaud and M. Potters,Theory of FinancialRisk: From Statistical Physics to Risk Management(Cam-bridge University Press, Cambridge, 2000).

[8] E. J. Elton and M. J. Gruber,Modern Portfolio Theory andInvestment Analysis, 3rd ed.(Wiley, New York, 1995); J. Y.Campbell, A. W. Lo, and A. C. MacKinlay,The Econometricsof Financial Markets (Princeton University, Princeton, NJ,1997).

[9] L. Bachelier, Ann. Sci. Ec. Normale Super.17, 21 (1900);English translation ofThe Random Character of Stock MarketPrices, edited by P. H. Cootner(MIT, Cambridge, MA, 1964).

[10] T. Lux and M. Marchest, Nature(London) 397, 498 (1999).[11] J. D. Noh, Phys. Rev. E61, 5981(2000).[12] S. Drozdzet al., Physica A 287, 440 (2000).[13] A. M. Sengupta, and P. P. Mitra, Phys. Rev. E60, 3389(1999).[14] N. Vandewalle, F. Brisbois, and X. Tordoir, Quant. Finance1,

372(2001); G. Bonanno, F. Lillo, and R. N. Mantegna,ibid. 1,

96 (2001); R. N. Mantegna, Eur. Phys. J. B11, 193 (1999).[15] L. Giada and M. Marsili, Phys. Rev. E63, 061101(2001).[16] K. Kaneko, Physica D124, 322(1998); 34, 1 (1989); S. Jalan

and R. E. Amritkar, Phys. Rev. Lett.90, 014101(2003); P. M.Gade and C.-K. Hu, Phys. Rev. E60, 4966(1999); 62, 6409(2000).

[17] The fluctuationdr i can be decomposed as the summation ofproducts of the site-dependent and time-dependent parts,dr istd=Ssls

1/2ussidvsstd, whereussid is theith component of theeigenvector corresponding to the eigenvaluels.

[18] The concise algebraic properties ofDM and DG, DMDM

=s−eMdDM andDMDG=DGDM =s−eMdDG, allow for the ana-lytic evaluation of various quantities in the approximation tothe first order of theeg’s for eg!eM.

[19] The formulation Eq.(10) is valid for the general caseDGÞ0(with DM andP replaced, respectively, byD=DM +DG and anew PG which possesses grouping dependence). Using thesame notations, the returns in the model of Ref.[11] are drivenby J adding a set of noisesJG which has itsN rows dividedinto groups,R=JG+J. For each column ofJG, the entries ofthe rows of the same group are the same.JG provides thestatistical overlap in Eq.(11), leading to the blocked structurein the correlation matrix which accounts for the presence ofthose deviated eigenvalues from the bulk of the distribution.

[20] We are looking for the maximum offTC* f over any columnvector f with unity norm

lM* = max

fTf=1

hfTC* fj

since the maximum occurs for the column vectorf equal to theeigenvector corresponding to the largest eigenvalue. For ma-trix C* , f happens to have all its components in the same value,which reflects the general feature of the cooperative activitiesof the market mode. The above equation is also valid between

MA, HU, AND AMRITKAR PHYSICAL REVIEW E 70, 026101(2004)

026101-8

Page 9: Stochastic dynamical model for stock-stock correlations · Ravindra E. Amritkar Physical Research Laboratory, Navrangpura, Ahmedabad 380009, India (Received 2 October 2003; revised

any C and its largest eigenvaluelM. If we consider the timesequences of the column vectors of the fluctuations of returnsdr std which generate the correlation matrixC, we can write

lM = maxfTf=1H 1

Tot=1

T

ffTdr stdg2J .

lM describes then the maximum possible mean square fluctua-tion over time, and the components of the vectorf giving thatmaximum describe the corresponding portfolio.

[21] Using e;1−eM, Eq. (15) can be written as

lM* =

1 + e

e2

e2N

2e2N + s1 + e − e2d.

We are interested in the region thate is small andN is large. Itis apparent that the equation can be reduced as an equality for

two new quantitieslM* ; e2lM

* andx; eN1/2,

lM* =

x2

2x2 + 1S1 + oSe,

1

eNDD ,

for 1/N!e!1.[22] For market data, we analyze the correlation matrix of the log

returns, that is, the changes of log price ln(Pist+1d)−ln(Pistd) for all stocki ’s. The log return can be approximatedby the price returndPi /Pi normalized by the price, which jus-tifies the use of the stochastically identical walks in our model.

[23] According to the random matrix theory[13], the spectrum ofthe eigenvalues for totally random case(eM =0 andeg=0) isgiven by rsld=Q/2plÎslmax−ldsl−lmind with Q=T/N andlmax,min=1+1/Q±2Î1/Q. For Fig. 1, we haveQ=9.420 andlmax,min=1.7578. . . ,0.4545. . ..

[24] We survey the time sequences of trading prices from theTrades and Quotes(TAQ) database for one pool ofN=345 USstocks selected from S&P 500, which have complete recordsover the four years 1996–1999.

[25] We also use our model to fit the spectral data of Ref.[4] withN=1000 andT=6448 (not shown) and find good agreement.For this fit the groups were not chosen from eigenvector analy-sis.

[26] D. Korenblum and D. Shalloway, Phys. Rev. E67, 056704(2003), and references therein.

[27] M. Marsili, Quant. Finance2, 297 (2002).[28] We can express the deviationkr i

2l of the return for walkianalytically as a linear function of the deviationskjk

2l of all thenoisesjk’s k=1, . . . ,N, with the coefficients containing the pa-rameters,eM, ng’s, andeg’s. The couplingseM andeg’s can beestimated by fillingkr2l’s and kj2l’s with the market data.(Inpractice, it is necessary to carry out certain coarse graining forthis mean-field model by neglecting certain stock-by-stock dif-ferences in the real market data.) The estimation is then furtherfinely adjusted to match the spectra. In the procedure, there isalways some compromise between reproducing the groupedactivities in the eigenvectors and matching the distributionspectra.

[29] Normally, the inverse participation ratio[4,5] is used to esti-mate the fraction of components which significantly contributeto that eigenvector. However, it cannot tell us about the ele-ments(walks) which are common to different groups, an in-formation crucial to determine the different groups in our case.Hence, we use a different procedure to determine the groups asdescribed in the text.

STOCHASTIC DYNAMICAL MODEL FOR STOCK-STOCK… PHYSICAL REVIEW E 70, 026101(2004)

026101-9