hybrid softcomputingsystemsforreservoirpvtpropertiesprediction

11
Hybrid soft computing systems for reservoir PVT properties prediction Amar Khoukhi n Systems Engineering Department, King Fahd University of Petroleum and Minerals, Dhahran, 31261, Saudi Arabia article info Article history: Received 25 April 2011 Received in revised form 15 March 2012 Accepted 18 March 2012 Available online 1 April 2012 Keywords: Pressure–Volume–Temperature Oil formation volume factor Bubble point pressure Correlation Genetically-optimised neural networks Genetic-adaptive neuro-fuzzy inference systems abstract In reservoir engineering, the knowledge of Pressure–Volume–Temperature (PVT) properties is of great importance for many uses, such as well test analyses, reserve estimation, material balance calculations, inflow performance calculations, fluid flow in porous media and the evaluation of new formations for the potential development and enhancement oil recovery projects. The determination of these properties is a complex problem because laboratory-measured properties of rock samples (‘‘cores’’) are only available from limited and isolated well locations and/or intervals. Several correlation models have been developed to relate these properties to other measures which are relatively abundant. These models include empirical correlations, statistical regression and artificial neural networks (ANNs). In this paper, a comprehensive study is conducted on the prediction of the bubble point pressure and oil formation volume factor using two hybrid of soft computing techniques; a genetically optimised neural network and a genetically enhanced subtractive clustering for the parameter identification of an adaptive neuro-fuzzy inference system. Simulation experiments are provided, showing the perfor- mance of the proposed techniques as compared with commonly used regression correlations, including standard artificial neural networks. & 2012 Elsevier Ltd. All rights reserved. 1. Introduction 1.1. Motivation and background A reservoir is a volume of porous sedimentary rock, which has been filled with a substantial amount of hydrocarbons, such as crude oil and natural gas. High-quality rock and fluid data are critical for reliable modelling, reservoir-engineering calculations, and performance predictions by use of reservoir simulators and for subsequent economic analysis (Al-Hussainy and Humphreys, 1996). Typically, reservoir properties consist of a set of parameters which are used to characterise the spatially varied geological information. Fluid characterisation quantifies the reservoir phase behaviour, fluid compositional changes throughout the reservoir, and changes in fluid properties as a result of production and injection processes. The determination of reservoir properties is however a com- plex problem, because laboratory-measured properties of rock samples (‘‘cores’’) are only available from limited and isolated well locations and/or intervals. There is a great demand for the development of correlation models to relate these properties to other, more relatively abundant, measures. One example of such a kind of measures is ‘‘well logs’’, which are a series of multi-type digital measurements along the vertical depth of drilled wells. These models are used to transform the well log data into reservoir properties at locations where no cores are available. Pressure–Volume–Temperature (PVT) properties are key para- meters associated with the characterisation of any hydrocarbon reservoir. In fact, it is not possible to have accurate solutions to many petroleum engineering problems without having accurate estimates of these properties. Furthermore, the control of the relationship between the surface and reservoir hydrocarbon volumes and the underground withdrawal is made possible by knowing the oil PVT parameters. To overcome the abovementioned problems of expensive laboratory-based tests and the possible non-availability of the right samples from the well-bore or well surface, equations of state models and several empirically derived correlations between those properties and well data, and eventually artificial neural networks, have been developed based on available data for different regions of the world. In this paper, the capabilities of soft computing techniques are explored and investigated by gathering their respective power- fulness in hybrid optimised structures, so as to improve their performance compared to the use of each technique alone. A case study of two important PVT properties prediction problem is considered. These two properties are bubble point pressure (P b ), and oil formation volume factor (B ob ). The former defines the pressure at which the first gas comes out of solution in oil. It is a function of temperature, oil gravity, gas gravity and gas/oil ratio, Contents lists available at SciVerse ScienceDirect journal homepage: www.elsevier.com/locate/cageo Computers & Geosciences 0098-3004/$ - see front matter & 2012 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.cageo.2012.03.016 n Tel.: þ966 3 860 7614; fax: þ966 3 860 2965. E-mail address: [email protected] Computers & Geosciences 44 (2012) 109–119

Upload: christian-fabian-garcia-romero

Post on 05-Sep-2015

7 views

Category:

Documents


0 download

DESCRIPTION

Hybrid softcomputingsystemsforreservoirPVTpropertiesprediction

TRANSCRIPT

  • rv

    ahra

    Genetically-optimised neural networks

    Genetic-adaptive neuro-fuzzy inference

    systems

    he k

    , suc

    atio

    nt

    rob

    ite

    late

    cor

    ve s

    sing

    network and a genetically enhanced subtractive clustering for the parameter identication of an

    adaptive neuro-fuzzy inference system. Simulation experiments are provided, showing the perfor-

    mance of the proposed techniques as compared with commonly used regression correlations, including

    sedimnt ofality rovoir-enof resessainynsist

    other, more relatively abundant, measures. One example of such a

    ta for

    s are

    performance compared to the use of each technique alone. A case

    Contents lists available at SciVerse ScienceDirect

    .el

    Computers &

    Computers & Geosciences 44 (2012) 109119function of temperature, oil gravity, gas gravity and gas/oil ratio,E-mail address: [email protected] of measures is well logs, which are a series of multi-type study of two important PVT properties prediction problem isconsidered. These two properties are bubble point pressure (Pb),and oil formation volume factor (Bob). The former denes thepressure at which the rst gas comes out of solution in oil. It is a

    0098-3004/$ - see front matter & 2012 Elsevier Ltd. All rights reserved.

    http://dx.doi.org/10.1016/j.cageo.2012.03.016

    n Tel.: 966 3 860 7614; fax: 966 3 860 2965.well locations and/or intervals. There is a great demand for thedevelopment of correlation models to relate these properties to

    explored and investigated by gathering their respective power-fulness in hybrid optimised structures, so as to improve theirThe determination of reservoir properties is however a com-plex problem, because laboratory-measured properties of rocksamples (cores) are only available from limited and isolated

    neural networks, have been developed based on available dadifferent regions of the world.

    In this paper, the capabilities of soft computing techniquewhich are used to characterise the spatially varied geologicalinformation. Fluid characterisation quanties the reservoir phasebehaviour, uid compositional changes throughout the reservoir,and changes in uid properties as a result of production andinjection processes.

    To overcome the abovementioned problems of expensivelaboratory-based tests and the possible non-availability of theright samples from the well-bore or well surface, equations ofstate models and several empirically derived correlationsbetween those properties and well data, and eventually articial1. Introduction

    1.1. Motivation and background

    A reservoir is a volume of porousbeen lled with a substantial amoucrude oil and natural gas. High-qucritical for reliable modelling, reserand performance predictions by usesubsequent economic analysis (Al-Hu

    Typically, reservoir properties co& 2012 Elsevier Ltd. All rights reserved.

    entary rock, which hashydrocarbons, such asck and uid data aregineering calculations,rvoir simulators and forand Humphreys, 1996).of a set of parameters

    digital measurements along the vertical depth of drilled wells.These models are used to transform the well log data intoreservoir properties at locations where no cores are available.

    PressureVolumeTemperature (PVT) properties are key para-meters associated with the characterisation of any hydrocarbonreservoir. In fact, it is not possible to have accurate solutions tomany petroleum engineering problems without having accurateestimates of these properties. Furthermore, the control of therelationship between the surface and reservoir hydrocarbonvolumes and the underground withdrawal is made possible byknowing the oil PVT parameters.standard articial neural networks.Hybrid soft computing systems for rese

    Amar Khoukhi n

    Systems Engineering Department, King Fahd University of Petroleum and Minerals, Dh

    a r t i c l e i n f o

    Article history:

    Received 25 April 2011

    Received in revised form

    15 March 2012

    Accepted 18 March 2012Available online 1 April 2012

    Keywords:

    PressureVolumeTemperature

    Oil formation volume factor

    Bubble point pressure

    Correlation

    a b s t r a c t

    In reservoir engineering, t

    importance for many uses

    inow performance calcul

    the potential developme

    properties is a complex p

    are only available from lim

    have been developed to re

    models include empirical

    this paper, a comprehensi

    formation volume factor u

    journal homepage: wwwoir PVT properties prediction

    n, 31261, Saudi Arabia

    nowledge of PressureVolumeTemperature (PVT) properties is of great

    h as well test analyses, reserve estimation, material balance calculations,

    ns, uid ow in porous media and the evaluation of new formations for

    and enhancement oil recovery projects. The determination of these

    lem because laboratory-measured properties of rock samples (cores)

    d and isolated well locations and/or intervals. Several correlation models

    these properties to other measures which are relatively abundant. These

    relations, statistical regression and articial neural networks (ANNs). In

    tudy is conducted on the prediction of the bubble point pressure and oil

    two hybrid of soft computing techniques; a genetically optimised neural

    sevier.com/locate/cageo

    Geosciences

  • A. Khoukhi / Computers & Geosciences 44 (2012) 109119110and it is needed to know when the gas is dissolved from oil. Bob isa function of pressure and temperature, which represents theratio of oil and dissolved gas volume under reservoir conditions tothe oil volume at standard conditions.

    1.2. Correlation-based prediction

    Since the pioneering work of Katz (1942), who developed vemethods for predicting crude oil shrinkage in 1942, a considerablevolume of literature has accumulated on the study of PVTcorrelations for crude oil. Various correlations were proposed bydifferent researchers from all over the world, with varyingdegrees of accuracy in terms of average error, and based on crudesamples from different oil elds (Katz, 1942; Glaso, 1980). Most ofthese correlations were developed empirically using graphical orregression methods. Several efforts were also made to develop auniversal correlation.

    In Al-Marhoun (1988), a correlation was published for Bobusing 11,728 experimental values representing samples taken

    Nomenclature

    EOS Equation-of-stateFVF Formation volume factorPVT Pressure/Volume/TemperatureAPI American Petroleum InstituteANN Articial neural networksRBFNN Radial basis feed-forward neural networksGA Genetic AlgorithmANFIS Adaptive neuro-fuzzy inference systemGANFIS Genetic adaptive neuro-fuzzy inference systemGONN Genetically optimised neural networksEr Average percent relative errorEa Average absolute percent relative errorEmin Minimum root mean square errorEmax Maximum root mean square errorRMSE Root mean square errorSD Standard deviationR2 Correlation coefcientfrom 700 reservoirs, mostly from the Middle East and NorthAmerica. Other correlations were also developed for the Gulf ofSuez and Malaysian Crude oils by Al-Marhoun (1988), Macary andEl-Batanoney (1992), respectively. In Omar and Todd (1993),a reliability analysis was conducted for PVT correlations. InDe Ghetto et al. (1995), correlations were published for propertiesof UAE crude oils using 62 data sets from UAE reservoirs.A comprehensive review of various empirical correlations isreported in Almehaideb (1997) and Sutton (2008).

    1.3. Articial neural networks based prediction

    Several articial neural network (ANN) correlations have alsobeen proposed. The prediction performance was assessed throughfour main criteria. The rst is the average percent relativeerror (Er), which measures the relative deviation of the resultsof the prediction from the experimental data. The second is theaverage absolute percent relative error (Ea), which measuresthe relative absolute deviation of the prediction results from theexperimental values. The standard deviation (SD) measures thevariability of the obtained results with respect to the averagevalue. The forth criterion consists of the correlation coefcient R2,which indicates the degree of success in reducing the standarddeviation by regression analysis.In Elsharkawy (1998) a two hidden layers radial basis functionneural network model (RBFNM) was proposed to predict PVTproperties for crude oil and gas. The model predicts oil formationvolume factor, solution gas oil ratio, oil viscosity, saturated oildensity, under-saturated oil compressibility, and evolved gasgravity. The input data to the RBFNM were reservoir pressure,temperature, stock tank oil gravity and separator gas gravity.For the oil formation volume factor, the RBFNM resulted in abetter accuracy than as-to-date available PVT correlations. In thisstudy, the average percent relative error (Er) was estimated forthe training to 0.06% and 0.08% for testing. The average absolutepercent relative error (Ea) was obtained as 0.87% for training and0.53% for testing. The standard deviation (SD) was found to be1.28% for training and 0.57% for testing. The correlation coefcientR2 was determined as 99.46% for the training samples and 98.24%for testing.

    In Varotsis et al. (1999), a novel approach for predicting thecomplete PVT behaviour of reservoir oil and gas condensates wasintroduced using an ANN. The method used key measurements

    Res_Temp Reservoir temperature (1F)Mol_N2 Mole fraction of N2 (mol%)Mol_CO2 Mole fraction of CO2 (mol%)Mol_H2S Mole fraction of H2S (mol%)p Pressure (psi)Pb Bubble point pressure (psi)Pod Pressure at dead oil viscosity (psi)Bob Oil formation volume factor (RB/STB)Rs Solution gas/oil ratio, SCF/STB (m

    3/m3)Rsb Bubble point solution gas/oil ratio, SCF/STB (m

    3/m3)T Temperature (1F)V Volume (m3)ma Viscosity above bubble point (cP)mb Viscosity below bubble point (cP)mo Oil viscosity (cP)mob Bubble point/gas-saturated oil viscosity (cP)mod Dead oil viscosity (cP)gg Gas relative density (API)that can be performed rapidly, either in a lab or at the well site, asinput to the ANN. The ANN was trained by a PVT database of over650 reservoir uids originating from all parts of the world. Thetesting of the trained ANN indicated that, for all uid types, mostof the PVT properties estimates can be obtained with a very lowmean relative error of 0.52.5%, with no data set producing arelative error of more than 5%. This level of error is considered asbetter than that provided by tuned equation-of-state (EOS)models, which are still in common use for the estimation ofreservoir uid properties.

    In Al-Marhoun and Osman (2002), two new models werepresented to predict Pb and Bob at the bubble-point pressure usinga Multi-layer Preceptron (MLP), trained by backpropagation withearly stopping, for Saudi Arabian crudes. Both models were basedon ANNs, and developed using 283 unpublished data setscollected from different Saudi Arabian elds. Off the 283 data sets,142 were used to train the Bob and Pb ANNs, 71 to cross-validate therelationships established during the training process and adjustthe calculated weights, and the remaining 70 were used to test themodel and evaluate its accuracy. The results showed that thedeveloped Bob model provides better predictions and higher accu-racy than previously published empirical correlations.

    In Goda et al. (2003) another ANN correlation was developedto predict both Pb and Bob with the aid of two separate networks.

  • effect of each geologic driver on fractures, in an effort to assist a

    2. Problem statement

    lysareiso

    incper

    A. Khoukhi / Computers & Geosciences 44 (2012) 109119 111geologist or reservoir engineer in being able to identify, locallyand globally, the key geologic drivers affecting fractures. Auliaet al. (2010) used data mining for smart oileld reservoir analysis.In El-Sebakhy (2009), a support vector regression (SVR) was usedfor generating correlations for forecasting bubble point pressureusing three different published PVT databases, which was claimedto outperform empirical correlation and standard neural networkmodels.

    All these studies prove that correlations based on data miningtechniques are more accurate than empirical correlations. How-ever, most of these correlations were found to be appropriate forthe specic region where the parameters were measured, but notfor other regions. Furthermore, the neural network correlationsdeveloped are often limited, and global correlations are usuallyless accurate compared to local correlations. Nevertheless, theachievements of neural networks opened the door to soft com-puting techniques to play a major role in the oil and gas industry.Adaptive neuro-fuzzy inference systems have been proposed as anew intelligence framework for both prediction and classicationbased on fuzzy clustering optimisation criterion and ranking(Jang et al., 1996).

    This paper is an upgraded and extended version of two otherpapers (Khoukhi and Albukhitan, 2010, 2011; Khoukhi et al.,2011) that are concerned with estimating two PVT properties ofcrude oil systems; namely bubble point (Pb) and oil formationvolume factor (Bob). The main extension is the performance offurther experiments, including a genetically optimised neuralnetwork (GONN), and a genetic adaptive neuro-fuzzy inferencesystem (GANFIS), along with comparisons with standard ANN,The data used was a set of 160 measured points collected fromthe Middle East, where 120 points were dedicated to training, and20 for testing. The Bubble point network was constructed of twohidden layers, with ten neurons for each layer. All hidden neuronswere activated by a log sigmoid function. The four input datawere temperature, API gravity, gas oil ratio and gas relativedensity. The output neuron was designed to be activated withpure linear functions. The results showed that the network giveshigher accuracy in the prediction task than other publishedempirical correlations. The network has an average percentrelative error of 0.030704 and correlation coefcient of 0.9981.Other correlations were introduced in Labedi (1990) and Suttanand Farshad (1990) for Libyan and Gulf of Mexico crude oils.

    In Osman and Al-Marhoun (2005), two new models weredeveloped to predict different brine properties. The rst modelpredicted brine density, oil formation volume factor (Bob),isothermal compressibility as a function of pressure and tem-perature and salinity. The second model was developed to predictbrine viscosity as a function of temperature and salinity alone.The models were developed using 1040 published data sets. Thesedata were divided into three groups: training, cross-validationand testing. Radial Basis Functions (RBF) and Multi-layer Precep-tron (MLP) neural networks were utilised in the study. Trend testswere performed to ensure that the developed model would followphysical laws. The results showed that the developed modelsoutperform the published correlations in terms of absoluteaverage percent relative error, correlation coefcient and stan-dard deviation.

    1.4. Hybrid soft computing based techniques

    Soft and mimetic computing techniques other than ANN werealso used (Ouenes, 2000; Jang et al., 1996). In Ouenes (2000),fuzzy neural networks were used to evaluate the hierarchicalANFIS and state-of-the-art regression based correlations.3. Proposed approach

    3.1. Adaptive neuro-fuzzy inference systems (ANFIS)

    Neuro-fuzzy inference systems are hybrid classication/fore-casting frameworks, which learn the rules and membershipperformance indices are detailed in Appendix II.The second comparison is through graphical representation of the

    correlations between the actual and predicted values of Pb and Bob.The third approach is through graphical representation of

    errors as a function of correlations (Al-Marhoun, 1988, 1992;Al-Marhoun and Osman, 2002). Scatter plots are shown for theabsolute percent relative error (Ea) versus the correlation coef-cient for all computational intelligence forecasting schemes andthe most common empirical correlations. Each modeling schemeis represented by a symbol; the good forecasting scheme shouldappear in the upper left corner of the graph.perstafunThe rst standard is an assessment through six error criteria;luding the average percent relative error (Er), average absolutecent relative error (Ea), minimum and maximum absolutecent error (Emin and Emax), root mean square errors (RMSE),ndard deviation (SD), and correlation coefcient (R2). Thesemework to other empirical correlations, statistical error ana-is and quality measures are performed. Three main standardsused in assessing the suggested techniques and their compar-ns with regression-based models.fra2.1. Data acquisition and pre-processing

    Three distinct databases were used in this study to implementthe proposed techniques, to forecast both bubble point pressure(Pb) and oil formation volume factor (Bob) based on the same fourinput parameters; namely solution gasoil ratio (Rs), reservoirtemperature (T), oil gravity (API), and gas relative density (gg).The properties of the three datasets acquired are explained below;a statistical description of the datasets appears in Appendix I:

    The rst dataset refers to work conducted by Al-Marhoun, whopublished correlation for estimating Pb and Bob for MiddleEastern oils (Al-Marhoun, 1988). The dataset consists of 160observations collected from 69 Middle Eastern reservoirs.

    The second dataset were retrieved from works conducted byAl-Marhoun and Osman (2002). The dataset consists of 283observations collected from different Saudi Arabian oil eldsfor the prediction of Pb and Bob at Pb for Saudi crude oils.

    The third dataset was obtained from the work conducted byOsman and Al-Marhoun (2005), this dataset contains 782observations collected from oil elds in Malaysia, the MiddleEast, Gulf of Mexico and Columbia.

    2.2. Statistical quality measures

    To compare the performance and accuracy of the proposedIn Section II, the problem is formulated and the datasets usedare introduced. Section III presents the proposed approach to theproblem and the implemented techniques. Section IV reports onthe simulation experiments and a comparative study. Section Vconcludes this work and offers perspectives and future trends.ctions from data. It is a network of nodes and directional

  • 1996). Subtractive clustering is used in the problem at hand as itprovides a fast one-pass algorithm to take input-output trainingdata to generate an adaptive fuzzy inference system that is bettertailored to the dataset. The subtractive clustering methodassumes each data point is a potential cluster centre, andcalculates a measure of likelihood that each data point will denea cluster centre, based on the density of the surrounding datapoints. It starts with the normalisation of all values in the datasetto t in a hypercube unit (Jang et al., 1996). Each cluster centremay be translated into a fuzzy rule for identifying a class. Thecluster radius indicates the range of its inuence. Specifying asmall cluster radius yields many small clusters in the data, thusresulting in many rules. Conversely, choosing a large radius willlead to few rules, misrepresenting therefore the data. Many trialsconsidering different radii values have led to different predictionaccuracies. A genetic algorithm (GA) is used to ne-tune theclustering parameter radii (ra), which then leads to the geneticadaptive neuro-fuzzy inference system (GANFIS). An importantliterature had been published recently on genetic fuzzy systems(GFS). Zanganeh et al., 2006 used a hybrid genetic fuzzy inferencesystem for wave parameters prediction.

    The step-by-step implementation of GANFIS is clearly denedand given as follows (Fig. 2). For the training data set, each radiushas a value between a lower bound of 0.10 and an upper bound of0.90. The tness function used for the genetic search is the leastsquared error between the predicted and target value for the

    A. Khoukhi / Computers & Geosciences 44 (2012) 109119112links. Associated with the network is a learning rule, for instance,back-propagation. These networks are learning a relationshipbetween inputs and outputs. This type of network covers anumber of different approaches, namely Mamdani type andTakagiSugenoKang (TSK) type (see Jang et al., 1996) for moredetail. Unlike the Mamdani method, also referred to as subjectivefuzzy modelling as it builds the fuzzy if-then rules through expertstatements which might involve vagueness and subjectivity,the TSK fuzzy objective modelling method is a framework forgenerating fuzzy if-then rules from input/output numerical data.A way to construct a TSK fuzzy model from numerical dataproceeds in three steps: fuzzy clustering, setting of the member-ship functions, and parameter estimation (Nikravesh et al., 2003;Jang et al., 1996). Fig. 1 shows an ANFIS System with a two-inputstwo-rules one-output arrangement.

    The implemented ANFIS in the study at hand is made up of sixlayers. The rst layer is the input layer, characterising the crispinputs. The second layer performs the fuzzication of the crispinputs into linguistic variables, through Gaussian transfer func-tions. The third is the rule layer, which applies the product t-normto produce the ring strengths of each rule. This is followed by anormalisation layer, at which each node calculates the ratio of arules ring strength to the sum of the ring strengths of all rules.The fth layer performs the defuzzication. The last layer con-ducts the aggregation, where an output is obtained as thesummation of all incoming signals. The training rule option usedis the LevenbergMarquard version of the gradient back-propaga-tion algorithm.

    Fuzzification T-norm Normalization Defuzzification1 2x x

    1 2x x

    +

    2 2w y

    1 1w y

    y

    N

    11

    1 2

    ww

    w w

    22

    1 2

    ww

    w w

    1w

    N2w

    11

    12

    21

    22

    2x

    1x

    =

    +

    +=

    Fig. 1. ANFIS system with two inputs two-rule one-output.3.2. Genetic adaptive neuro-fuzzy inference system (GANFIS)

    Genetic Algorithms (GAs) are intelligent search mechanismsbased on the principle of natural selection and populationgenetics that are transformed by three genetic operators: selec-tion, crossover and mutation. Each string (chromosome) is apossible solution to the problem being optimised, and each bit(or group of bits) represents a value or of some variable (gene) ofthe problem. These solutions are classied by an evaluationfunction, giving better values, or tness, for better solutionsindividuals. Each solution must be evaluated by the tnessfunction to produce a value. Different crossover and mutationrates are used for the optimisation using genetic algorithms. Theability of genetic programming and adaptive neuro-fuzzy infer-ence system (ANFIS) techniques were considered for groundwater depth forecasting (Shiri and Kis-i, 2010).

    To reduce the number of rules that are generated during ANFISimplementation, different clustering methods have been pro-posed, e.g., subtractive clustering and fuzzy-C means (Jang et al.,output variable.A population initialisation of size N and a termination criterion

    are rst dened. The termination is either the maximum numberof generations or errors. Then the number n of radii are generatedrandomly radii[r1,r2, y, rn] in the solution space. Note hereradiis are synonymous with chromosomes, also called indivi-duals. Therefore each population now has (Nn) chromosomes.

    The whole population in the generation is evaluated throughANFIS, using the root mean squared error of the testing data as acriterion of best t. Candidate chromosomes for the new generationare selected from the population, using a tournament selectionprocedure. Then, crossover and mutation operations are performed

    No1t t= +

    No

    i = i+1

    Run Genetic operators

    Run ANFIS

    Yesi nN

    Data Reading

    RMSE<

    Yes

    No t > T Yes

    Select clustering radiustir

    Run ANFIS

    Data Normalisation

    t =1

    Generate N individual 1[ ,.., ]t t tNr r r=

    i =1

    RMSE<

    Yes

    No

    Stop the program with tr as

    clustering radii Fig. 2. Flowchart of the proposed GANFIS approach.

  • on the selected chromosomes. The Nn best individuals are selectedfrom the parent and child populations (elitism). These new indivi-duals will form the parent population for the next generation. Theprocess is repeated until the termination criterion is met.

    Other GA algorithm settings consist of the number of genera-tions (20), and population size (10). Population type (double),Selection function: (Stochastic uniform), Crossover function:(Scattered with crossover rate: 0.80), Mutation function:(Gaussian with mutation rate: 0.20). The other options of ANFISare set to Matlab default. The results for GANFIS show a sig-nicant improvement in both the bubble point pressure (Pb) andoil formation volume factor (Bob) predictions than when the trial-and-error selection method was used for the radii.

    3.3. Genetically optimized neural networks (GONN)

    At this stage we consider a hybrid genetic neural network forthe Pb and Bob (at Pb) prediction problem.

    Recently, several hybrids of genetic algorithm with ANNs wereproposed [Balan et al., 1996]. In Oloso et al. (2009) a differentialevolutionary articial neural network was introduced for predictingviscosity and gas/oil ratio curves.

    schemes. Based on the results obtained within the external

    Test ANN Generate N individual 11 ,...,

    NRw w

    A. Khoukhi / Computers & Geosciences 44 (2012) 109119 113 Train ANN

    Generate New population

    Run Generate operators

    No

    i = i+1Yes

    Select weight vector

    Train ANN

    i =1

    Stop, Display optimal weights

    i N

    No

    Yes

    RMSE

  • correlations methods. In addition, these schemes have a highaccuracy in predicting Pb and Bob values with stable performancesand achieved the lowest absolute percent relative errors, lowestminimum errors, lowest maximum errors, lowest root meansquare errors, and highest correlation coefcients among theother correlations for the three distinct databases used. Never-theless, because the three datasets used are from differentlocations and reservoirs, the impact produced on the predictionperformance of each dataset is independent and not signicantlycorrelated to the other. Henceforth, the results of the presentstudy are discussed for each data set. Moreover, as noted in theintroduction, most of the empirical correlations developed, orthose developed using graphical or regression methods, are onlyapplicable to a specic region and data set, effort is still beingmade to come up with a universal correlation.

    Figs. (4)(8) illustrate a sample of the scatter plots of thepredicted results versus the experimental data for Pb and Bob

    values using the distinct data sets provided at training and testingstages. These cross-plots indicate the degree of agreementbetween the actual (experimental) and the predicted values. Foran ideal model, all points on the plot should appear on a line ofangle of 451 with respect to the x-axis, with equidistant pointsfrom both axes and with a correlation coefcient equal to 1.

    Also as it is clear from the Figures, that the regression modelshave not good generalization capabilities as compared to softcomputing based models. This fact was also reported in a previousstudy by Al-Marhoun (1988, 1992), Al-Marhoun and Osman(2002). They noticed especially that the developed regressionmodels using Middle East crude oils data, when tested on newdata from different crudes, they fail to provide with acceptableperformance.

    Fig. 9 shows a scatter plot of Ea versus R2 at the training stagefor all modeling schemes that are used to determine Bo based onthe data set used in Osman and Al-Marhoun (2005). We observe

    Fig. 4. Correlation coefcient and cross-plot for GONN Bob (left) and Pb (right) prediction (1st dataset).

    A. Khoukhi / Computers & Geosciences 44 (2012) 109119114Fig. 5. Correlation coefcient and cross-plot of GANFIS for Bob and Pb (1st dataset).

  • A. Khoukhi / Computers & Geosciences 44 (2012) 109119 115that the symbol corresponding to the GONN scheme falls in theupper left corner with Ea0.10015% and R20.9999, whileGANFIS very close to GONN with Ea0.1013% and R20.9998;whereas standalone ANN got Ea1.7886% and R20.9878.

    Fig. 7. Correlation coefcient and cross-plo

    Fig. 8. Bob Correlation coefcient and cross-plot of regression models. (t

    Fig. 6. Pb Correlation coefcient crosEmpirical correlations indicate higher error values with lowercorrelation coefcients, such as, Standing with Ea2.1923% andR20.9874; Al-Marhoun with Ea2.1037% and R20.9846; andGlaso Correlation with Ea2.8173% and R20.97813.

    ts of ANN for Bob and Pb (1st dataset).

    op to bottom: (a) Al-Marhoun (b) Standing, (c) Glaso) (1st dataset).

    s-plot for GANFIS (2nd dataset).

  • Fig. 10. Performance criteria obtained using dataset 2 for the eight methods forbubble point pressure (Pb).

    0.9

    0.92

    0.94

    0.96

    0.98

    1

    1.02

    0

    Cor

    rela

    tion

    Coe

    ffici

    ent R

    2

    Absolute Percent Relative Error EA

    ANN

    Standing

    Marhoun

    Glaso

    GANFIS

    GONN

    0.5 1 1.5 2 2.5 3 3.5

    Fig. 9. Performance of different techniques (3rd dataset Correlation coefcient vs.absolute percent relative error).

    A. Khoukhi / Computers & Geosciences 44 (2012) 109119116Figs. 10 and 11 show different error criteria obtained usingdataset 2 for the six methods for bubble point pressure (Pb) andoil formation volume factor (Bob), respectively. The gures showclearly that the GANFIS and GONN are very competitive andproduce the best performance, as compared to standalone ANN

    Fig. 11. Performance criteria obtained using dataset 2 for the eight methods for oilformation volume factor (Bob).

    Table 1Time complexity of all models for Pb Prediction (1st dataset).

    Model Training Testing

    Cputime (s)

    Standing 34.819 0.0682

    Marhoun 15.254 0.0782

    Glaso 19.025 0.072

    ANN 724.54 0.088

    GONN 57600 0.0182GANFIS 69200 0.0096

  • and common empirical correlations including the correlationsby Al-Marhoun (1988), Osman and Al-Marhoun (2005), andAl-Marhoun and Osman (2002)) and multi-layer back-propaga-tion neural networks. The six considered error criteria, include theaverage percent relative error (Er), average absolute percentrelative error (Ea), minimum and maximum absolute percenterror (Emin and Emax), root mean square errors (RMSE), standarddeviation (SD), and correlation coefcient (R2).

    Finally, regarding time complexity, Table 1 shows the compu-tational processing time for all models for the Pb prediction. It isclear that GANFIS and GONN are very demanding in computa-tional time, as compared to stand alone ANN and regression basedmodels. However this does not jeopardize usage as these compu-tations are performed ofine in the development stage, and oncethe parameters of GANFIS and GONN are optimized these struc-tures are then used for online calculations with signicantlylower computational time as compared to ANN, but with betterprediction accuracy.

    5. Conclusion and future work

    In this paper, a comprehensive study is conducted on theprediction problem of PressureVolumeTemperature (PVT)properties, namely bubble point pressure and oil formationvolume factor. Several soft computing techniques are explored;including articial neural networks, and adaptive neuro-fuzzy

    inference systems. The paper also presented two hybrids; agenetically optimised neural network and a genetically enhancedsubtractive clustering technique for parameter identication ofthe adaptive neuro-fuzzy inference system.

    Three distinct published databases were utilised to investigate thecapabilities of the soft computing techniques. Based on simulations,and by comparing the results obtained, one can conclude that, thegenetic neural network and the genetic neuro-fuzzy inference systemschemes showed better performance in predicting Pb and Bob valueswith stable performance, and achieved the lowest absolute percentrelative error, lowest minimum error, lowest maximum error, lowestRMSE and the highest correlation coefcient R2 compared to themany existing empirical correlations used for the three distinct datasets. The plan is to push this hybrid computing further by incorpor-ating some recent advances in soft computing, like optimized supportvector regression and extreme learning machines and granularcomputing (Khoukhi et al., 2011; Zhu et al., 2005; Xu and Shu,2006; Scholkopf and Smola, 2002; Yu and Pedrycz, 2009) and otherintelligent search techniques. Some these hybrids are being tested inongoing works. The resulting massive computational time required toimplement these hybrids will be tackled using the High PerformanceComputing (HPC) facilities available to achieve higher performancewith a reasonable time complexity. Furthermore, the proposed hybridevolutionary soft computing techniques are exible, reliable and can

    Table A.1Statistical description of dataset 1 used for PVT models (160 records).

    Parameter Min Max Average SD

    Input variablesTemperature (Tf), (1F) 74 240 144.43 39.036Gasoil ratio (Rs), (SCF/STB) 26 1602 557.66 403.12

    Gas relative density (gg) 0.69 1.367 0.96417 0.17103Api oil gravity, (degrees api) 19.4 44.6 32.388 5.7444

    Output variables

    ds).

    Input variables

    ds).

    A. Khoukhi / Computers & Geosciences 44 (2012) 109119 117Temperature (Tf ), (1F) 75Gasoil ratio (Rs) (SCF/STB) 24

    Gas relative density (gg) 0.7527API oil gravity (degrees API) 17.5

    Output variablesBubble point pressure (Pb, psi) 90

    Oil FVF at Pb, (RB/STB) 1.0308

    Table A.3Statistical description of dataset 3 used for PVT models (782 Recor

    Parameter Min

    Input variablesTemperature (Tf ), (1F) 58Gasoil ratio (Rs) (SCF/STB) 8.61

    Gas relative density (gg) 0.511API oil gravity (degrees API) 11.4

    Output variablesBubble point pressure (Pb), (psi) 107.33

    Oil FVF at Pb, (RB/STB) 1.028Bubble point pressure (Pb), (psi) 130 3573 1731.1 1084.6

    Oil FVF at Pb, (RB/STB) 1.032 1.997 1.3036 0.20564

    Table A.2Statistical description of dataset 2 used for PVT models (283 Recor

    Parameter MinAppendix I. Statistical description of datasets used

    See Tables (A.1)(A.3).

    Max Average SD

    240 147.35 47.772

    1453 432.46 303.58

    1.8195 1.008 0.14826

    44.6 31.622 5.2518

    3331 1390.2 860.66

    1.889 1.2504 0.15824

    Max Average SD

    341.6 181.9 51.984

    3617.3 541.75 483.68

    1.789 0.88825 0.18556

    63.7 34.588 8.7286

    7127 2006.1 1291.2

    2.887 1.3362 0.27201Acknowledgments

    This work is supported by King Fahd University of Petroleumand Minerals under Grant no. SB100014.be implemented for other related oil and gas industry problems,especially in the prediction of permeability and porosity, historymatching, data management and rock mechanics properties. Some ofthese tasks are being undertaken in ongoing works.

  • A. Khoukhi / Computers & Geosciences 44 (2012) 109119118Appendix II. Performance indices

    Average percent relative error: Measures the relative devia-tion from the experimental data, given as:

    Er 1n

    Xni 1

    Ei II:1

    where Ei is a relative deviation of an estimated value from anexperimental value.

    Ei BobexpBobest

    Bobexp

    " #i

    100 i 1,2,. . .n II:2

    Average absolute percent relative error: Measures the rela-tive absolute deviation from the experimental values, dened as:

    Ea 1

    n

    Xni 1

    9Ei9 II:3

    Minimum absolute percent relative error: To dene therange of error for each correlation, the calculated absolute percentrelative error values are scanned to determine the minimumvalues. They are dened by:

    Emin minn

    i 19Ei9 II:4

    Maximum absolute percent relative error: Similarly, themaximum absolute percent relative error is

    defined as : Emax maxn

    i 19Ei9 II:5

    Root mean squares error: Measures the data dispersionaround zero deviation, dened by:

    RMSE 1n

    Xni 1

    E2i

    " #1=2II:6

    The correlation coefcient: Represents the degree of successin reducing the standard deviation by regression analysis. It isdened by:

    R1

    Xni 1

    BobexpBobest2i =Xni 1

    BobexpBob2i

    vuut II:7where

    Bob 1

    n

    Xni 1

    Bobexpi II:8

    Appendix III. Regression models used (Standing, 1947;Glaso, 1980; Al-Marhoun, 1992)

    Standing (1947) proposed the following empirical expressionsfor the formation volume factor and bubble point pressure:

    Bbob 0:97590:00012 Rsgggo

    0:51:25t

    " #1:2III:1

    Pb 18:2Rs=gg0:8310a1:4 III:2

    a 0:00091T 0R 4600:0125API

    Glaso (1980) suggested the following expression:

    Bbob 110A III:3where

    n n 2A6:585112:91329logBob0:27683logBob III:4With Bnob given as:

    Bnob Rsggg0

    0:5260:968T460 III:5

    With respect to the bubble point pressure, he found:

    logPb 1:76691:7447logPnb0:30218logBnob2 III:6

    Pnb Rs=ggaTbAPIc III:7

    a 0:816, b 0:172, c0:989Al-Marhoun, 1992, determined the following empirical corre-

    lation of the formation volume factor with respect to the gas/oilratio, gas gravity, oil gravity and temperature:

    B0 0:4970690:862963:103T0:182594:102F0:318099:105F2 III:8

    with

    F R2s gbggc0 and a 0:74239, b 0:323294, c120204III:9

    He proposed the following for the bubble point pressure:

    Pb aRbsgcggd0Tc III:10

    with a5.38088103, b0.715082, c1.87784,d 3:1437 e 1:32657

    References

    Al-Hussainy, R., Humphreys, N., 1996. Reservoir management principles andpractices. Journal of Petroleum Technology 48 (12), 11291135.

    Al-Marhoun, M.A., 1988. PVT correlations for Middle East crude oils. Journal ofPetroleum Technology 40 (5), 650666.

    Al-Marhoun, M.A., 1992. New correlations for formation volume factor of oil andgas mixture. Journal of Canadian Petroleum Technology 31 (3), 2226.

    Al-Marhoun, M.A., Osman, E.A., 2002, Using Articial neural networks to developNew PVT correlations for saudi crude oils, in 10th Abu Dhabi InternationalPetroleum Exhibition and Conference (ADIPEC), Abu Dhabi, UAE, paper SPE78592.

    Almehaideb, R.A., 1997. Improved PVT correlations for UAE crude oils: SPE MiddleEast Oil Show and Conference, Manama, Bahrain, V1, SPE 37691, pp. 109120.

    Aulia, A., Keat, T.B., Bin Maulut, M.S., El-Khatib, N., Jasamai, M., 2010. Smart oilelddata mining for reservoir analysis. International Journal of Engineering &Technology 10, 06.

    Balan, B., Ameri, S., Mohaghegh, S., 1996. A hybrid neuro-genetic approach tohydraulic fracture treatment design and optimization. SPE 36602.

    De Ghetto, G., Paone, F., Villa, M., 1995. PressureVolumetemperature correla-tions for heavy and extra heavy oils. In: SPE International Heavy OilSymposium, Calgary, Alberta, Canada.

    El-Sebakhy, E., 2009. Data mining in forecasting PVT correlations of crude oilsystems based on type 1 fuzzy logic inference systems. Computers &Geosciences 35 (9), 18261917.

    Elsharkawy, A.M., 1998. Modelling the properties of crude oil and gas systemsusing RBF network. In: SPE Asia Pacic Oil & Gas Conference, Perth, Australia,paper SPE 49961.

    Glaso, O., 1980. Generalized PressureVolumeTemperature correlations. Journalof Petroleum Technology 32 (5), 785795.

    Goda H.M., Fattah K.A., Shokir E.M., Sayyouh M.H., 2003.Prediction of the PVT datausing neural network computing theory. In: 27th Annual SPE InternationalTechnical Conference and Exhibition, Abuja, Nigeria, paper SPE 85650.

    Jang, S.R., Sun, C.T., Mizutani, E., 1996. Neuro-Fuzzy and Soft Computing: AComputational Approach to Learning and Machine Intelligence. Prentice-Hall.

    Katz, D.L., 1942. Prediction of the shrinkage of crude oils. Drilling and ProductionPractice, API, 137147.

    Khoukhi, A., Albukhitan, S., 2010.A data driven genetic neuro-fuzzy system to PVTproperties prediction. In: Proceedings of the North American Fuzzy Informa-tion Processing Society (NAFIPS10), Toronto, Canada.

    Khoukhi, A., Albukhitan, S., 2011. PVT properties prediction using hybrid geneticneuro-fuzzy systems. International Journal of Oil Gas and Coal Technology 4(1), 4763.

    Khoukhi, A., Oloso, M., Abdulraheem, A., El-Shafei, M., Al-Majed, A., 2011. Supportvector machines and functional networks for viscosity and gas/oil ratio curvesprediction. International Journal of Computational Intelligence and Applica-

    tions 10 (3), 269293.

  • Labedi, R., 1990. Use of production data to estimate volume factor density andcompressibility of reservoir uids. Journal of Petroleum Science & Engineering4, 357390.

    Macary, S.M., El-Batanoney, M.H., 1992. Derivation of PVT correlations for the Gulfof Suez crude oils. In: EGPC 11th Petroleum Exploration & ProductionConference.

    Nikravesh, M., Zadeh, L.A., Aminzadeh, F., 2003. Soft Computing and IntelligentData Analysis in Oil Exploration, 51. (Developments in Petroleum Science),Elsevier Science.

    Oloso, M., Khoukhi, A., Abdulraheem, A., El-Shafei, M., 2009. A genetic-optimizedarticial neural networks for predicting viscosity and gas/oil ratio curves.SPE/EAGE Reservoir Characterization and Simulation Conference & Exhibition,Abu-Dhabi, UAE.

    Omar, M.I., Todd, A.C., 1993, Development of new modied black oil correlation forMalaysian crudes. In: SPE Asia Pacic Oil and Gas Conference, Singapore, SPE25338.

    Osman, E.A., Al-Marhoun, M.A., 2005. Articial neural networks models forpredicting PVT properties of oil eld brines. In: 14th SPE Middle East Oil &Gas Show and Conference, Bahrain, paper SPE 93765.

    Ouenes, A., 2000. Practical application of fuzzy logic and neural networks tofractured reservoir characterization. Computers & Geosciences 26 (8),953962.

    Shiri, J., Kis-i, O., 2010. Comparison of genetic programming with neuro fuzzysystems for predicting short-term water table depth uctuations. Computers &Geosciences 37 (10), 16921701.

    Scholkopf, P., Smola, A.J., 2002. Learning with Kernels: Support Vector Machines,Regularization, Optimization, and Beyond. MIT Press, Cambridge.

    Suttan, R.P., Farshad, F.F., 1990. Evaluation of empirically derived PVT propertiesfor Gulf of Mexico crude oils. SPE Reservoir Engineering 5 (1), 7986.

    Sutton, R.P., 2008. An accurate method for determining oil PVT properties usingthe StandingKatz gas Z-factor chart. SPE Reservoir Evaluation & Engineering11 (2), 246266.

    Standing, M.B., 1947A PressureVolumeTemperature correlation for mixture ofCalifornia oils and gases. In: API Drilling and Production Practice, Dallas,pp. 275287.

    Varotsis, N., Gaganis, V., Nighswander, J., Guieze, P., 1999. A novel non-iterativemethod for the prediction of the PVT behavior of reservoir uids. In: SPEAnnual Technical Conference and Exhibition, Houston, Texas, paper SPE56745.

    Yu, F., Pedrycz, W., 2009. The design of fuzzy information granules: tradeoffsbetween specicity and experimental evidence. Journal of Applied SoftComputing 9 (1), 264273.

    Xu, Y., Shu, Y., 2006. Evolutionary extreme learning machine based on particleswarm optimization. In: Wang, J., et al. (Eds.), ISNN, LNCS 3971. Springer-Verlag, pp. 644652.

    Zanganeh, M., Mousavi, S., Etemad-Shahidi, A., 2006. A genetic algorithm-basedfuzzy inference system in prediction of wave parameters, computationalintelligence. Theory and Applications (Part 25), 741750, http://dx.doi.org/10.1007/3-540-34783-6_72.

    Zhu, Q., Qin, A.K., Suganthan, P.N., Huang, G.B., 2005. Evolutionary extremelearning machine. Pattern Recognition 38, 17591763.

    A. Khoukhi / Computers & Geosciences 44 (2012) 109119 119

    Hybrid soft computing systems for reservoir PVT properties predictionIntroductionMotivation and backgroundCorrelation-based predictionArtificial neural networks based predictionHybrid soft computing based techniques

    Problem statementData acquisition and pre-processingStatistical quality measures

    Proposed approachAdaptive neuro-fuzzy inference systems (ANFIS)Genetic adaptive neuro-fuzzy inference system (GANFIS)Genetically optimized neural networks (GONN)

    Simulation experiments and comparative studiesIntroduction and experimental set upSimulation results and comparative studies

    Conclusion and future workAcknowledgmentsStatistical description of datasets usedPerformance indicesRegression models used (Standing, 1947; Glaso, 1980; Al-Marhoun, 1992)References