final research report - carnegie mellon university · conference presentation. liu j., ”a damage...

29
UTC Report: Infrastructure Monitoring for Gradual Damage Detection From an In-service Light Rail Vehicle Jacobo Bielak 1,* , Hae Young Noh 1,+ , Mario Berg ´ es 1,+ , James H. Garrett 1 , Jelena Kova ˇ cevi ´ c 2,- , and Jingxiao Liu 1 1 Department of Civil and Environmental Engineering, Carnegie Mellon University, Pittsburgh, PA, 15213. 2 Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, 15213. * Principal Investigator: [email protected] + Co-Principal Investigator - Jelena Kovaˇ cevic ´ performed this work while she was with the Department of Electrical and Computer Engineering, Carnegie Mellon University; she is currently with Tandon School of Engineering, New York University. DISCLAIMER The contents of this report reflect the views of the authors, who are responsible for the facts and the accuracy of the information presented herein. This document is disseminated under the sponsorship of the U.S. Department of Transportation’s University Transportation Centers Program, in the interest of information exchange. The U.S. Government assumes no liability for the contents or use thereof. 1 FINAL RESEARCH REPORT Contract No. DTRT-13-GUTC-26

Upload: others

Post on 12-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

UTC Report:Infrastructure Monitoring for Gradual DamageDetection From an In-service Light Rail VehicleJacobo Bielak1,*, Hae Young Noh1,+, Mario Berges1,+, James H. Garrett1, Jelena Kovacevic2,-, and Jingxiao Liu1

1Department of Civil and Environmental Engineering, Carnegie Mellon University, Pittsburgh, PA, 15213. 2Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA, 15213.*Principal Investigator: [email protected]+Co-Principal Investigator-Jelena Kovacevic performed this work while she was with the Department of Electrical and ComputerEngineering, Carnegie Mellon University; she is currently with Tandon School of Engineering, New York University.

DISCLAIMERThe contents of this report reflect the views of the authors, who are responsible for the facts and the accuracy ofthe information presented herein. This document is disseminated under the sponsorship of the U.S. Departmentof Transportation’s University Transportation Centers Program, in the interest of information exchange. The U.S.Government assumes no liability for the contents or use thereof.

1

FINAL RESEARCH REPORT

Contract No. DTRT-13-GUTC-26

Page 2: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

1 SummaryThe private freight rail industry makes $9.7 billion capital investment in maintaining the network, which iscomprised of almost 140,000 miles of track and over 100,000 bridges in 2015. However, the Federal RailroadAdministration still reported 1,096 derailments including 428 track caused accidents in the nation in 2017.This makes apparent the need for monitoring rail networks and railway bridge conditions. Recently, weintroduced an indirect structural health monitoring approach which uses sensors on the vehicle to detectinfrastructure changes and damage for low-cost maintenance. The ultimate objective of this project isto develop a system that would provide continuous monitoring of bridges and rail tracks by collectingvibration data from sensors on in-service trains. Ideally, such a system would be able to detect, localizeand quantify the damage of tracks soon after they begin to occur.

To achieve the goal, we organized this problem into three tasks, starting with quantifying and localizinggradual damage on a laboratory model (section 3), followed by describing the DR-Train dataset collectedfrom in-service trains over the past five years (section 4), and proposing a novel anomaly detection approachfor monitoring longitude elevation of track geometry after revisiting the DR-Train dataset (section 5). Thefirst task allows us to understand how the vehicle responds to an infrastructure with gradual damage; thesecond task provides us with a comprehensive understanding of our test-bed and dataset; and the thirdtask takes a step forward to achieving the indirect monitoring in the real-world dataset.

Task 1: Supported by the University Transportation Center (UTC) at CMU, we have conductedlab-scale experiments and provided evidence of the applicability of the indirect damage diagnosis approach.Specifically, we introduced a physical-insight-based data-driven framework for comparing and estimatingdamage severity and location using only vibration signals collected from vehicles. By studying the physicalinsight of the vehicle dynamics, we achieved indirect gradual damage diagnosis in an unsupervised ora semi-supervised way. Further study is required to validate the robustness of this damage diagnosisframework with a more complex and realistic system. For instance, on-site tests are needed to verify thefeasibility of this work in practice, and we also need to study the influence of different types of damagescenarios, vehicle velocity, ongoing traffic, and environmental factors.

Task 2: We published the DR-Train dataset, an open-access dataset recording dynamic responsesfrom two in-service light rail vehicles that run on a 42.2-km light rail network in the city of Pittsburgh,Pennsylvania (USA). The DR-Train dataset also contains GPS positions and environmental data (includingtemperature, wind, weather, and precipitation) as the vehicles were running on the track and the trackmaintenance logs from the light-rail operator. The data were recorded from two light rail vehicles for fouryears with a variety of influential factors. This dataset could be used for comparing different vibration-baseddamage diagnosis algorithms, validating signal processing methods, investigating environmental influenceson train responses, etc.

Task 3: Currently, revisiting the DR-Train dataset has allowed us to propose a learning-based anomalydetection approach for monitoring longitude elevation of track geometry from the dynamic response ofan in-service train. The proposed method uses a variational autoencoder (VAE) to detect the anomaly.The VAE takes accelerations as input and learns a mapping from the frequency-domain of accelerationsto a low-dimensional latent space that represents the distribution of the observed data. After the modelis calibrated, the reconstruction probability of the input data is used as an anomaly score for indicatinghow well the input follows the normal pattern. Our proposed approach outperforms two widely usedsupervised baseline models, a logistic regression model (Baseline) and a fully-connected neural network(NN-Feature), in terms of recall, precision, and F1-score. The results make the proposed indirect structuralhealth monitoring approach a strong candidate for low-cost and frequent track geometry inspection.

Over the year, one journal paper and one peer-reviewed conference paper have been submitted and areunder review, one dataset has been published, another journal paper has been drafted. Also, this granthas helped us to deploy vibration sensors on the Panhandle bridge, and we have been collecting data forvalidating our indirect damage diagnosis approaches in practice.

2/29

Page 3: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

Contents

1 Summary 22 List of Papers, Dataset, and Presentations 43 A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring of Bridges

Using Multi-Task Learning 53.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53.2 Numerical Solution of the Vehicle-Bridge Interaction System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63.3 Semi-supervised Damage Location and Severity Estimation Algorithm using Multi-task learning . . . . . . . . . . . . 83.4 Experiments & Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93.5 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104 Dynamic responses, GPS positions and environmental conditions of two light rail vehicles in Pittsburgh114.1 Background & Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12

The Pittsburgh Light Rail and instrumented Light Rail Vehicles • Data management system • Environmental conditions •Maintenance schedules • Code availability • Known issues

4.3 Data Records . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164.4 Technical Validation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

Data Collection Failures • Basic requirements • Publications based on the presented dataset4.5 Usage Notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225 Variational autoencoder for detecting anomalies in longitude elevation of track geometry using the dynamic

response of an in-service train 236 Project Contributions 25

3/29

Page 4: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

2 List of Papers, Dataset, and PresentationsPapersLiu J., Chen S., Noh H. Y., Bielak J., Garrett J. H., Kovacevic J., and Berges M., Diagnosis for IndirectStructural Health Monitoring of a Bridge Model, Manuscript, 2018.

Liu J., Chen S., Lederman G., Kramer D. B., Noh H. Y., Bielak J., Garrett J. H., Kovacevic J., andBerges M., Dynamic responses, GPS positions and environmental conditions of two light rail vehicles inPittsburgh, Nature: Scientific Data, Submitted, 2018.

Liu J., Noh H. Y., Bielak J., Garrett J. H., Kovacevic J., and Berges M., A Damage Localization andQuantification Algorithm for Indirect Structural Health Monitoring of Bridges Using Multi-Task Learning,AIP Conference Proceedings, Under Review, 2018.

DatasetLiu J., Chen S., Lederman G., Kramer D. B., Noh H. Y., Bielak J., Garrett J. H., Kovacevic J., andBerges M., The DR-Train dataset: dynamic responses, GPS positions and environmental conditions of twolight rail vehicles in Pittsburgh (Version 1.0.0). Zenodo, 2018. http://doi.org/10.5281/zenodo.1432702

PresentationsLiu J., ”Damage Diagnosis Algorithms for Indirect Structural Health Monitoring of Bridges”, PIANC-SMART Rivers Conference, Pittsburgh, PA, 2017. Poster Presentation.

Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoringof Bridges Using Multi-Task Learning”, Engineering Mechanics Institute Conference, Boston, MA, 2018.Conference Presentation.

Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoringof Bridges Using Multi-Task Learning”, Machine Learning in Science and Engineering, PA, 2018. PosterPresentation.

Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoringof Bridges Using Multi-Task Learning”, 45th Annual Review of Progress in Quantitative NondestructiveEvaluation, Burlington, VT, 2018. Conference Presentation.

4/29

Page 5: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

3 A Damage Localization and Quantification Algorithm for Indirect Structural Health Mon-itoring of Bridges Using Multi-Task Learning1

3.1 IntroductionThe 2017 National Bridge Inventory of the Federal Highway Administration found that 47,619 out of615,002 bridges in the U.S. were in poor condition across the nation [1]. For instance, Of Pennsylvania’smore than 22,660 bridges, 23% are considered structurally deficient [2]. This makes apparent the need formonitoring the bridge condition. In this paper, we focus on developing a damage diagnosis algorithm forestimating both the location and magnitude of damage occurring on an experimental bridge.

Recently, there is a trend to establish monitoring systems on bridges by indirectly leveraging sensors ona passing vehicle. This approach has gained great popularity because of its low cost and low maintenance.In 2005, Yang et al. [3] proposed the indirect approach to detect the bridge damage information bymodeling the vehicle-bridge interaction (VBI) system and extracting bridge frequencies from the dynamicresponse of a passing vehicle. Over the past decade, many researchers have proposed novel methods inthis field, which can be divided into two categories depending on whether or not they rely on modalparameters [4]. Natural frequency-based methods [3, 5, 6, 7] are only able to do damage detection, whilethe methods of identifying the mode shapes of a bridge [8, 9, 10] can do damage detection and localization.In addition, although the non-modal parameter-based methods, such as signal processing and machinelearning techniques [11, 12, 13, 14, 15], perform well in indicating bridge damage and have the potential toquantify the severity, these methods lack robustness to noise and require more physical insights for modelselection and performance improvement. Figure 1 shows the procedure of the non-modal parameter-basedmethod for indirect bridge structural health monitoring.

0 100 200 300 400 500 600 700 800Frequnecy (Hz)

0

20

40

60

Ampl

itude

Record 1

0 100 200 300 400 500 600 700 800Frequnecy (Hz)

0

20

40

60

Ampl

itude

Record 2

Feature extraction

Damage diagnosis

Data acquisition

system

Vehicle-bridge interaction system

Damage of record 1

<Damage of

record 2

Signal pre-processing

Mappingf(x)

Collect acceleration signals from a

passing vehicle

X

Figure 1. The procedure of the non-modal parameter-based method for the indirect bridge structuralhealth monitoring system. Acceleration signals collected from a moving vehicle are pre-processed andinputted to a feature extraction system (learning a feature mapping from the spectrum or spectrogram ofsignals), and then the extracted features are used for damage diagnosis.

In our study, we propose a semi-supervised damage diagnosis algorithm using multi-task learning (MTL),which aims at estimating the damage locations and severity levels simultaneously by using accelerationsignals collected from a vehicle passing over a bridge. We first derive the theoretical formulation of theVBI system with a mass attached at different locations on the bridge (representing damage severity) andshow the nonlinear property of the dynamic response of the passing vehicle. The derivation also informs usthat we must localize and estimate the severity mass simultaneously so that to minimize the uncertaintypropagating from the location estimation. Hence, our MTL model uses non-linear activation functionsand learns three tasks, namely feature extraction, damage location, and damage severity regression,simultaneously. A lab-scale experiment, an instrumented vehicle passing over an experimental aluminumbridge, is conducted for evaluating our model. The results show that our model can estimate locations of

1This work is based on the manuscript: Liu J., Noh H. Y., Bielak J., Garrett J. H., Kovacevic J., and Berges M., ADamage Localization and Quantification Algorithm for Indirect Structural Health Monitoring of Bridges Using Multi-TaskLearning, AIP Conference Proceedings, Under Review, 2018.

5/29

Page 6: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

the damage with an accuracy of 0.08 m (3.30% of the total length of the bridge) and changes in severitylevel with an accuracy of 17.81 grams (8.9% of the maximum severity mass). We also compare our MTLmodel with a two-step model which performs damage localization and severity estimation step-by-stepand show that uncertainty propagating from location estimation affects the accuracy of damage diagnosis.

3.2 Numerical Solution of the Vehicle-Bridge Interaction SystemTo help inform our choice of statistical analysis methods used to analyze acceleration records collectedfrom real-world experiments, we revisit the theoretical formulation derived by Yang et al. [3]. Figure 2shows a one degree-of-freedom oscillator moving over a simply-supported beam. We assume the beamis of the Euler-Bernoulli type with constant cross sections. Under this assumption, there is no frictionforce between the ’wheel’ and the beam. The damage is simulated by attaching a mass (ms) at differentlocations of the beam (Ls).

ystyydy

udy

ust u

ms

mv

kv cv

r, A, E, I

l=vt

L

v

f(t)

L"

Figure 2. Simple vehicle-bridge interaction system with a mass attached at different locations.

The equations of motion for the vehicle and bridge are:

mvudy(t) +kv[udy(t)−ydy(x= vt, t)−yst(x= vt)]+ cv[udy(t)− ydy(x= vt, t)− yst(x= vt)] +f(t) = 0

(1)

ρAydy(x,t) +msydy(Ls, t)δ(x−Ls) +EIy′′′′dy (x,t) ={−mvg

+kv[udy(t)−ydy(x= vt, t)−yst(x= vt)]+ cv[udy(t)− ydy(x= vt, t)− yst(x= vt)]+f(t)

}δ(x−vt)

(2)

where mv, kv, cv,u are the mass, stiffness, damping coefficient, and total displacement of the vehicle,respectively; ρ, A, E, I, yst are the density, sectional area, Young’s modulus, moment of inertia, and thestatic displacement of the bridge, respectively; δ(x−vt) is the Dirac delta function; udy(t) and ydy(t) arethe dynamic displacement of vehicle and bridge, respectively.

Before deriving the displacement formulas, the nth mode frequency of the beam (ωbn) with a mass ms

attached at the beam can be obtained using Rayleigh’s method:

ω2bn = EI

∫ L0 [y′′st(x)]2dx

ρA∫ L

0 [yst(x)]2dx+ms[ystm,n(Ls)]2=[n2π2

√EI

(ρAL+ 2ms sin2 (nπLsL ))L3

]2(3)

where ystm,n(Ls) is the maximum deflection at Ls for the nth mode.

6/29

Page 7: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

The displacement of bridge can be expressed in terms of the modal shape, φn(x), and modal coordinates,qbn(t), as:

ydy(x,t) =∞∑n=1

φn(x)qbn(t) =∞∑n=1

[sin nπx

Lqbn(t)

](4)

By using the orthogonality conditions, for the nth mode, we have:

(ρAL2 +ms sin2 (nπLs

L)){qbn(t)

+[n2π2

√EI

(ρAL+ 2ms sin2 (nπLsL ))L3

]2qbn(t)

}= sin nπvt

Lfc(x= vt, t)

(5)

Therefore,

qbn(t) +ω2bnqbn(t) = 2

ρAL+ 2ms sin2 (nπLsL )

sin nπvtL

fc(x= vt, t) (6)

where

fc(x= vt, t) =−mvg+kv[u(t)dy−ydy(x= vt, t)−yst(x= vt)]+ cv[u(t)dy− ydy(x= vt, t)− yst(x= vt)] +f(t)

(7)

The solution of Eq. (6) can be calculated using a convolution integral

qbn(t) = 2ρAL+ 2ms sin2(nπLs

L )

∫ t

0sin nπvθ

Lfc(vθ,θ)hn(t−θ)dθ (8)

where hn(t−θ) is the impulse response function for the nth mode:

hn(t−θ) = 1ωbn

sinωbn(t−θ) (9)

Then, for the nth mode, the bridge displacement can be written as:

ydy,n(x,t) = 2ρAL+ 2ms sin2(nπLs

L )sin nπx

L

∫ t

0sin nπvθ

Lfc(vθ,θ)hn(t−θ)dθ (10)

We can then transform Eq. (10) into the frequency domain:

Ydy,n(x,ω) = 2ρAL+ 2ms sin2(nπLs

L )sin nπx

LF{[

sin nπvtL·fc(vt, t)

]∗hn(t)

}(11)

where, F{f} is defined as the Fourier transform of function f ; f ∗g is the convolution of f and g. Inorder to solve the Eq. (11), we need to use the convolution theorem:

F{f ∗g}= F{f} ·F{g}F{f ·g}= F{f}∗F{g}

(12)

Thus, the frequency response of the displacement of the beam is obtained:

Ydy(x,ω) =∞∑n=1

2πi(ρAL+ 2ms sin2 (nπLs

L ))sin nπx

LHn(ω) ·

[Fc(ω−

nπv

L)−Fc(ω+ nπv

L)]

(13)

7/29

Page 8: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

where Hn(ω) = 1/(ω2bn−ω2); Fc(ω) is the frequency response of the contract force, and is defined as

follows:

Fc(ω) =−mvg√

2πδ(ω) +kv[Udy(ω)−Ydy(x= vt,ω)−Yst(ω)]+ iωcv[Udy(ω)−Ydy(x= vt,ω)−Yst(ω)] +F (ω)

(14)

where Udy(ω), Ydy(x= vt,ω) are the frequency response of displacement of vehicle and bridge, respec-tively; Yst(ω) is the frequency response of the initial deflection of bridge.

Then, the mid-span acceleration of the bridge in the frequency domain can be written as:

Ydy(L

2 ,ω) =∞∑n=1

2πmvL3ω2

iπ4EI− iω2L3(ρAL+ 2ms sin2 (nπLsL ))

sin(nπ

2){Udy(ω−

nπv

L)− Udy(ω+ nπv

L)

+√

2πg[δ(ω− nπv

L)

− δ(ω+ nπv

L)]}

(15)

The n-th mode frequency response of the oscillator’s acceleration is:

Udy,n(ω) = Udy,n(ω− 2nπv

L

)−iπ4EI− i(ω− nπv

L )2L3(ρAL+ 2ms sin2 (nπLsL ))

2πmvL3(ω− nπvL )2 sin

(nπ2) Ydy,n(L2 ,ω−

nπv

L)

−√

2πg[δ(ω− 2nπv

L)− δ(ω)

] (16)

From the derivation, we can draw the following conclusions:

• Non-linear property: Eq. (16) illustrates that the transformation from the severity mass to theamplitude of the acceleration signal’s frequency response is non-linear.

• Uncertainty propagation: according to Eq. (16), different damage locations and severity levels onlyvary the term S =ms sin2(nπLs/L). If we localize and quantify the damage severity step by step (atwo-step model), the estimation of damage severity is: ms = S/sin2(nπLs/L). Then the propagationof uncertainty from the damage location estimation can be calculated as follows:

σms =±√σ2S

1sin4 (nπLs/L)

+σLs

4n2π2S2 cos2 (nπLs/L)L2 sin6 (nπLs/L)

. (17)

The sinusoidal function in the denominators makes the uncertainty of ms very large, especially whenthe damage location is near the supports (‖Ls−L/2‖ is large). As a result, if a wrong damagelocation estimation appears, this error will propagate and consequence a wrong estimation of thedamage severity level.

To overcome the problem of uncertainty propagation and preserve the non-linear property of theinteractive VBI system, we propose a semi-supervised algorithm using MTL.

3.3 Semi-supervised Damage Location and Severity Estimation Algorithm using Multi-task learningIn practice, because of the lake of labeled examples, we want to achieve damage localization and quantifica-tion in a semi-supervised fashion, which generally needs to localize the damage first and then quantify thedamage severity in the same location. However, as mentioned in the previous section, this 2-step methodmakes the uncertainty of the severity level estimation large and can result in an incorrect damage severityestimation. Thus, instead of a two-step model, we introduce a semi-supervised MTL model to localize andquantify the damage simultaneously with three tasks including reconstruction of input signals, damagelocation regression, and damage severity regression. The multi-task learning improves generalization by

8/29

Page 9: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

leveraging the domain-specific information contained in the training signals of related tasks [16]. Thearchitecture of our model is shown in Figure 3. For the first task (input reconstruction), we reduce thedimensionality of the fast Fourier transformation (FFT) outputs of acceleration signals by using stackedautoencoders with non-linear activation functions (ReLU). A stacked autoencoder is a neural networkconsisting of multiple layers of autoencoders to reconstruct its input. The encoder learns a representation zfor a set of data x, and the decoder can reconstruct x from its encoding z. The architecture of our stackedautoencoder is 1936→ 512→ 16→ 512→ 1936. For the second and the third tasks (damage severityestimation and localization), two regression layers are stacked on the bottleneck (or the 16-dimensionfeature layer) of the stacked autoencoder to predict damage location and severity. Furthermore, as asemi-supervised model, only a part of location and severity labels are given, which means that for the restof the input signals that are unlabeled, the gradients of the second and the third task will be stoppedduring the training phase.

Figure 3. The architecture of the semi-supervised MTL model.

3.4 Experiments & ResultsIn this section, we use our semi-supervised MTL model to localize and quantify damage occurring onan experimental bridge. Figure 6 shows pictures of the experimental vehicle and the lab-scale bridge.The damage proxy is represented by adding mass at different locations on the bridge. A heavier massmeans more severe damage since it appears as a more significant change from the initial condition. Inthis experiment, the mass ranges from 60 to 200 grams with an interval of 10 grams, so the attachedmass varies from 0.38% to 1.29% of the mass of the bridge, which is 15.5 kg. For each severity level, theexperiment is run at 3 different damage locations (the first quarter, the half and the third quarter of thebridge span), and for each damage scenario, the experiment is run 30 times. Furthermore, there are fouraccelerometers installed on the experimental vehicle, two of which are on the front and back chassis, andtwo on the front and back wheels. Thus, we have 15 (mass) × 3 (locations) × 30 (iterations) = 1350(trials), and 1350 (trials) × 4 (sensors) = 5400 (signals).

In our study, we compare the performance of our MTL model with that of a 2-step model in damagelocalization and quantification. For the MTL model, we first train it with acceleration signals of lowerand upper severity levels and their corresponding severity and location labels (severity from 60 grams to90 grams and from 180 grams to 200 grams), and then predict the other damage locations and severitylevels simultaneously. For the 2-step model, we first estimate damage locations using k-means clusteringto partition 16-dimension features of FFT of acceleration signals learned by a stacked autoencoder into 3clusters, and then we train regression models with the lower- and upper-severe acceleration signals in eachpredicted damage location (cluster), and then predict the other damage locations and severity levels. To

9/29

Page 10: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

Back Chassis

Back Wheel

Front Chassis

Front Wheel

Figure 4. Vehicle.

Vehicle

Bridge

Figure 5. Bridge.

Figure 6. (a) Details of the passing vehicle and (b) the Lab-scale bridge under a moving vehicle.

evaluate these models, we calculate the root mean squared error (RMSE) between the predicted damagelocations and severity levels with the corresponding labels. Figure 9 shows that in most cases our MTLmodel outperforms the 2-step model in terms of RMSE. In addition, even with a better damage locationprediction, the damage severity prediction of the 2-step model is worse than that of the MTL model, whichprovides evidence of the problem of large uncertainty propagation from the location estimation.

0

0.1

0.2

0.3

0.4

0.5

0.6

BackWheel

FrontChasis

FrontWheel

BackChasis

RMSE

2-step

MTL

Figure 7. Damage location prediction.

0

10

20

30

40

50

BackWheel

FrontChasis

FrontWheel

BackChasis

RMSE

2-step

MTL

Figure 8. Damage severity prediction.

Figure 9. (a) Damage location and (b) severity prediction results using a 2-step method and MTL.

3.5 ConclusionsIn this paper, we propose a semi-supervised damage localization and quantification algorithm usingmulti-task learning, which preserves the non-linear property of the VBI system and prevents from largeuncertainty propagating from the damage location estimation. We evaluate our proposed model on anexperimental dataset, and we show that it outperforms a two-step baseline model that localizes andquantifies the damage step by step. Our proposed model can estimate locations of the damage with anaccuracy of 0.08 m (3.30% of the total length of the bridge) and changes in severity level with an accuracyof 17.81 grams (8.9% of the maximum severity mass). These results point to the applicability of indirectstructural health monitoring with low cost and low maintenance.

10/29

Page 11: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

4 Dynamic responses, GPS positions and environmental conditions of two light rail vehiclesin Pittsburgh2

The previous section provides an understanding of the physical and mathematical aspects of a methodologyfor detecting gradual change in a structural model. In this and the next sections, we apply our damagediagnosis methodologies to a real-world dataset collected from light rail vehicles, and develop new approachesfor improving their diagnosis capabilities.

Over the last five years, with the support of the UTC, we have collected an extensive data set fromtwo light rail vehicles. In earlier projects, we addressed the difficulties associated with aligning the vehicleaccelerations with the position data, and how to fuse signals from different sensor channels and differentvehicles. In the present project, our group organized, validated, and released the collected data. Thissection provides a summary of the earlier work, as well as a description of the approach we followed toperform these tasks. Then, in Section 5, with the background of our earlier work, we describe a newapproach we have developed for achieving indirect track geometry monitoring, which results in fewer falsealarms than existing methods.

4.1 Background & SummaryThe private freight rail industry in the U.S. makes $9.7 billion capital investment in maintaining thenetwork, which is comprised of almost 140,000 miles of track and over 100,000 bridges in 2015 [17]. However,in 2017, the Federal Railroad Administration still reported 11,699 train accidents/incidents including 1,223derailments and 470 track-caused accidents/incidents in the nation [18]. To ensure safety and reducemaintenance cost, it is necessary to develop low-cost and reliable techniques to monitor the status ofrailroad networks continuously, especially track geometries. In practice, two traditional approaches areusually adopted to inspect track infrastructure: (1) visual inspection and (2) inspection using a dedicatedtrack geometry car. Visual inspection is neither reliable nor convenient. While inspection using a dedicatedtrack geometry car can provide accurate track geometry data, it requires interruptions of regular trainoperations, and each inspection session has a more expensive cost than visual inspection. Due to itshigh cost and interruptions, it is difficult to conduct frequent inspections using a track-geometry car. Inrecent years, researchers have proposed many indirect track inspection methods using sensors, such asaccelerometers and GPS, installed on in-service trains for track geometry monitoring and change detection[19, 20, 21, 22, 23, 24, 25, 26, 27] since it can be more reliable than visual inspection and costs less thaninspection using a track-geometry car. Also, sensors installed on in-service trains can provide continuousmonitoring of the track without affecting regular operations.

We monitored Pittsburgh’s light rail network from sensors placed on passenger trains, as a moreeconomical monitoring approach than either visual inspection or inspection with dedicated track vehicles.Over time, we learned how the trains respond to each section of track and use a data-driven approachto detect changes to the track condition relative to its historical baseline. We instrumented one train inFall 2013, and a second train in Summer 2015. We have been continuously collecting data on the trains’position using GPS and their dynamic responses using accelerometers; in addition, our dataset includesenvironmental data as the trains were running on the track and the track maintenance logs from thelight-rail operator.

Although there are some acceleration datasets for structure vibration testing [28], human activityrecognition [29], senior fall detection [30] and gait recognition [31], at the time of writing, the DR-Traindataset is the only one to include multi-channel and high-frequency acceleration signals and GPS positionsof light rail vehicles. The data were recorded from two light rail vehicles for four years with a variety ofinfluential factors. This could be a benchmark dataset for comparing different vibration-based damage

2This work is based on the manuscript: Liu J., Chen S., Lederman G., Kramer D. B., Noh H. Y., Bielak J., Garrett J. H.,Kovacevic J., and Berges M., Dynamic responses, GPS positions and environmental conditions of two light rail vehicles inPittsburgh, Nature: Scientific Data, Submitted, 2018.

11/29

Page 12: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

diagnosis algorithms. As a validation of our dataset, we have been able to detect changes in the tracks,which correspond to known maintenance activities. Besides detecting those changes, another usage ofthis dataset is for developing or validating data fusion methods. Data fusion methods integrate multipledata sources, such as multiple sensor channels on multiple light rail vehicles, to produce more consistent,accurate, and useful information than that provided by any individual data source. Our group has proposeda data fusion approach that integrates multiple accelerometers and GPS data sources from the sameor different vehicles [27]. Also, the DR-Train dataset has many other potential usages. For example,environmental factors of each service run are logged in the dataset; researchers can reuse the dataset toinvestigate influences of the weather and temperature on the dynamic response of the light rail vehicles.

4.2 MethodsWe use a data management system to collect and process dynamic responses and GPS positions of twopassenger vehicles. We first introduce our monitoring target, the Pittsburgh Light Rail system, and ourmonitoring carrier, light rail vehicles in the following subsection.

4.2.1 The Pittsburgh Light Rail and instrumented Light Rail VehiclesThe Pittsburgh Light Rail, called the ‘T’ lines, is a 42.2-km light rail network in Pittsburgh, Pennsylvania.Figure 10 shows the transit map of the ‘T’ lines. This network is owned and operated by the PortAuthority of Allegheny County (PAAC). It has 53 stations and around 28,000 daily ridership. The railtrack, including street running track and ballasted track, in this network uses the Pennsylvania Trolleygauge rail whose track gauge is 1,588 mm. Also, the network contains bridges, viaducts, and tunnels, andis exposed to variable environmental conditions. For example, the temperature we observed ranges from−20◦C to 35◦C. The variety of influential factors makes it a viable test-bed.

A light rail vehicle (LRV) is a standardized vehicle for U.S. cities. LRVs of the Pittsburgh’s lightrail network have two models: Siemens SD-400 LRVs were built from 1985 to 1987 and assigned fleetnumbers 4201 to 4255; Construcciones y Auxiliar de Ferrocarriles S.A. (CAF) LRVs were built from 2003to 2004 and assigned fleet numbers 4301 to 4328. Those LRVs are supplied by a 650 voltage direct currentelectrification system. We installed accelerometers and GPS antennas on LRVs 4306 and 4313. These twotrains are run in a two-car configuration with total length 25.810 m, empty weight 40 metric tons, totalpassenger capacity 264 and maximum speed 80 km/h.

4.2.2 Data management systemOur data management system consists of four key modules (as shown in Figure 11): Sensing module,data-acquisition module, data-storage module, and data-processing module.

Sensing moduleFigures 12 and 13 show our instrumentations of two LRVs (fleet number 4306 and 4313). In 2013,we instrumented the LRV 4306 by placing two uni-axial accelerometers inside the cabin of the train(VibraMetrics 5102 [32]) and a tri-axial accelerometer (PCB 354C03 [33]) on the central wheel truck. Thecentral wheel truck, or the central bogie, is a chassis attached to a vehicle and carrying wheelsets witha suspension system. Since it is not powered, the electrical noise is minimized. However, it has higherinstallation and maintenance costs than the system inside the cabin. For the second LRV instrumented(fleet number 4313) in 2015, we placed more sensors, including two uni-axial accelerometers (VibraMetrics5102) and two tri-axial accelerometers (PCB 354C03) inside the train, for improving the system. To collectthe position data, we placed a low-cost BU-353 GPS [34] antenna on the LRV 4306 and 4313. The GPSantenna in the LRV 4313 was installed within the interurban light enclosure for having a view of the sky.Tables 1 and 2 show important specifications of those sensors.

Data-acquisition and data-storage moduleFor data acquisition, we connected National Instruments USB-powered data acquisition hardware to acomputer. Our data acquisition system samples acceleration signals at 1.6KHz and GPS position at 1

12/29

Page 13: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

Free

Far

e Zo

nePo

rt Au

thor

ity b

us ri

des

are

free

with

in th

e G

olde

n Tr

iang

leev

ery

day

until

7:0

0 p.

m. T

he li

mits

of t

his

zone

are

bou

nded

by R

oss

St.,

11th

St.

and

the

Alle

ghen

y an

d M

onon

gahe

lariv

ers.

Lig

ht ra

il rid

es o

n th

e T

are

free

with

in F

REE

FAR

EZO

NE

at a

ll tim

es. T

his

incl

udes

T ri

des

betw

een

Alle

ghen

y,N

orth

Sid

e, G

atew

ay, W

ood

Stre

et, S

teel

Pla

za a

nd

Firs

t Ave

nue

stat

ions

.Sp

ecia

l Far

esCh

ildre

n (6

-11)

pay

hal

f the

full

cash

fare

.Ch

ildre

n 5

and

unde

r (U

p to

four

) rid

e fre

e w

hen

acco

mpa

nied

by

a fa

re-p

ayin

g pa

ssen

ger.

Pers

ons

with

Dis

abili

ties

pay

half

the

full

cash

fare

whe

ndi

spla

ying

a M

edic

are

ID, s

tate

-issu

ed h

alf-f

are

card

or

Port

Auth

ority

redu

ced-

fare

Con

nect

Car

d.Se

nior

Citi

zens

Rid

e Fr

ee w

hen

disp

layi

ng a

Med

icar

e ID

or

sta

te-is

sued

sen

ior c

itize

n tra

nsit

pass

.Li

ght R

ail "

T" -

Peak

Dire

ctio

n Su

rcha

rge

on C

ash

Fare

sA

0.75

sur

char

ge is

app

lied

to T

cas

h fa

re in

boun

d fro

m

6:00

a.m

. to

9:00

a.m

. and

out

boun

d 4:

00 p

.m. t

o 6:

30 p

.m o

nw

eekd

ays.

Rid

ers

payi

ng w

ith p

asse

s, ti

cket

s an

d tra

nsfe

rsar

e no

t sub

ject

to th

e su

rcha

rge.

Base

Far

eT

Surc

harg

e1

Zone

2.

500.

752

Zone

3.

750.

75Tr

ansf

ers

A tra

nsfe

r may

be

purc

hase

d fo

r 1.0

0 ad

ditio

nal a

nd is

val

idfo

r thr

ee (3

) hou

rs in

any

dire

ctio

n fo

r a o

ne-z

one

ride .

Adul

ts1.

00C

hild

ren

6-11

and

peo

ple

with

dis

abilit

ies

0.50

Pass

es a

nd T

icke

ts1

Zone

2 Zo

neW

eekl

y Pa

ss25

.00

37.5

0M

onth

ly P

ass

97.5

014

6.25

Annu

al P

ass

1,07

2.50

1,60

8.75

Ten

Trip

Tic

kets

25.0

037

.50

Port

Auth

ority

Far

esCa

sh F

ares

Zone

1Zo

ne 1

AZo

ne 2

Full

Half

Full

Half

Full

Half

2.50

2.50

3.75

1.25

1.25

1.85

2.50

2.50

2.50

1.25

1.25

1.25

3.75

2.50

2.50

1.85

1.25

1.25

Zone1

Zone1A

Zone2

Two-

Car T

rain

sTr

ains

do

not h

ave

an o

pera

tor o

n th

e se

cond

car

and

the

seco

nd c

ar o

pens

its

door

s on

ly at

sel

ecte

d hi

gh-le

vel

plat

form

s. R

ider

s wi

shin

g to

boa

rd o

r exi

t at a

ny

stre

et-le

vel s

top

can

use

only

the

first

car

of t

he tr

ain.

On

ly e

nter

the

seco

nd c

ar if

you

are

exi

ting

at o

ne o

f th

e fo

llowi

ng s

tatio

ns:

Alle

ghen

yNo

rth S

ide

Gate

way C

ente

rW

ood

Stre

et

Stee

l Pla

zaFi

rst A

venu

eSt

atio

n Sq

uare

Sout

h Hi

lls Ju

nctio

nPo

tom

acDo

rmon

t Jun

ctio

nM

t. Le

bano

nCa

stle

Sha

nnon

In c

ase

of e

mer

genc

y, yo

u m

ay s

peak

to th

e op

erat

or

by to

uchi

ng th

e em

erge

ncy s

trip

or b

utto

n an

d sp

eaki

ng

into

the

spea

ker.

412.

442.

2000

Cast

leSh

anno

n

EFFE

CTIV

E 6/1

5/14

RED

LINE

Red

Line

FREE

FAR

E ZO

NE E

XTEN

DED

The

Down

town

Fre

e Fa

re Z

one

has

been

ext

ende

d to

incl

ude

the

North

Sid

e an

d Al

legh

eny s

tatio

ns. N

orth

Sid

e St

atio

nfre

e fa

re is

mad

e po

ssib

le th

roug

h fu

ndin

g pr

ovid

ed b

yth

e St

adiu

m A

utho

rity a

nd A

lco

Park

ing.

Alle

ghen

y Sta

tion

fund

ing

prov

ided

by t

he P

ittsb

urgh

Ste

elers

and

Rive

rs C

asin

o.

Via B

eech

view

Alleg

heny

Wes

t • N

orth

Shor

e • Do

wntow

nSt

ation

Squa

re •

Beec

hview

• Do

rmon

tMo

unt L

eban

on •

Castl

e Sha

nnon

Exten

sion t

o Beth

el Pa

rk

RED

LIN

E - C

AST

LE S

HA

NN

ON

VIA

BEE

CH

VIEW

SU

ND

AY

AN

D H

OL

IDA

Y S

ER

VIC

EN

OR

THB

OU

ND

To D

ownt

own

Pit

tsbu

rgh

- N

orth

Sho

reS

OU

THB

OU

ND

To B

eech

view

- C

astl

e S

hann

on

UpperSaintClairSouthHillsVillage(ParkandRide)BethelParkWashingtonJunction(ParkandRide)Castle ShannonCastle Shannon (ParkandRide)MountLebanonMt.LebanonStation(ParkandRide)DormontDormontJunction(ParkandRide)DormontPotomacStation(ParkandRide)BeechviewFallowfieldStation

SouthHillsJunctionInboundPlatform

StationSquareInboundPlatformDowntownWoodStStation InboundPlatformNorth ShoreAllegheny Station

North ShoreAllegheny StationDowntownWoodStStation OutboundPlatformStationSquareOutboundPlatform

SouthHillsJunctionOutboundPlatform

BeechviewFallowfieldStationDormontPotomacStation(ParkandRide)DormontDormontJunction(ParkandRide)MountLebanonMtLebanonStation(Park and Ride)CastleShannonCastleShannon(Park and Ride)BethelParkWashingtonJunction(ParkandRide)UpperSaintClairSouthHillsVillage(Park and Ride)

5:13

5:18

5:22

5:26

5:28

5:30

5:35

5:39

5:43

5:48

5:57

6:00

6:08

6:13

6:16

6:21

6:26

6:28

6:30

6:34

........

5:33

5:38

5:42

5:46

5:48

5:50

5:55

5:59

6:03

6:08

6:17

6:20

6:28

6:33

6:36

6:41

6:46

6:48

6:50

6:54

........

5:53

5:58

6:02

6:06

6:08

6:10

6:15

6:19

6:23

6:28

6:37

6:40

6:48

6:53

6:56

7:01

7:06

7:08

7:10

7:14

........

6:13

6:18

6:22

6:26

6:28

6:30

6:35

6:39

6:43

6:48

6:57

7:00

7:08

7:13

7:16

7:21

7:26

7:28

7:30

7:34

........

6:33

6:38

6:42

6:46

6:48

6:50

6:55

6:59

7:03

7:08

7:17

7:20

7:28

7:33

7:36

7:41

7:46

7:48

7:50

7:54

........

........

7:02

7:06

7:08

7:10

7:15

7:19

7:23

7:28

7:37

7:40

7:48

7:53

7:56

8:01

8:06

8:08

8:10

8:14

........

........

7:22

7:26

7:28

7:30

7:35

7:39

7:43

7:48

7:57

8:00

8:08

8:13

8:16

8:21

8:26

8:28

8:30

8:34

........

........

7:42

7:46

7:48

7:50

7:55

7:59

8:03

8:08

8:17

8:20

8:28

8:33

8:36

8:41

8:46

8:48

8:50

8:54

........

........

8:02

8:06

8:08

8:10

8:15

8:19

8:23

8:28

8:37

8:40

8:48

8:53

8:56

9:01

9:06

9:08

9:10

9:14

........

........

8:22

8:26

8:28

8:30

8:35

8:39

8:43

8:48

8:57

9:00

9:08

9:13

9:16

9:21

9:28

9:31

9:33

9:38

........

........

8:37

8:42

8:44

8:47

8:53

8:59

9:03

9:08

9:17

9:20

9:28

9:33

9:36

9:41

9:48

9:51

9:53

9:58

........

........

8:57

9:02

9:04

9:07

9:13

9:19

9:23

9:28

9:37

9:40

9:48

9:53

9:56

10:01

10:08

10:11

10:13

10:18

........

........

9:17

9:22

9:24

9:27

9:33

9:39

9:43

9:48

9:57

10:00

10:08

10:13

10:16

10:21

10:28

10:31

10:33

10:38

........

9:22

9:28

9:32

9:37

9:39

9:42

9:48

9:54

9:58

10:03

10:12

10:15

10:23

10:28

10:31

10:36

10:43

10:46

10:48

10:53

........

........

9:47

9:52

9:54

9:57

10:03

10:09

10:13

10:18

10:27

10:30

10:38

10:43

10:46

10:51

10:58

11:01

11:03

11:08

........

........

10:02

10:07

10:09

10:12

10:18

10:24

10:28

10:33

10:42

10:45

10:53

10:58

11:01

11:06

11:13

11:16

11:18

11:23

........

........

10:17

10:22

10:24

10:27

10:33

10:39

10:43

10:48

10:57

11:00

11:08

11:13

11:16

11:21

11:28

11:31

11:33

11:38

........

........

10:32

10:37

10:39

10:42

10:48

10:54

10:58

11:03

11:12

11:15

11:23

11:28

11:31

11:36

11:43

11:46

11:48

11:53

........

10:37

10:43

10:47

10:52

10:54

10:57

11:03

11:09

11:13

11:18

11:27

11:30

11:38

11:43

11:46

11:51

11:58

12:0

112

:03

12:0

8....

........

....11

:0211

:0711

:0911

:1211

:1811

:2411

:2811

:3311

:4211

:4511

:5311

:5812

:01

12:0

612

:13

12:1

612

:18

12:2

3....

........

....11

:1711

:2211

:2411

:2711

:3311

:3911

:4311

:4811

:5712

:00

12:0

812

:13

12:1

612

:21

12:2

812

:31

12:3

312

:38

........

........

11:32

11:37

11:39

11:42

11:48

11:54

11:58

12:0

312

:12

12:1

512

:23

12:2

812

:31

12:3

612

:43

12:4

612

:48

12:5

3....

........

....11

:4711

:5211

:5411

:5712

:03

12:0

912

:13

12:1

812

:27

12:3

012

:38

12:4

312

:46

12:5

112

:58

1:01

1:03

1:08

........

........

12:0

212

:07

12:0

912

:12

12:1

812

:24

12:2

812

:33

12:4

212

:45

12:5

312

:58

1:01

1:06

1:13

1:16

1:18

1:23

........

........

12:1

712

:22

12:2

412

:27

12:3

312

:39

12:4

312

:48

12:5

71:

001:

081:

131:

161:

211:

281:

311:

331:

38....

........

....12

:32

12:3

712

:39

12:4

212

:48

12:5

412

:58

1:03

1:12

1:15

1:23

1:28

1:31

1:36

1:43

1:46

1:48

1:53

........

........

12:4

712

:52

12:5

412

:57

1:03

1:09

1:13

1:18

1:27

1:30

1:38

1:43

1:46

1:51

1:58

2:01

2:03

2:08

........

........

1:02

1:07

1:09

1:12

1:18

1:24

1:28

1:33

1:42

1:45

1:53

1:58

2:01

2:06

2:13

2:16

2:18

2:23

........

........

1:17

1:22

1:24

1:27

1:33

1:39

1:43

1:48

1:57

2:00

2:08

2:13

2:16

2:21

2:28

2:31

2:33

2:38

........

........

1:32

1:37

1:39

1:42

1:48

1:54

1:58

2:03

2:12

2:15

2:23

2:28

2:31

2:36

2:43

2:46

2:48

2:53

........

........

1:47

1:52

1:54

1:57

2:03

2:09

2:13

2:18

2:27

2:30

2:38

2:43

2:46

2:51

2:58

3:01

3:03

3:08

........

........

2:02

2:07

2:09

2:12

2:18

2:24

2:28

2:33

2:42

2:45

2:53

2:58

3:01

3:06

3:13

3:16

3:18

3:23

........

........

2:17

2:22

2:24

2:27

2:33

2:39

2:43

2:48

2:57

3:00

3:08

3:13

3:16

3:21

3:28

3:31

3:33

3:38

........

........

2:32

2:37

2:39

2:42

2:48

2:54

2:58

3:03

3:12

3:15

3:23

3:28

3:31

3:36

3:43

3:46

3:48

3:53

........

........

2:47

2:52

2:54

2:57

3:03

3:09

3:13

3:18

3:27

3:30

3:38

3:43

3:46

3:51

3:58

4:01

4:03

4:08

........

........

3:02

3:07

3:09

3:12

3:18

3:24

3:28

3:33

3:42

3:45

3:53

3:58

4:01

4:06

4:13

4:16

4:18

4:23

........

........

3:17

3:22

3:24

3:27

3:33

3:39

3:43

3:48

3:57

4:00

4:08

4:13

4:16

4:21

4:28

4:31

4:33

4:38

........

........

3:32

3:37

3:39

3:42

3:48

3:54

3:58

4:03

4:12

4:15

4:23

4:28

4:31

4:36

4:43

4:46

4:48

4:53

........

........

3:47

3:52

3:54

3:57

4:03

4:09

4:13

4:18

4:27

4:30

4:38

4:43

4:46

4:51

4:58

5:01

5:03

5:08

........

........

4:02

4:07

4:09

4:12

4:18

4:24

4:28

4:33

4:42

4:45

4:53

4:58

5:01

5:06

5:13

5:16

5:18

5:23

........

........

4:17

4:22

4:24

4:27

4:33

4:39

4:43

4:48

4:57

5:00

5:08

5:13

5:16

5:21

5:26

5:28

5:30

5:34

5:38

5:43

........

4:37

4:41

4:43

4:45

4:50

4:54

4:58

5:03

5:12

5:15

5:23

5:28

5:31

5:36

5:41

5:43

5:45

5:49

........

........

4:52

4:56

4:58

5:00

5:05

5:09

5:13

5:18

5:27

5:30

5:38

5:43

5:46

5:51

5:56

5:58

6:00

6:04

........

........

5:07

5:11

5:13

5:15

5:20

5:24

5:28

5:33

5:42

5:45

5:53

5:58

6:01

6:06

6:11

6:13

6:15

6:19

........

........

5:22

5:26

5:28

5:30

5:35

5:39

5:43

5:48

5:57

6:00

6:08

6:13

6:16

6:21

6:26

6:28

6:30

6:34

........

........

5:37

5:41

5:43

5:45

5:50

5:54

5:58

6:03

6:12

6:15

6:23

6:28

6:31

6:36

6:41

6:43

6:45

6:49

........

........

5:52

5:56

5:58

6:00

6:05

6:09

6:13

6:18

6:27

6:30

6:38

6:43

6:46

6:51

6:56

6:58

7:00

7:04

........

........

6:07

6:11

6:13

6:15

6:20

6:24

6:28

6:33

6:42

6:45

6:53

6:58

7:01

7:06

7:11

7:13

7:15

7:19

........

........

6:22

6:26

6:28

6:30

6:35

6:39

6:43

6:48

6:57

7:00

7:08

7:13

7:16

7:21

7:26

7:28

7:30

7:34

........

........

6:37

6:41

6:43

6:45

6:50

6:54

6:58

7:03

7:12

7:15

7:23

7:28

7:31

7:36

7:41

7:43

7:45

7:49

........

........

6:52

6:56

6:58

7:00

7:05

7:09

7:13

7:18

7:27

7:30

7:38

7:43

7:46

7:51

7:56

7:58

8:00

8:04

........

........

7:07

7:11

7:13

7:15

7:20

7:24

7:28

7:33

7:42

7:45

7:53

7:58

8:01

8:06

8:11

8:13

8:15

8:19

........

........

7:22

7:26

7:28

7:30

7:35

7:39

7:43

7:48

7:57

8:00

8:08

8:13

8:16

8:21

8:26

8:28

8:30

8:34

........

........

7:37

7:41

7:43

7:45

7:50

7:54

7:58

8:03

8:12

8:15

8:23

8:28

8:31

8:36

8:41

8:43

8:45

8:49

........

........

7:52

7:56

7:58

8:00

8:05

8:09

8:13

8:18

8:27

8:30

8:38

8:43

8:46

8:51

8:56

8:58

9:00

9:04

........

........

8:07

8:11

8:13

8:15

8:20

8:24

8:28

8:33

8:42

8:45

8:53

8:58

9:01

9:06

9:11

9:13

9:15

9:19

........

........

8:22

8:26

8:28

8:30

8:35

8:39

8:43

8:48

8:57

9:00

9:08

9:13

9:16

9:21

9:26

9:28

9:30

9:34

........

........

8:37

8:41

8:43

8:45

8:50

8:54

8:58

9:03

9:12

9:15

9:23

9:28

9:31

9:36

9:41

9:43

9:45

9:49

........

........

8:52

8:56

8:58

9:00

9:05

9:09

9:13

9:18

9:27

9:30

9:38

9:43

9:46

9:51

9:56

9:58

10:0

010

:04

........

........

9:07

9:11

9:13

9:15

9:20

9:24

9:28

9:33

9:42

9:45

9:53

9:58

10:0

110

:06

10:1

110

:13

10:1

510

:19

10:2

310

:28

........

9:22

9:26

9:28

9:30

9:35

9:39

9:43

9:48

9:57

10:0

010

:08

10:1

310

:16

10:2

110

:26

10:2

810

:30

10:3

410

:38

10:4

3....

....9:

379:

419:

439:

459:

509:

549:

5810

:03

10:1

210

:15

10:2

310

:28

10:3

110

:36

10:4

110

:43

10:4

510

:49

10:5

310

:58

........

9:52

9:56

9:58

10:0

010

:05

10:0

910

:13

10:1

810

:27

10:3

010

:38

10:4

310

:46

10:5

110

:56

10:5

811

:00

11:0

411

:08

11:1

3....

....10

:07

10:1

110

:13

10:1

510

:20

10:2

410

:28

10:3

310

:42

10:4

510

:53

10:5

811

:01

11:0

611

:11

11:1

311

:15

11:1

911

:23

11:2

8....

....10

:22

10:2

610

:28

10:3

010

:35

10:3

910

:43

10:4

810

:57

11:0

011

:08

11:1

311

:16

11:2

111

:26

11:2

811

:30

11:3

411

:38

11:4

3

AN

NO

UN

CEM

ENTS

Jun

e 15

, 20

14 S

ervi

ce C

han

ges

:

Red

Lin

e: O

ne la

te n

ight

trip

leav

ing

Dow

ntow

n at

1:1

4 a.

m. h

as b

een

adde

d on

w

eekd

ay e

veni

ngs.

Mos

t Sat

urda

y an

d S

unda

y se

rvic

e w

ill

term

inat

e at

Ove

rbro

ok J

unct

ion,

one

m

inut

e be

yond

Cas

tle S

hann

on S

tatio

n.

Fre

quen

cy o

f tr

ips

has

been

incr

ease

d to

ev

ery

15 m

inut

es b

etw

een

the

hour

s of

10

a.m

. and

11

p.m

.T

he

SL

Su

bw

ay L

oca

l has

bee

n

elim

inat

ed.

The

Red

Lin

e S

unda

y se

rvic

e op

erat

es o

n N

ew Y

ears

Day

, Mem

oria

l Day

, Ind

epen

denc

e D

ay, L

abor

Day

, Tha

nksg

ivin

g an

d C

hris

tmas

.

All

Sat

urda

y, S

unda

y an

d ho

liday

ser

vice

is

exte

nded

from

Ove

rbro

ok J

unct

ion

to S

outh

H

ills

Vill

age.

Let P

ort A

utho

rity

take

you

to P

ittsb

urgh

In

tern

atio

nal A

irpor

t. 2

8X A

IRP

OR

T F

LYE

R

oper

ates

sev

en d

ays

a w

eek

from

ap

prox

imat

ely

4:30

a.m

. to

12 M

idni

ght a

nd

depa

rts

ever

y 30

min

utes

from

Oak

land

, D

ownt

own

and

mos

t Wes

t Bus

way

sta

tions

.

Cal

l 412

.442

.200

0 or

pic

k up

a 2

8X ti

met

able

fo

r co

mpl

ete

sche

dule

info

rmat

ion.

SER

VIC

E N

OTE

S

AIR

POR

T SE

RVI

CE

Figure 10. Pittsburgh Light rail (the ‘T’ Lines) system map.

Hz and logs the data to an external hard drive. We downloaded the data manually from the onboardcomputers to our local computer every two weeks, because we are not allowed to install wireless devices onLRVs. We use National Instruments LabVIEW [35] to control the hardware and acquire acceleration data.

13/29

Page 14: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

Data-acquisitionModule

Data-storageModule

Data-processingModule

Data-sensingModule

Figure 11. Proposed data-management system.

Figure 12. Data collection system of LRV 4306. Top left picture shows the exteral view of the LRV.Bottom left figure shows a schematic of the sensor locations on the LRV, and the inside view of the trainand one highlighted uni-axle accelerometer are shown in the right hand side pictures.

Figure 13. Data collection system of LRV 4313. Top left picture shows the exteral view of the LRV.Bottom left figure shows a schematic of the sensor locations on the LRV, and the inside view of the trainand one highlighted uni-axle accelerometer are shown in the right hand side pictures.

Data-processing moduleBefore analyzing the collected data, it is necessary to segment it by geographical region. There are tworeasons: first, we want to have the LRVs travel along the same path in each particular region; second, theGPS signal will be lost in tunnels, and it is difficult to determine the location of excitations. The railnetwork is divided into eight regions to ensure continuous GPS trace in each region. Figure 14 shows theGPS trace of several passes and the associated track regions.

The next step in processing the GPS data is to register it to a ground truth of the track positionbecause the measurement error from collected GPS data could cause misalignment among different passes.PAAC provided us the foot by foot GPS position data, which allowed us to achieve this registration. We

14/29

Page 15: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

Type Model No. ofaxles

Sensitivity Amplituderange

ResonantFrequency

PiezoelectricAccelerometers

VibraMetrics5102

1 500 mV/g(±5%)

±10 g 2.5 kHz

PCB 354C03 3 500 mV/g(±10%)

±50 g ≥12 kHz

Table 1. Operating specifications of accelerometers.

Model Number of channels Sensitivity Update rate AccuracyBU-353 48 -163 dBm 1 Hz < 2.5 m

Table 2. Operating specifications of GPS receivers.

first utilized the iterative closest point (ICP) [36] algorithm to eliminate the global mismatch of the GPSdata by minimizing the difference between every two GPS point clouds. Figure 15 show the GPS traces of40 different outbound runs in the 5th track region before and after registration using ICP, respectively.For the local mismatch, the one-nearest neighbor algorithm is applied to register GPS position data ofdifferent runs to the nearest point of the ground truth GPS.

We also present environmental conditions and maintenance schedules during this monitoring period inthe following sections.

4.2.3 Environmental conditionsEnvironmental conditions, including temperature, wind, weather, and precipitation, vary significantlywhen we collect the data. It is not only because all the sensors are sensitive to the operating temperature,but also steel rail expands as it heats up. To record the environmental conditions, we used the timestamp and the trains’ GPS position to query environmental conditions from a weather data provider calledForecast.io (https://forecast.io/) when we were processing the collected data. This weather data providergathered hour-by-hour environmental observations from tens of thousands of stations worldwide.

4.2.4 Maintenance schedulesBecause we were not allowed to conduct an experiment on the track, we had to wait for changes madeby PAAC’s track maintenance. PAAC provides us with track allocation reports, which are stored in theDR-Train dataset as well. Those reports allow us to calibrate our data collection system and validate ourtrack monitoring approaches.

4.2.5 Code availabilityThe code used to register GPS positions via ICP can be downloaded from MathWorks’ file exchange [37].The script and function used to load and process the data files can be downloaded from the online coderepository. The codes have been tested using MATLAB 2017 on a typical personal computer and can runusing different MATLAB versions and computers.

4.2.6 Known issues• Even after GPS registration using ICP and one-nearest neighbor algorithms, misalignments of GPS

positions and accelerations in the spatial domain still exist. This is because the GPS has variableaccuracy levels at different locations.

• We found that the train’s ventilation system is a source of noise. On a warm day, the air conditionerturns on, and there is higher energy at 30 Hz. Whereas on a cold day, the air conditioner turns off,

15/29

Page 16: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

Figure 14. An example of the GPS trace of several passes through the ‘T’ lines and the associated trackregions used for analysis.

so we observed less energy at 30 Hz. The signal energy at 30 Hz does not depend on the train speedor the track roughness.

• Individual sensors or sensor channels can be malfunctioning. Our group has developed a data fusionapproach to address this problem [27].

4.3 Data RecordsFrom 2013 to 2016, we collected acceleration and GPS position data with corresponding environmentalconditions and maintenance logs. The DR-Train dataset includes 31 months of data from LRV 4306 and 11months of data from LRV 4313. Table 3 shows the number of passes collected from the eight geographicalregions mentioned in the Method section. ’Inbound’ means that the LRV goes from South to North, and’outbound’ means the opposite direction. In some regions, the number of outbound passes is larger thanthat of inbound passes because, as the train emerges from the tunnel, there is a delay before the GPS can

16/29

Page 17: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

-80.02 -80.01 -80 -79.99 -79.98

Longitude

40.37

40.38

40.39

40.4

Latit

ude

-3000 -2000 -1000 0 1000East (m)

3000

4000

5000

6000

7000

Nor

th (

m)

Figure 15. The GPS traces of 40 different outbound passes in the 5th track region before and afterregistation using ICP.

get a position lock.All the data details and directories are stored in ‘pass’ MATLAB objects in ‘\data files\LRV4306’

and ‘\data files\LRV4313’ folders. The ‘pass’ object is defined in the MATLAB class function ‘pass.m’.Properties and Methods of the ‘pass’ class are described in Table 4. For LRV 4306, there are 30,096 ‘pass’objects stored in the file ‘\data files\LRV4306\obj dict.mat’. LRV 4306 has five acceleration channels(property ‘sensor’), corresponding to the two uni-axial accelerometers inside the train cabin and the threechannels of the tri-axial accelerometer on the wheel truck. For LRV 4313, there are 18,929 ‘pass’ objectsstored in the file ‘\data files\LRV4313\obj dict.mat’. LRV 4313 has 8 acceleration channels (property‘sensor’), corresponding to the two uni-axial accelerometers and two tri-axial accelerometers inside thetrain cabin. The raw acceleration and GPS position data are stored in ‘acceleration data’ and ‘gps data’folders. These data can be retrieved from files into the MATLAB/Octave workspace by loading accelerationfilenames (property ‘acc raw’) and GPS filenames (property ‘gps raw’). Each raw acceleration file only hasa one-column MATLAB matrix in double-precision type, which is the logged temporal acceleration dataof one single channel during one service run. Each raw GPS file has a five-column MATLAB matrix indouble-precision type, corresponding to the longitude, latitude, altitude, velocity and time stamp duringone service run. The time stamp is a serial date number that represents the whole and fractional numberof days from a fixed, preset date ,January 0, 0000, in the proleptic ISO calendar.

Folder ‘\data files\track maintenance logs’ stores weekly maintenance schedule sheets from Light RailSystem, PAAC. Those files provide information on what was happening on the rail network. Typically, theonly work, which matters for this research, is done by the ‘Way Department’ that maintains the tracks.

This dataset is available from http://dx.doi.org/10.5281/zenodo.1432702.

4.4 Technical ValidationWe validated the technical quality of the presented dataset from three perspectives. First, we consider aseries of possible failure situations for the process of data collection and justify why we can rule them out;second, we consider a series of basic requirements that a high-quality dataset should satisfy and validatethat the presented dataset satisfies all the requirements; third, a series of experiments have shown thatchanges to the tracks can be successfully detected based on the presented dataset.

4.4.1 Data Collection FailuresWe consider the possible failure situations (missing data or corrupted data) as follows.

17/29

Page 18: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

LRV 4306 LRV 4313Region Inbound Outbound Inbound Outbound Total

1 226 363 51 82 7222 569 577 138 136 14203 579 567 135 131 14124 288 292 33 34 6475 317 317 102 96 8326 440 425 116 110 10917 356 342 85 80 8638 180 182 65 63 490

Total 2955 3065 725 732 7477

Table 3. Number of passes collected from LRVs 4306 and 4313 through the eight geographical regions.

• The installed accelerometers fail to sense acceleration signals;

• The data-acquisition system fails to transfer acceleration signals to the installed computer;

• The installed computer fails to store the acceleration signals on the local disk;

• The data-processing system fails to organize acceleration signals correspondingly in the database.

Data visualization is an efficient way to rule out the first and the second failure situations. We addedchart blocks in our LabVIEW implementation to visualize the collected acceleration signals instantaneously.We also plotted the collected data on our local computer. Figure 16 shows 27 properly collected spatial-domain acceleration samples of accelerometer channel 5 in geographical region 5 (outbound direction)during May 2014. By checking the size of logged data directly, we can rule out the third failure situation.Also, for this dataset, we rule out the top three failures by checking whether signals are sampled continuouslywith a constant sampling rate in the time domain. To rule out the fourth failure situation we have todownload the data back to our local computer and validate the technical quality of them after dataprocessing. The following section introduces two basic requirements of a high-quality dataset and theirvalidations.

4.4.2 Basic requirementsWe consider the basic requirements that a high-quality dataset should satisfy.

• R1: The acceleration signals collected from the same accelerometer channel should be consistentacross trials during a period, such as one day, one week or one month. Since the acceleration signalof each trial reflects the roughness of the same track, the overall profiles of acceleration signals shouldbe similar across trials. Otherwise, this dataset is problematic;

• R2: The acceleration signals should be correlated with the GPS positions. As a discrete-space signal,the acceleration signal of each trial is associated with the position of the track. For example, theamplitude of the acceleration signal around the train station should be low. Otherwise, this datasetis problematic;

Figure 17 and 18 show time periods (days) when sensors on LRVs 4306 and 4313 were recording. Bluelines indicate recording days, and white gaps indicate recording gaps in the dataset. The recording gapsare caused by one of three reasons: 1) Sometimes, the data-storage module runs out of space beforedownloading the data, and we are not allowed to install wireless sensors on the LRV in order to avoidinterference between signals from our system and those of train control & communication systems; 2) The

18/29

Page 19: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

Figure 16. Visulizing acceleration samples in the spatial domain. We visualize the 27 accelerationsamples of accelerometer channel five in region five during May 2014.

Figure 17. Time periods when sensors on LRV 4306 were recording. Blue lines indicate days whenaccelerometers and GPS were recording. The causes of recording gaps are explained in section 4.4.2.

data management system may not restart automatically after restarting the light rail vehicles, althoughwe have programmed it to do that; 3) The light rail vehicles were not in service because of maintenance,

19/29

Page 20: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

Figure 18. Time periods when sensors on LRV 4313 were recording. Blue lines indicate days whenaccelerometers and GPS were recording. The causes of recording gaps are explained in section 4.4.2.

inspection and repair.

Validation of R1To prove the consistency of acceleration signals from the same accelerometer channel, we use the two-sampleKolmogorov–Smirnov test (K-S test) [38]. The two-sample K-S test is a general nonparametric methodfor comparing two samples. It quantifies a distance between the empirical distribution functions of twosamples. The Kolmogorov–Smirnov statistic is

Dn,m = supx|F1,n(x)−F2,m(x)|,

where F1,n and F2,m are the empirical distribution functions of the two samples, respectively, and supis the supremum function. The null hypothesis that the samples are drawn from the same distribution isrejected at level α if

Dn,m > c(α)√n+m

nm,

where n and m are the sizes of the two samples respectively, and for 5% and 1% rejection levels, c(α) isequal to 1.36 and 1.63, respectively.

The null hypothesis of our test is that the acceleration signals from the same sensor channel in thesame time period and the same geographical region are drawn from the same distribution. The boxplot(Figure 19) shows a result of this test for LRV 4306. We randomly sample 1,000 acceleration amplitudesfrom each trail of the same sensor channel of LRV 4306 during May 2014 in region five. LRV 4306 ran27 outbound trails during May 2014 in region five. We then calculate the K-S statistics of each pair ofacceleration samples. For the tri-axle accelerometer installed on the central wheel truck (less electricalnoise), the tests of the three channels are not rejected, both at 5% significance level and at 1% significancelevel. However, the two uni-axle accelerometers installed in the cabin are rejected at 5% significance level,but not rejected at 1% significance level.

Validation of R2Because the trains were moving at different speeds, it is difficult to prove that every single accelerationand GPS position are correlated in the time domain. However, we can prove the geographical alignment ofacceleration signals and GPS positions with the information of train stations. When arriving at a trainstation, the LRV stays stationary and idles. We assume that amplitude of the idling accelerations wouldbe lower than the traveling accelerations due to the excitation of the tracks. Figure 20 shows two examplesof acceleration and position data in Region 5. There are five stations in Region 5: Memorial Station (MS),Killarney Station (KS), South Bank Station (SS), Denise Station (DS), and Bon Air Station (BS). Wecan observe that the amplitude of the acceleration signal is small when the LRV was at stations, and themileage stays the same at stations.

20/29

Page 21: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

1 2 3 4 5

Sensor channel

0

0.05

0.1

5% Rejection Threshold: 0.0611% Rejection Threshold: 0.073

Figure 19. A result of the two-sample K-S test. For LRV 4306, there are 27 outbound trails during May2014 in region five. We randomly sample 1,000 acceleration amplitudes from each trail and calculate theK-S statistics of each pair of samples. If the statistic is above the red line, the null hypothesis that twosamples are drawn from the same distribution is rejected at level 5%. If the statistic is above the blackline, the null hypothesis is rejected at level 1%.

Figure 20. Two examples of acceleration and position data in region five. When the LRV was at those fivestations: Memorial Station (MS), Killarney Station (KS), South Bank Station (SS), Denise Station (DS),and Bon Air Station (BS), the amplitude of the acceleration signal is small, and mileage stays the same.

4.4.3 Publications based on the presented datasetPapers [25, 26, 27, 39] using the data described here have been published. Lederman et al. used thecollected dataset to detect changes in the tracks, including changes in the tracks due to repair and changesin track geometry due to tamping, by applying implicit and explicit models. The implicit model firstextracts different features from the raw acceleration signals and then performs change detection withsome common methods, including cumulative sum chart control (CUSUM) [40], generalized likelihoodratio (GLR) [40] and Haar filter [41]. The explicit model solves for the parameters of the train’s mainsuspension by enforcing sparsity in modeling the train system and learns where in the tracks the train ismost excited by enforcing sparsity in the track profile. Lederman et al. [27] also proposed a data fusionapproach for enabling data-driven rail-infrastructure monitoring from multiple in-service trains using theimplicit model of the tracks.

21/29

Page 22: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

4.5 Usage NotesThe data is provided in a MAT-file format with query files, and therefore it is convenient to load it inMATLAB. The script file (\code\main script.m) calls the function (\code\load processing.m) for loadingand processing data and returns ‘pass’ objects, acceleration signals, and GPS positions.

22/29

Page 23: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

5 Variational autoencoder for detecting anomalies in longitude elevation of track geometryusing the dynamic response of an in-service train

Track geometry is the three-dimensional geometry of track layouts such as transverse elevations andlongitudinal elevations, whose correct positions are of critical importance to ensure the safety of trainoperations. In this section, we propose a learning-based anomaly detection approach for monitoringlongitude elevation of track geometry from the dynamic response of an in-service train. Also, we havevalidated this approach on the DR-Train dataset.

In general, the methods for estimating track geometry using acceleration data can be divided intotwo groups: model-driven methods [23, 42] and data-driven methods. In model-driven methods, there isusually a train-track interaction system that models the inverse relationship between vehicle accelerationand track geometry using prior knowledge about the system. However, the dynamics model depends onthe characteristics of the train system, such as mass and the suspension system properties, which makemodel-driven methods susceptible to system characteristic uncertainties. For data-driven methods, FourierTransform [22] and Wavelet [43] approaches are commonly employed to extract features from accelerationlog and identify track geometry changes. However, data-driven methods, which rely much on the data, lackrobustness to noise and require more physical insight for model selection and performance improvement.

It is worth noticing that both approaches mentioned above heavily rely on hand-crafted features.However, due to the noise in the collected accelerations, detecting anomalies using hand-crafted featuresoften results in more false alarms. Many researchers have turned to learning-based methods to extractfeatures from time-series signals automatically. Ordonez et al. [44] used a many-to-one Long-Short-Term-Memory (LSTM) for wearable activity recognition. DiPietro et al. [45] proposed an LSTM-ConditionalRandom Field (CRF) model to extract features from raw acceleration signals for surgical activity labeling.Though learning-based methods obtained decent results in the activity recognition field, they might notbe suitable for the geometry inspection problem for three reasons: (1) The acceleration signal is a directmeasurement of the activity recognition application but an indirect measurement in the track inspectiontask. Therefore, acceleration signals recorded from an in-service train will contain more noise and willnot directly reflect the track geometry. (2) activity recognition is a well-defined classification problem,but track geometry inspection is an anomaly detection problem, which is likely to have an imbalancedistribution over normal samples and abnormal samples. (3) Directly using raw acceleration signals asinput results in a high-dimension feature space, which can easily cause overfitting.

Considering the limitations of the LSTM-based methods, we turned to a variational autoencoder(VAE)-based anomaly detection method based on the insight of the underlying physical motion model.Through the solution of a single-mode differential equation of the motion model, we found that thespectrum of an acceleration signal is closely related to the spectrum of its corresponding track geometry.We considered the track geometry with a change as an anomaly, measured by the energy of the geometryslopes. The proposed method uses a VAE to detect anomalies. The VAE takes accelerations as inputand learns a mapping from the frequency-domain of accelerations to a low-dimensional latent spacethat represents the distribution of the observed data. After the model is calibrated, the reconstructionprobability of the input data is used as an anomaly score for indicating how well the input follows thenormal pattern.

We validate the proposed VAE-based approach on the vibration data (DR-Train) collected froman in-service train that runs on a 42.2-km light rail network in the city of Pittsburgh, Pennsylvania.Figure 21 to 24 shows the results of our proposed method (VAE) with different anomaly thresholds for theenergy of geometry slopes. It successfully addresses the overfitting problem presented in the LSTM-basedmethods and outperforms two supervised baseline models, a logistic regression model (Baseline) and afully-connected neural network (NN-Feature), in terms of recall, precision, F1-score and area under thereceiver operating characteristic (ROC) curve. The results outperform the other two methods and make theproposed indirect structural health monitoring approach a strong candidate for low-cost and frequent track

23/29

Page 24: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

geometry inspection. Nonetheless, the authors identified three primary sources of error of the proposedapproach: (1) GPS noise and drifting. Due to the noise from the GPS unit, acceleration signals fromdifferent passes are not well aligned with each other spatially; (2) Accelerometer noise. Accelerometersusually experience a high noise level that requires data denoising for acceleration data; (3) Asynchronousdata. Data from different passes with different train velocities are collected asynchronously. In the future,we will concentrate on studying uncertainties of the model and improving its performance by addingphysical constraints and prior knowledge.

Figure 21. Recall, precision, F1-Score,and AUC with a 50 threshold of theenergy of 4-second slopes.

Figure 22. Recall, precision, F1-Score,and AUC with a 55 threshold of theenergy of 4-second slopes.

Figure 23. Recall, precision, F1-Score,and AUC with a 60 threshold of theenergy of 4-second slopes.

Figure 24. Recall, precision, F1-Score,and AUC with a 65 threshold of theenergy of 4-second slopes.

24/29

Page 25: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

6 Project ContributionsWe have made contributions to indirect structural health monitoring, which is a low-cost and low-maintenance technology to monitor the rail and bridge infrastructure continuously.

Over the one-year project, we have developed and validated algorithms which are able to estimate,localize and quantify gradual changes in the infrastructure. We have investigated this problem in atheoretical way and conducted lab-scale experiments to prove the applicability of the indirect structuralhealth monitoring. Signal processing and dimensionality reduction methods are used to achieve damageseverity comparison, estimation, and localization with high accuracy. We hope we can continue to conducton-site experiments and further studies for validating and consolidating the robustness of this damagediagnosis framework in the real world.

Also, we have contributed to released an open-access dataset recording dynamic responses from twoin-service light rail vehicles that run on a 42.2-km light rail network in the city of Pittsburgh, Pennsylvania(USA). Our dataset also contains GPS positions and environmental data (including temperature, wind,weather, and precipitation) as the vehicles were running on the track and the track maintenance logs fromthe light-rail operator. We hope to publicize this dataset widely, which could inspire more research in thisfield, and allow us to understand the collected data better.

We have proposed a novel learning-based anomaly detection approach for monitoring longitude elevationof track geometry from the dynamic response of an in-service train. This approach overcomes limitationsof model-based methods and methods using hand-crafted features, which are susceptible to characteristicuncertainties of the train-track interaction system and have many false alarms. It also outperformsconventional data-driven methods that use high dimensional time-domain accelerations as input and sufferoverfitting for the track geometry anomaly detection problem. Our results as presented in section 5 makethe proposed indirect structural health monitoring approach a strong candidate for low-cost and frequenttrack geometry inspection.

25/29

Page 26: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

References1. FHWA. Highlights from fhwa’s 2017 national bridge inventory data (2017).

2. ASCE. 2014 infrastructure report card: Pennsylvania infrastructure grades (2014).

3. Yang, Y. & Lin, C. Vehicle–bridge interaction dynamics and potential applications. J. sound vibration284, 205–226 (2005).

4. Malekjafarian, A. & OBrien, E. J. Identification of bridge mode shapes using short time frequencydomain decomposition of the responses measured in a passing vehicle. Eng. Struct. 81, 386–397(2014).

5. Kim, C.-W. et al. Experimental investigation of drive-by bridge inspection. In 5th InternationalConference on Structural Health Monitoring of Intelligent Infrastructure (SHMII-5), Cancun, Mexico,11-15 December, 2011 (Instituto de Ingenierıa, UNAM, 2011).

6. Lin, C. & Yang, Y. Use of a passing vehicle to scan the fundamental bridge frequencies: Anexperimental verification. Eng. Struct. 27, 1865–1878 (2005).

7. Siringoringo, D. M. & Fujino, Y. Estimating bridge fundamental frequency from vibration responseof instrumented passing vehicle: analytical and experimental study. Adv. Struct. Eng. 15, 417–433(2012).

8. Oshima, Y., Yamamoto, K. & Sugiura, K. Damage assessment of a bridge based on mode shapesestimated by responses of passing vehicles. Smart Struct. Syst. 13, 731–753 (2014).

9. Malekjafarian, A. & OBrien, E. J. Identification of bridge mode shapes using short time frequencydomain decomposition of the responses measured in a passing vehicle. Eng. Struct. 81, 386–397(2014).

10. Yang, Y., Li, Y. & Chang, K. Constructing the mode shapes of a bridge from a passing vehicle: atheoretical study. Smart Struct. Syst. 13, 797–819 (2014).

11. Nguyen, K. V. & Tran, H. T. Multi-cracks detection of a beam-like structure based on the on-vehiclevibration signal and wavelet analysis. J. Sound Vib. 329, 4455–4465 (2010).

12. McGetrick, P. & Kim, C. A wavelet based drive-by bridge inspection system. In Proceedings of the7th International Conference on Bridge Maintenance Safety and Management (IABMAS’14) (2014).

13. Lederman, G. et al. Damage quantification and localization algorithms for indirect shm of bridges. InProc. Int. Conf. Bridge Maint., Safety Manag., Shanghai, China (2014).

14. Chen, S. et al. Semi-supervised multiresolution classification using adaptive graph filtering withapplication to indirect bridge structural health monitoring. IEEE Transactions on Signal Process. 62,2879–2893 (2014).

15. Cerda, F. et al. Indirect structural health monitoring of a simplified laboratory-scale bridge model.Smart Struct. Syst. 13, 849–868 (2014).

16. Caruana, R. Multitask learning. Mach. learning 28, 41–75 (1997).

17. Association of american railroads. total annual spending: 2015 data (2015).

18. Federal railroad administration office of safety analysis. train accidents and rates (2017).

26/29

Page 27: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

19. Caprioli, A., Cigada, A. & Raveglia, D. Rail inspection in track maintenance: A benchmark betweenthe wavelet approach and the more conventional fourier analysis. Mech. Syst. Signal Process. 21,631–652 (2007).

20. Westeon, P. et al. Monitoring vertical track irregularity from in-service railway vehicles. Proc.institution mechanical engineers, Part F: J. Rail Rapid Transit 221, 75–88 (2007).

21. Oshima, Y., Yamamoto, K., Sugiura, K., Tanaka, A. & Hori, M. Simultaneous monitoring of thecoupled vibration between a bridge and moving trains. In Proc. 5th IABMAS Conference, Philadelphia,USA, 186 (2010).

22. Molodova, M., Li, Z. & Dollevoet, R. Axle box acceleration: Measurement and simulation for detectionof short track defects. Wear 271, 349–356 (2011).

23. Real, J., Salvador, P., Montalban, L. & Bueno, M. Determination of rail vertical profile throughinertial methods. Proc. Inst. Mech. Eng. Part F: J. Rail Rapid Transit 225, 14–23 (2011).

24. Lee, J. S., Choi, S., Kim, S.-S., Park, C. & Kim, Y. G. A mixed filtering approach for track conditionmonitoring using accelerometers on the axle box and bogie. IEEE Transactions on InstrumentationMeas. 61, 749–758 (2012).

25. Lederman, G. et al. Track-monitoring from the dynamic response of an operational train. Mech. Syst.Signal Process. 87, 1–16 (2017).

26. Lederman, G. et al. Track monitoring from the dynamic response of a passing train: a sparse approach.Mech. Syst. Signal Process. 90, 141–153 (2017).

27. Lederman, G. et al. A data fusion approach for track monitoring from multiple in-service trains.Mech. Syst. Signal Process. 95, 363–379 (2017).

28. Wallace, J., Stewart, J. & Taciroglu, E. Field vibration testing and analytical studies of a four-storyrc building (nees-2011-1074) (2017).

29. Micucci, D., Mobilio, M. & Napoletano, P. Unimib shar: A dataset for human activity recognitionusing acceleration data from smartphones. Appl. Sci. 7, 1101 (2017).

30. Medrano, C., Igual, R., Plaza, I. & Castro, M. Detecting falls as novelties in acceleration patternsacquired with smartphones. PloS one 9, e94811 (2014).

31. Zhang, Y. et al. Accelerometer-based gait recognition by sparse representation of signature pointswith clusters. IEEE transactions on cybernetics 45, 1864–1875 (2015).

32. Mistras group, inc. product data sheet: Vibrametrics models 5102 (2013).

33. Pcb piezotronics mts systems corporation. model: 354c03.

34. Usglobalsat inc. usb gps receiver: Bu-353.

35. National instruments. labview (2012).

36. Bergstrom, P. & Edlund, O. Robust registration of point sets using iteratively reweighted least squares.Comput. Optim. Appl. 58, 543–561 (2014).

37. Bergstrom, P. Matlab function for iterative closest point method.

27/29

Page 28: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

38. Corder, G. W. & Foreman, D. I. Nonparametric statistics: A step-by-step approach (John Wiley &Sons, 2014).

39. Lederman, G., Noh, H. Y. & BIELAK, J. Rail-infrastructure monitoring through the dynamic responseof a passing train. Struct. Heal. Monit. 2015 (2015).

40. Gustafsson, F. & Gustafsson, F. Adaptive filtering and change detection, vol. 1 (Citeseer, 2000).

41. Vetterli, M., Kovacevic, J. & Goyal, V. K. Foundations of signal processing (Cambridge UniversityPress, 2014).

42. O’Brien, E. J., Bowe, C. & Quirke, P. Determination of vertical alignment of track using accelerometerreadings. In IMechE Stephenson Conference for Railways: Research for Railways, 21-23 April, 2015(2015).

43. Bocciolone, M., Caprioli, A., Cigada, A. & Collina, A. A measurement system for quick rail inspectionand effective track maintenance strategy. Mech. Syst. Signal Process. 21, 1242–1254 (2007).

44. Ordonez, F. J. & Roggen, D. Deep convolutional and lstm recurrent neural networks for multimodalwearable activity recognition. Sensors 16, DOI: 10.3390/s16010115 (2016).

45. DiPietro, R. et al. Recognizing surgical activities with recurrent neural networks. In InternationalConference on Medical Image Computing and Computer-Assisted Intervention, 551–558 (Springer,2016).

46. Esveld, C. Modern railway track: Digital edition (2015).

47. Kingma, D. P. & Welling, M. Auto-encoding variational bayes. arXiv preprint arXiv:1312.6114(2013).

28/29

Page 29: FINAL RESEARCH REPORT - Carnegie Mellon University · Conference Presentation. Liu J., ”A Damage Localization and Quantification Algorithm for Indirect Structural Health Monitoring

Name Description

Properties

acc al address where aligned accelerometer data is storedacc raw address where accelerometer data is stored

acc samp accelerometer sampling rate (Hz)count this is the indice of the pass in terms of all passesdate date in UTC timedatel date string in local timedaten property date in ’datenum’ formatdatenl property datal in ’datenum’ format

direction ’inbound’ our ’outbound’gps al address where aligned GPS is stored

gps raw address where GPS is storedgps samp gps sampling rate (Hz)

region region on the map where the signal comes fromsensor number of sensor channel

summary weather summary text at time of passsummary8 weather summary text 8 hours prior to pass

temp temperature at the time of the passtemp8 temperature 8 hours prior to passtrain number of the train

Methods

addlistener Add listener for event.addprop Add dynamic property to MATLAB object.

coor show the coordinate system for the selected datadate bounds This function take an array of passes, as well as lower and

upper bounds for the dates (as strings), then plots the datathat falls between the two specified date strings. Note thisoutputs the dates selected.

delete Delete a handle object.eq == (EQ) Test handle equality.

findobj Find objects matching specified conditions.findprop Find property of MATLAB handle object.

ge >= (GE) Greater than or equal relation for handles.gt > (GT) Greater than relation for handles.

isvalid Test handle validity.le <= (LE) Less than or equal relation for handles.

listener Add listener for event without binding the listener to thesource object.

lt < (LT) Less than relation for handles.ne = (NE) Not equal relation for handles.

notify Notify listeners of event.plot both Plot the data in time and frequency domainplot freq Plot the data in the frequency domainplot gps Plot the GPS trace on a mapplot time Plot the data in the time domainscatter Plot energy of the signal at the GPS points

Table 4. Properties and methods of the ’pass’ class.

29/29