thesis - wiredspace homewiredspace.wits.ac.za/jspui/bitstream/10539/21639/3/plagiarism... ·...

thesis.pdfby Warwick Masson

FILE

TIME SUBMITTED 18-MAR-2016 12:42PM

SUBMISSION ID 647041006

WORD COUNT 11311

CHARACTER COUNT 54865

WARWICK_MASSON_388188_THESIS.PDF (2.41M)

9%SIMILARITY INDEX

6%INTERNET SOURCES

7%PUBLICATIONS

2%STUDENT PAPERS

1 <1%

2 <1%

3 <1%

4 <1%

5 <1%

6 <1%

7 <1%

thesis.pdfORIGINALITY REPORT

PRIMARY SOURCES

Submitted to University of WitwatersrandStudent Paper

www.ausy.tu-darmstadt.deInternet Source

S. Schaal. "Reinforcement Learning forParameterized Motor Primitives", The 2006IEEE International Joint Conference onNeural Network Proceedings, 2006Publicat ion

J. Zico Kolter. "Regularization and featureselection in least-squares temporaldifference learning", Proceedings of the 26thAnnual International Conference on MachineLearning - ICML 09 ICML 09, 2009Publicat ion

Lecture Notes in Computer Science, 2008.Publicat ion

leenissen.dkInternet Source

Bartana, A.. "Laser cooling of molecules bydynamically trapped states", Chemical

8 <1%

9 <1%

10 <1%

11 <1%

12 <1%

13 <1%

14 <1%

15 <1%

Physics, 20010601Publicat ion

www.eea-esem.comInternet Source

web.it.kth.seInternet Source

www.ausy.informatik.tu-darmstadt.deInternet Source

Jens Kober. "Policy search for motorprimitives in robotics", Machine Learning,11/06/2010Publicat ion

Bei Li, , Siddharth Gangadhar, SamuelCheng, and Pramode K. Verma. "Maximizeuser rewards in distributed generationenvironments using reinforcement learning",IEEE 2011 EnergyTech, 2011.Publicat ion

www.cs.cmu.eduInternet Source


A.M. Tehrani. "FUZZY REINFORCEMENTLEARNING FOR EMBEDDED SOCCERAGENTS IN A MULTI-AGENT CONTEXT",International Journal of Robotics andAutomation, 2006Publicat ion

16 <1%

17 <1%

18 <1%

19 <1%

20 <1%

21 <1%

22 <1%

23 <1%

24 <1%

25 <1%

26

Natural Language Dialog Systems andIntelligent Assistants, 2015.Publicat ion


www.ijcai.orgInternet Source

citeseerx.ist.psu.eduInternet Source

Adaptation Learning and Optimization, 2012.Publicat ion

www.lri.frInternet Source

ijcai.orgInternet Source

www.bio-nica.infoInternet Source

Submitted to University of EdinburghStudent Paper

R.E. Skelton. "A convexifying algorithm forthe design of structured linear controllers",Proceedings of the 39th IEEE Conference onDecision and Control (Cat No 00CH37187)CDC-00, 2000Publicat ion

Submitted to University of Florida

<1%

27 <1%

28 <1%

29 <1%

30 <1%

31 <1%

32 <1%

33 <1%

34 <1%

Student Paper

www.cs.duke.eduInternet Source


www.db-thueringen.deInternet Source


db.s2.chalmers.seInternet Source

H. Tsujino. "Connectionist ReinforcementLearning with Cursory Intrinsic Motivationsand Linear Dependencies to MultipleRepresentations", The 2006 IEEEInternational Joint Conference on NeuralNetwork Proceedings, 2006Publicat ion

Roberto Battiti. "Learning While Optimizingan Unknown Fitness Surface", Lecture Notesin Computer Science, 2008Publicat ion

Cheng, Yuhu, Huanting Feng, and XuesongWang. "Eff icient data use in incrementalactor–critic algorithms", Neurocomputing,2013.

35 <1%

36 <1%

37 <1%

38 <1%

39 <1%

40 <1%

41 <1%

42 <1%

Publicat ion

www.dtic.upf.eduInternet Source

rldm.orgInternet Source

roboticsproceedings.orgInternet Source

users.soe.ucsc.eduInternet Source

dsl.serc.iisc.ernet.inInternet Source

Yamaguchi, Akihiko, Jun Takamatsu, andTsukasa Ogasawara. "DCOB: Action spacefor reinforcement learning of high DoFrobots", Autonomous Robots, 2013.Publicat ion

Nguyen, Sao Mai, and Pierre-Yves Oudeyer."Socially guided intrinsic motivation for robotlearning of motor skills", AutonomousRobots, 2014.Publicat ion

Levine, Sergey, Nolan Wagener, and PieterAbbeel. "Learning contact-rich manipulationskills with guided policy search", 2015 IEEEInternational Conference on Robotics andAutomation (ICRA), 2015.Publicat ion

43 <1%

44 <1%

45 <1%

46 <1%

47 <1%

48 <1%

49 <1%

50 <1%

51 <1%

www.irp.oist.jpInternet Source

people.cs.ubc.caInternet Source

Apelian, . "The Derivative", Pure and AppliedMathematics, 2009.Publicat ion

Magdi S. Mahmoud. "MathematicalFoundations", Switched Time-Delay Systems,2010Publicat ion


busoniu.netInternet Source

hto-b.usc.eduInternet Source

Lassaigne, Richard, and Sylvain Peyronnet."Approximate planning and verif ication forlarge Markov decision processes",International Journal on Software Tools forTechnology Transfer, 2015.Publicat ion

Reinhart, René Felix, and Jochen Jakob Steil."Eff icient policy search in low-dimensionalembedding spaces by generalizing motionprimitives with a parameterized skill

52 <1%

53 <1%

54 <1%

55 <1%

56 <1%

57 <1%

58 <1%

59 <1%

memory", Autonomous Robots, 2015.Publicat ion

www.ri.cmu.eduInternet Source

da Motta Salles Barreto, A.. "Restrictedgradient-descent algorithm for value-functionapproximation in reinforcement learning",Artif icial Intelligence, 200803Publicat ion

refoteka.ruInternet Source

Dijst, Martin. "ICTs and Accessibility: AnAction Space Perspective on the Impact ofNew Information and CommunicationTechnologies", Advances in Spatial Science,2004.Publicat ion

Wawrzyński, Paweł, and Ajay KumarTanwani. "Autonomous reinforcementlearning with experience replay", NeuralNetworks, 2012.Publicat ion

allserv.kahosl.beInternet Source

www.elec.qmul.ac.ukInternet Source

www.davidandre.comInternet Source

60 <1%

61 <1%

62 <1%

63 <1%

64 <1%

65 <1%

66 <1%

67 <1%

68 <1%

148.204.65.169Internet Source

www.cs.mcgill.caInternet Source

apps.cs.utexas.eduInternet Source

Undergraduate Texts in Mathematics, 1996.Publicat ion

Xiaoming Chen. "Stability analysis andcontrol design for 2-D fuzzy systems viabasis-dependent Lyapunov functions",Multidimensional Systems and SignalProcessing, 11/17/2011Publicat ion

"Asymptotic Optimality", Springer Texts inStatistics, 1998Publicat ion


Shun-ichi Amari. "Natural Gradient WorksEfficiently in Learning", Neural Computation,02/1998Publicat ion

Rosa, Paulo, Gary J. Balas, Carlos Silvestre,and Michael Athans. "A Synthesis Method ofLTI MIMO Robust Controllers for UncertainLPV Plants", IEEE Transactions on Automatic

69 <1%

EXCLUDE QUOTES ON

EXCLUDEBIBLIOGRAPHY

ON

EXCLUDE MATCHES OFF

Control, 2014.Publicat ion

Wookey, Dean S., and George D. Konidaris."Regularized feature selection inreinforcement learning", Machine Learning,2015.Publicat ion

thesis - wiredspace homewiredspace.wits.ac.za/jspui/bitstream/10539/21639/3/plagiarism... ·...

Documents