745
1 INTRODUCTION
General collision avoidance rules between two or
more vessels are described in convention known as
the COLREGs‐the International Regulations for
Preventing Collisions at Sea, published by IMO
(International Maritime Organization) at London on
20 October 1972. It replaced similar previous
regulationsof1960.TheConventioncontainsrules
of
the road for seagoing vessels, definitions of ships,
arrangement of lights and shapes, description of
soundandlightsignals.
It is important to mention that regarding
COLREGs there is no absoluteʺright of wayʺ
privilege of a seagoing vessel over any other
encountered vessel. It is rather
described asʺgive
wayʺ(burdened)vesselandaʺstandonʺ(privileged)
vessel, sometimes there may be two “give way”
vesselswithno“standon”vessel.Astandonvessel
maystillbeobligedto giveway,ifthereisa risk of
collision. A commander on the bridge shall never
assume
theothervesselshareshisviewofwhichrules
apply in particular situation. The decision is not
always clear, it may differ depending on weather
condition, visibility, experience and many other
factors. It may be aided by intelligent decision
supportsystemwithfuzzyvalues(Pietrzykowskiand
Małujda, 2012). There
are also solutions for
unmanned vessels (Naeem et al., 2012) and finding
safeshiptrajectorieswithfocusonbetterhandlingof
COLREGS (Szłapczyński and Sz łapczyńska, 2012).
Interestingapproachtomultishipcollisionavoidance
underCOLREGsregulationshasbeenproposedwith
usage of Artificial Potential Field method (Wang
et
al.,2017)tocalculateshipssafetydomain.
Neuroevolutionary Approach to COLREGs Ship
Maneuvers
M.Łącki
GdyniaMaritimeUniversity,Gdynia,Poland
ABSTRACT:Thepaperdescribestheusageofneuroevolutionarymethodincollisionavoidanceoftwopower
drivenvesselsapproachingeachotherregardingCOLREGsrules.Thismaybealsobeseenastheshiphandling
systemthatsimulatesalearningprocessofagroupofartificialhelmsmen‐autonomous
controlunits,created
withartificialneuralnetworks.Thehelmsmanobservesanenvironmentbyitsinputsignalsandaccordingto
assignedCORLEGs rule, hecalculates the values of requiredparameters of maneuvers (propellers rpm and
rudderdeflection)inacollisionavoidancesituation.Inneuroevolutionsuchunits aretreatedasindividualsin
population of artificial neural networks, which through environmental sensing and evolutionary algorithms
learntoperformgiven task safelyand efficiently. The main task of this projectis to evolvea population of
helmsmenwhichisabletoeffectivelyimplementchosenrule:crossingorovertaking.
http://www.transnav.eu
the International Journal
on Marine Navigation
and Safety of Sea Transportation
Volume 13
Number 4
December 2019
DOI:10.12716/1001.13.04.06
746
2 NEUROEVOLUTION
Neuroevolution is an artificial intelligence method
that uses evolutionary algorithms (EA) to generate
artificial neural networks (ANN), its parameters,
topology and rules. Such combination gives the
advantage of flexibility and adaptability, which
allows to adjust the computational structures to the
dynamicallychangingconditionsencounteredduring
ship maneuvering and
are intensively studied and
implementedin many differentfields of science and
research, including robotics (Lee et al., 2013),
automation processes (Stanley et al., 2005), multi
agent systems (Nowak et al., 2008) designing and
diagnostics(Larkinetal.,2006),andmanymore.
Neuroevolutionary algorithms are successful
methods for optimizing neural
networks topologies
fordynamiccontinuousreinforcementlearningtasks.
Their significant advantage over gradientbased
algorithms is the capability to modify network
topologies along with connection weights, resulting
withbroadersearchspaceofpossiblesolutions.
The proper maneuvers of ship maneuvering
accordingtoCOLREGsrulesisessentialtothesafety
of people,
equipment, cargo and the environment.
Increaseofcomputationalpowerofelectronicdevices
allows to implement as complex algorithms as
neuroevolution into advanced decision support
systems(DSS)alsointhefieldofmarinenavigation.
Throughcontinuousenvironmentobservationand
learning process, such DSS shall predict the vessels’
rudderangleandpropeller
revolutionsasaccurate as
possibletoensuresafeimplementationofchosenrule.
It is possible to calculate these output signals when
thereisasimulationmodelofthevesselavailable.
InneuroevolutionANNistreatedasanindividual
inapopulationofmultiplenetworks.Neuroevolution
is able to find a solution
of a complex and
dynamically changing task with ANN created and
modifiedwithEA. Thebasictopologiesof theinitial
populationarerandomlydeterminedatthebeginning
oflearningprocess.Eachindividualactsasabrainof
artificialhelmsmanandbeginstheprocessoffinding
a solution with the same
starting parameters (ships
initial position, course, velocity, rudder angle, rpm).
Theactionofeachindividualisusuallyassessedwith
thereinforcementlearningalgorithms(Stanleyetal.,
2005)andevolutionarystageofthesystemshallselect
individuals best suited to the task during selection
stage, which determines the whole population to
improve
itsgeneticmaterialovertime.
Evaluation of each individual is being processed
duringwholesimulationaftersomeimportantevents
tookplace,asforexample:
movingthevesseloutoftheareaoronforbidden
sector, i.e. the safety domain of an encountered
vessel,
making rapid and/or frequently changing
maneuvers,i.e.tofrequentalterationofrpm,
leadingtoimproper ships’ movement parameters
values,i.e.linearand/orangularvelocity too low
ortoohigh,
movingthevesselawayfromgoal,
reachingagoal.
Alltheseeventmustbearbitrarilyrated,resulting
inarewardtoevaluatedindividual,thus
valuatingits
fitness important in evolutionary stage of the
algorithm and consequently his chance of
reproductionandsurvivaltothenextgeneration.
Evolutionary process of the system consists three
mainsteps:
selectionofthebestindividualorindividuals,
reproduction (with crossover and mutation sub
processes),
replacement
(offspring replaces worst
individuals).
Forthepurposeofthistasktheneuroevolutionary
method, the modified NEAT algorithm, with direct
encoding of neural network topology has been
implemented.
NEAT (NeuroEvolution of Augmenting
Topologies) (Stanley and Risto, 2002) adjust the
topology of ANN’s with EA gradually, allows to
obtainasetofindividuals
thatarebestfittedtogiven
task.
Input and output signals of ANN’s have been
determinedatthebeginningofdesigningphaseofthe
system.Properlydesignedsetofsignalsconsideredin
the model is crucial for efficiency of the system as
muchasforitsfidelityandaccuracyin
comparisonto
therealnavigationalsituation.
Inputsignalsinthesystem,withthreedegreesof
freedomofthevesselmovement,areasfollows:
ships’courseoverground,
ships’angularvelocity,
ships’speedoverground,
ships’position,
ships’distanceandangletogoal,obstaclesandto
the
encounteredvessel,
mainpropellerrevolutions(currentandpreset),
rudders’deflection(currentandpreset).
Infutureresearchothersignalsfromenvironment
maybetakenintoaccount,i.e.wind,current,waves,
cargo,trimandroll,ifdeliveredinashipmodel.
Outputsignals of ANNs generates the values for
steering
thevessel:
rpmofmainpropeller,
rudders’deflection.
Alloftheinputandoutputsignalsarenormalized
andencodedasrealvaluesbetween0and1.
Each node in ANN represents a neuron that
producesarealvaluebetween0and1asaresultof
normalized weighted sum
of its inputs.
Normalization of weighted sum is performed with
sigmoidfunction.
The simulation results are shown below for
simulationmodel of threedegreesoffreedom VLCC
crude oil tanker “Esso Norway” with the single
propellerandsinglerudder(Figure1).
747
Figure1.VLCCoiltanker„EssoNorway” (19691986)
Mainparametersofthesimulatedvesselhasbeen
placedintable1.
Table1.Mainparametersof“EssoNorway”
______________________________________________
ParameterValue
______________________________________________
Lengthoverall323,8m
Lengthbetweenperpendiculars 304,8m
Beam47,3m
Max.draft18,46m
Deadweighttonnage193048t
Max.revolutionsofpropeller80rpm
Max.rudderdeflection±20°
_____________________________________________
Inthissimulationithasbeenassumedthat“Esso
Norway”willencountersecondvesselofsimilarsize
and heading forward on steady course. Her safety
domain has been established as simplified rectangle
shape 3 length ahead of her bow and one length
behindthestern(Figure2).Widthofthisdomain
is2
lengthofthevessel.
Figure2.Simplifiedsafetydomainoftheencounteredvessel
3 SIMULATIONOFCOLREGSMANEUVERS
3.1 Rule13Overtaking
Rule13statesthat:
“a)Notwithstandinganything containedintheRules
ofPartB,SectionsIandII,anyvesselovertakingany
other shall keep out of the way of the vessel being
overtaken.
(b)A vessel shall be deemed to
be overtaking when
coming up with a another vessel from a direction
morethan22.5degreesabaftherbeam,thatis,insuch
a position with reference to the vessel she is
overtaking,thatatnightshewouldbeabletoseeonly
the sternlight of that vessel but neither
of her
sidelights.
(c)Whenavesselisinanydoubtastowhethersheis
overtaking another, she shall assume that this is the
caseandactaccordingly.
(d)Anysubsequentalterationofthebearingbetween
thetwovesselsshallnotmaketheovertakingvessela
crossingvesselwithinthe
meaningoftheseRulesor
relieve her of the duty of keeping clear of the
overtakenvesseluntilsheisfinallypastandclear.”
Overtakingsituationispresentedonfigure3.
Figure3.Initialsituationofovertakingmaneuvers
VesselAisheadingnorthfaster(5m/s)thanvessel
B (3 m/s), thus overtaking maneuver occurs. In this
case it is strongly recommended that vessel A shall
overtakeBonherportside.
748
Figure4. An example of recorded routes of a whole
population
As one can see on figure 4 there is a broad
spectrumofroutestakenontheportsideofovertaken
vessel B. Many of them end width failure upon
enteringothervessels’safetydomain,ormovingout
of the area. Some routes end within the goal with
improper course over
ground, but fitness values of
these individuals are better than those leaving the
area and their chance of survival to the next
generation is greatly higher. It is of course not
sufficient to reproduce as intense as the best
individuals but is not totally excluded and has a
chanceto
improveitsfitness insubsequent episodes
ofsimulationprocess.
Figure5.Finalrouteofthebestindividual
The results of simulation is the route of the best
individual, chosen after specified duration of
simulation. His actions taken and maneuvering
parametersvalueshasbeenpresentedinfigures610.
Figure6.Suggestedandactualrudderangle
The system calculated 3 rudder angle changes,
resultingchangesinangularvelocityaswell.
Figure7.Angularvelocity
Figure8.Linearvelocityinm/sandinknots
Figure9.Suggestedandactualrevolutions[rpm]
Propeller revolutions changes suggested by the
system seem to be too low, causing in linear speed
decrease below 4 m/s [7 kn]. It may be solved in
further simulation by the adjustment of a reward
valuerelatedtoships’velocityortimeoftravel.
Figure10.Courseoverground
3.2 Rule15CrossingSituation
Crossingmaneuversaredescribedinrule15:
“Whentwopowerdrivenvesselsarecrossingsoasto
involve risk of collision, the vessel which has the
otheronherownstarboardsideshallkeepoutofthe
wayandshall,ifthecircumstancesofthe
caseadmit,
avoidcrossingaheadoftheothervessel.”
An example of simulation process of crossing
situation is presented on figure 11. Giveway vessel
749
starts with course 110° on the port side of the
encounteredvesselwhichisheadingnorth.
Figure11. An example of simultaneous positions of each
individualinpopulationofhelmsmenduringsimulation
Duringsimulationeachartificialhelmsmantriesto
safelynavigatethroughareatoagoal,andaccording
torule15,hetriestoavoidcrossingaheadoftheother
vesselduringmaneuvers.
Figure12. An example of recorded routes of a whole
population
There are some individuals that prefer relatively
safe circulation than risky maneuvering near
forbidden domain of the encountered vessel. Their
fitnessisslightlybetterthanonesleavingthearea,but
they are still too weak to compete with better units
duringselection.
Figure13.Finalrouteofthebestindividual
The results of simulation are presented as final
route of the best individual and its actions and
parametersinfigures1418.
Figure14.Suggestedandactualrudderangle
Figure15.Angularvelocity
Figure16.Linearvelocityinm/sandinknots
750
Figure17.Suggestedandactualrevolutions[rpm]
Figure18.Courseoverground
Shipofthatsizehas verylimitedmaneuverability,
what implies higher awareness of early orders of
changing her rudder deflection or/and propeller
thrust.Simulationresultsshowthatchangeofthrust
has no big impact of ships’ speed in considered
situations.Moreimportantis rudder angle, which is
limitedto±20°.
4
REMARKS
ManydiscussionsonCOLREGshaveraisedandbeen
continued since its creation and first submission
relatedtoitsapplication.Therearesomeproposalsof
improvements in the maritime education to reduce
the negative impacts during the implementation of
theCOLREGs(DemirelandBayer,2015).
Neuroevolutionary approach to ship handling
duringcrossoverorovertakingofencounteredvessel
may improve a quality of maneuvers and safety of
navigation. For the simulation study, mathematical
model of threedegreesoffreedom VLCC crude oil
tanker with the singlepropeller and singlerudder
hasbeenusedtotesttheperformanceofthesystem.
In
comparison to a classic state machine learning
algorithms(Łącki,2007)theartificialneuralnetworks
based on modified NEAT method may increase
complexity and performance of considered modelof
ship maneuvering according to COLREGs.
Neuroevolutionaryshiphandlingsystembringssome
valuablebenefitstothisapproach:
increaseofthesafety
ofnavigationby improving
the data analysis for decisionmaker during
maneuvers,
reduction of operating costs of vessels due to
reduced number of extreme and unnecessary
maneuvers,
minimizationoftheoccurrenceofhumanerrors,
reduction of the harmful impact of transport on
theenvironment
finding some new
solutions related to heuristic
characteristicsofneuroevolution.
It is important to notice that all these benefits in
neuroevolutionstrictlydependonproperadjustment
ofevolutionaryparametersandprocesses,thesizeof
ANNs population and the encoding methods of
signalsconsideredinservicedenvironment.
Successfulsimulationresultsencouragetofurther
research
of the neuroevolutionary methods with
additional disturbances from the influence of sea
waves, ocean currents and winds, for different ship
models,whichmaybesuccessfullyimplementedinto
advancednavigationalsystemstoincreasethesafety
ofnavigation.
REFERENCES
Demirel,E. &Bayer, D. 2015. The Further Studies OnThe
COLREGs (CollisionRegulations), TransNav,the
InternationalJournalonMarineNavigationandSafetyofSea
Transportation,vol.9,17–22.
Łącki, M. 2007. Machine Learning Algorithms in Decision
Making Support in Ship Handling. , KatowiceUstroń:
WKŁ,p.
Larkin, D., Kinane, A. & O’Connor, N. 2006. Towards
hardware acceleration of neuroevolution for multimedia
processing applications on mobile devices , Hong Kong,
China.
Lee, S., Yosinski, J., Glette, K., Lipson, H. & Clune J 2013.
EvolvinggaitsforphysicalrobotswiththeHyperNEAT
generative encoding: the benefits of simulation,
Applications
ofEvolutionaryComputing,.
Naeem, W., Irwin, G.W. & Aolei, Y. 2012. COLREGsbased
collisionavoidancestrategiesforunmannedsurfacevehicles,
Oxford,ROYAUMEUNI:Elsevier.
Nowak, A., Praczyk, T. & Szymak, P. 2008. Multiagent
system of autonomous underwater vehicles‐
preliminary report, Zeszty Naukowe Akademii Marynarki
Wojennej,vol.4,
99–108.
Pietrzykowski,Z.&Małujda,R.2012.Applicabilityoffuzzy
logic to the COLREG rules interpretation, Zeszyty
Naukowe/AkademiaMorskaWSzczecinie,109–114.
Stanley, K.O. & Risto, M. 2002. Efficient Reinforcement
LearningThroughEvolvingNeuralNetworkTopologies.
Stanley, K.O., Bryant, B.D. & Risto, M. 2005. Realtime
neuroevolution in the
NERO video game, IEEE
TransactionsonEvolutionaryComputation,vol.9,653–668.
Szłapczyński,R.&Szłapczyńska,J.2012. EvolutionarySets
of Safe Ship Trajectories: Evaluation of Individuals,
TransNav, the International Journal on Marine Navigation
andSafetyofSeaTransportation,vol.6,345–353.
Wang,T.,Yan,X.P.,
Wang,Y.& Wu,Q.2017.ShipDomain
Model for Multiship Collision Avoidance Decision
making with COLREGs Based on Artificial Potential
Field, TransNav, the International Journal on Marine
NavigationandSafetyofSeaTransportation,vol.11,85–92.