fulltext ais

Upload: iam-abhisek

Post on 07-Jul-2018

219 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/19/2019 FullText Ais

    1/108

    TitleArtificial immunity inspired cooperative failure recoveryframework for mobile multi-robot system

    Advisor(s) Lau, HYK

    Author(s) Chan, Ching-man; –sk ‡

    Citation

    Issued Date 2014

    URL http://hdl.handle.net/10722/206677

    RightsThe author retains all proprietary rights, (such as patent rights)and the right to use in future works.

  • 8/19/2019 FullText Ais

    2/108

     

    Artificial Immunity Inspired Cooperative

    Failure Recovery Framework for Mobile

    Multi-Robot System

    By

    C. M. Chan

    B. Sc. H.K.

    A thesis submitted in partial fulfilment of the requirements for

    the degree of Doctor of Philosophy

    at The University of Hong Kong

    June 2014 

  • 8/19/2019 FullText Ais

    3/108

    I

    Abstract of thesis entitled

    Artificial Immunity Inspired Cooperative Failure Recovery

    Framework for Mobile Multi-Robot System

    Submitted by

    C. M. Chan 

    for the degree of Doctor of Philosophy

    at The University of Hong Kong

    in June 2014

    Robots are sophisticated machines which are specially designed to have the

    capabilities to handle operations, on behalf of human, in many different scenarios. In

    the past decades, the design of robot systems has been evolving and there are

    increasing numbers of possible applications of robot. Some systems can even be able

    to overcome the individual limitations and handle complex problems by combining

    the strengths of multiple robots.

    To reduce the risk of human life, robots are now being put into missions under

    extremely dangerous or hazardous environment where human intervention is not

    tolerable , such as search-and-rescue missions inside damaged buildings after natural

    disasters and cleaning up of radioactive materials in nuclear accidence. Even though

    robots are dispensable, if they are damaged, disabled or trapped, the mission would

    not be accomplished. Therefore, the longevity of a robot system is always a challenge

    for robotic operations in such difficult environments.

    To tackle this challenge, many studies focus on improving the design of individual

    robot, minimizing the chance of robot failure; or the way that how functioning robots

    may share the job of the failed robots. The way that how other robots can help failed

    robots recover, however, has yet to be widely discussed.

  • 8/19/2019 FullText Ais

    4/108

    II

    This thesis studies the feasibility of having multi-robot system with different

    automatic cooperative recovery abilities on top of its primary functions. A novel

    cooperative recovery framework is proposed for generic control among system

     primary functions and recovery behaviours. A number of experiments have been done

    to study the influence of cooperative recovery on a multi-robot system and how it can

    affect the system in terms of system performance, sustainability and overhead. An

    Immunity-based cooperative recovery model has also been created to overcome the

    drawback introduced by cooperative recovery, finding a balance between the two

    system objective among system productivity and longevity.

    Two modified versions of cooperative recovery model are also included in this study

    to further maximize the system potential.

    .

    .(322 words)

  • 8/19/2019 FullText Ais

    5/108

    I

    Declaration 

    I declare that the thesis represents my own work, except where due

    acknowledgement is made, and that it has not been previously included in a thesis,dissertation or report submitted to this University or any other institution for a degree,

    diploma or other qualifications.

     __________________________

    C. M. Chan

    June 2014

  • 8/19/2019 FullText Ais

    6/108

    II

    Acknowledgements 

    I would like to express my most sincere gratitude and deepest appreciation to my

    supervisor, Dr. Henry Y. K. Lau, for all his encouragement, assistance andenthusiastic guidance throughout the course of my PhD study. Especially when I

    encounter the very difficult moments in my life, leaving of family members, he gave

    me invaluable chances, encouragements and inspirations which are not only essential

    to complete this thesis, but also courage on future personal development.

  • 8/19/2019 FullText Ais

    7/108

    III

    Table of Contents 

    Declaration ..................................................................................................................... I 

    Acknowledgements ....................................................................................................... II 

    Table of Contents ......................................................................................................... III 

    List of Figures .............................................................................................................. VI 

    List of Tables ............................................................................................................ VIII 

    List of Symbols ............................................................................................................ IX 

    Chapter 1 Introduction .............................................................................................. 1-1 

    1.1.  Goal of Thesis ............................................................................................. 1-3 

    1.2.  Contributions ............................................................................................... 1-3 

    1.3.  The Outline .................................................................................................. 1-4 

    Chapter 2 Literature Review ...................................................................................... 2-1 

    2.1.  Automatic Recovery on Robot Systems...................................................... 2-2 

    2.1.1.  Single Robot System and Recovery ........................................................ 2-2 

    2.1.1.1.  Assembly Robots ................................................................................. 2-2 

    2.1.1.2.  Mobile Robots ...................................................................................... 2-4 

    2.1.1.2.1.  Robot Recharging Mechanics .......................................................... 2-6 

    2.1.2.  Multi-Robot System and Recovery ......................................................... 2-8 

    2.1.2.1.  Task and Resources Re-allocation ....................................................... 2-8 

    2.1.2.2.  Replaceable Modules ........................................................................... 2-9 

    2.1.2.3.  Mobile Recharging Mechanics .......................................................... 2-11 

    2.1.2.4.  Inter-robot Docking ........................................................................... 2-12 

    2.1.2.5.  Communications Recovery ................................................................ 2-14 

    2.2.  Artificial Immunity ................................................................................... 2-15 

    2.2.1.  Human Immune System ........................................................................ 2-15 

  • 8/19/2019 FullText Ais

    8/108

    IV

    2.2.2.  Artificial Immune System ..................................................................... 2-18 

    2.2.2.1.  Immune Inspired Algorithms ............................................................. 2-18 

    2.2.2.1.1.   Negative Selection theory .............................................................. 2-18 

    2.2.2.1.2.  Clonal selection theory ................................................................... 2-19 

    2.2.2.1.3.  Immune network theory ................................................................. 2-21 

    2.2.2.2.  Immune-Based Single Robot Control ................................................ 2-24 

    2.2.2.3.  Immune-Based Multi-Agent System Control .................................... 2-25 

    2.3.  Summary ................................................................................................... 2-30 

    Chapter 3 Cooperative Recovery Model.................................................................... 3-1 

    3.1.  Cooperative Failure Recovery ..................................................................... 3-1 

    3.2.  Recoverable Mechanics............................................................................... 3-2 

    3.2.1.  Parts Replacing ........................................................................................ 3-3 

    3.2.2.  Strength Combining ................................................................................. 3-3 

    3.3.  Different roles of robots .............................................................................. 3-4 

    3.4.  The Cooperative Recovery Trade-off Problem ........................................... 3-6 

    3.5.  Immunity Inspired Recovery Model ........................................................... 3-8 

    3.5.1.  A Two-level Response Architecture........................................................ 3-9 

    3.5.2.  The Control System ............................................................................... 3-12 

    3.5.3.  Design of Antibodies ............................................................................. 3-14 

    3.5.4.  Classification of Recovery Operations .................................................. 3-16 

    3.5.5.  Behavioural Decision Making Algorithm ............................................. 3-17 

    3.5.6.  Favourability Functions of Operations .................................................. 3-21 

    3.5.6.1.  Productive Function ........................................................................... 3-21 

    3.5.6.2.  Recovery Function ............................................................................. 3-22 

    3.5.6.3.  Cost Function ..................................................................................... 3-23 

    3.6.  Experiments and Analysis ......................................................................... 3-24 

    3.6.1.  Influence of Cooperative Recovery on Multi-Robot System ................ 3-24 

  • 8/19/2019 FullText Ais

    9/108

    V

    3.6.2.  On-site Cooperative Recovery Mechanics ............................................ 3-29 

    3.7.  Summary ................................................................................................... 3-31 

    Chapter 4 Sacrificing Cooperative Recovery Model ................................................ 4-1 

    4.1.  Concept of Sacrifice .................................................................................... 4-2 

    4.2.  Resources Sharing Recoverable Mechanics ................................................ 4-3 

    4.3.  Role of the Grey Robot ............................................................................... 4-4 

    4.4.  Cost Function for Sacrificing Recovery ...................................................... 4-5 

    4.5.  Experiments and Analysis ........................................................................... 4-6 

    4.6.  Summary ..................................................................................................... 4-7 

    Chapter 5 Productivity Boost Recovery Model ......................................................... 5-1 

    5.1.  Sustainability Confidence and Productivity ................................................ 5-2 

    5.2.  Experiments and Analysis ........................................................................... 5-3 

    5.3.  Summary ..................................................................................................... 5-6 

    Chapter 6 Conclusions ............................................................................................... 6-1 

    5.4.  Research Summary ...................................................................................... 6-1 

    5.5.  Future Works ............................................................................................... 6-4 

    References ................................................................................................................... R-i 

  • 8/19/2019 FullText Ais

    10/108

    VI

    List of Figures

    Figure 1 Photography of the snake robot which recover from damage on muscle wires

    (Mahdavi and Bentley 2003) ................................................................... 2-4 Figure 2 Quadrupedal robot (top) and hexapedal robot (bottom) used in (Bongard and

    Lipson 2004a, 2004b), ............................................................................. 2-5 

    Figure 3 Photography of the recharging station (top) and corresponding docking

    mechanism (bottom) in Silverman et al. (2002, 2003) ............................ 2-6 

    Figure 4 Robot which autonomously plugging itself into electricity slot (Meeussen et

    al. 2010) ................................................................................................... 2-7 

    Figure 5 Self-repair process of modular reconfigurable robots (Murata et al. 2001) ....

    …………………………………………………………………………...2-9 

    Figure 6 Photography of FRACTUM robot doing Self-repairing (Yoshida et al. 2003)

    .................................................................................................................. 2-9 

    Figure 7 Vision of self-sufficient robot colonies (Bererton and Khosla 2001) ...... 2-10 

    Figure 8 The “baby” robots (top), “mother” robot with recharge and docking station

    mounted on it body (bottom) (Drenner et al. 2013) .............................. 2-11 

    Figure 9 JL-2 docking manipulator and grasping modules (Wang, et al. 2010) ..... 2-12 

    Figure 10 JL-2 lifting up the middle robot (Wang, et al. 2010)............................... 2-13 

    Figure 11 Robot team with robot 1 serving as a relay node and maintaining line of

    sight with robot 0 and 2 during a mission where global communication in

    not available (Sweeney et al. 2002) ...................................................... 2-14 

    Figure 12 Different Immune Cells in Human Immune System ............................... 2-16 

    Figure 13 Concept of idiotypic networks................................................................. 2-21 

    Figure 14 Immunoid robot in experiment using immune network based decision

    making (Ishiguro, Watanabe et al. 1996). ............................................ 2-24 

    Figure 15 Functional Modules Architecture by Dasgupta (1998) ........................... 2-26 

    Figure 16 The General Suppression Control Framework by Ko et al. (2004) ......... 2-27 

    Figure 17 Architecture of control framework for individual AIS agents by Lau (2006)

    ................................................................................................................ 2-28 

    Figure 18 The change in robot state and role ............................................................. 3-5 

    Figure 19 The performance of system drops when recovery operations in place ..... 3-7 

  • 8/19/2019 FullText Ais

    11/108

    VII

    Figure 20 The two-level defence mechanism in human immune system and proposed

    model...................................................................................................... 3-10 

    Figure 21 Flow of operation from perception to behaviours ................................... 3-13 

    Figure 22 Design of N-Cell and S-Cell.................................................................... 3-14 

    Figure 23 A real Pioneer I robot (left) and a virtualized Pioneer robot in Player/stage

    ................................................................................................................ 3-24 

    Figure 24 Ten robots working in the simulation environment with stochastic damage

    rate on its wheel inflicted by hazardous surface, in 2D and 3D View ... 3-25 

    Figure 25 System Life and productivity of different setups in experiment A ......... 3-27 

    Figure 26 Experiment setup A2 at 600 decision cycles (top) A3 at 500 decision cycles

    (bottom).................................................................................................. 3-28 

    Figure 27 Result of Experiment B with additional recovery mechanism ................ 3-30 

    Figure 28 The robots transforming into Grey robot................................................... 4-4 

    Figure 29 Result of Experiment C with the sacrificing recovery mechanism

    comparing with Setups A5 and B ............................................................ 4-6 

    Figure 30 Result of the Productivity Boost Recovery Model with setup A5, B, C1 &

    C2 ............................................................................................................. 5-3 

    Figure 31 The average maximum continuous productive time in A5, B, C1 & C2 .. 5-4 

    Figure 32 Area coverage of Setup A (i) without Boost and (ii) with Boost; Setup B (iii)

    without Boost and (iv) with Boost. .......................................................... 5-5 

  • 8/19/2019 FullText Ais

    12/108

    VIII

    List of Tables

    Table 1 Recovery Strategies used in (Baydar and Saitou 2004) ............................... 2-3 

    Table 2 Communication Recovery Behaviour in (Ulam and Arkin 2004) .............. 2-14 

    Table 3. Colour of role in particular roles for illustration purpose ............................ 3-4 

    Table 4. Relationships between the proposed recovery model and the immune system

    .................................................................................................................................. 3-11 

    Table 5. Difference among setups in experiment A ................................................ 3-26 

    Table 6. Refined role models for Sacrificing Cooperative Recovery ........................ 4-4 

  • 8/19/2019 FullText Ais

    13/108

    IX

    List of Symbols

      actions of robots

      normal actions

      sustainment actions

      failure recovery actions

      mandatory maintenance operations

      sustainment response

      normal responses

      self-sustainment response

      cooperative sustainment response

     p population size

      sustainability ratio

    n number of N-cell being selected

    s number of S-cell being selected

      sustainability level

      estimated health of the robot α    maximum possible health of the robot α 

      number of working robots within distance d from robot  

      number of failed robots within distance d from robot  

      favourability of an antibody x

  • 8/19/2019 FullText Ais

    14/108

    X

      expected productive value of an antibody x

      number of favourable memory of an antibody x

      number of unfavourable memory of an antibody x

      influence from other triggered antibodies on an antibody x

      stimulation from antibody y to antibody x

      suppression of antibody z to antibody x

      expected gain in production value after the response

      expected loss of production during the recovery

      favourable threshold of the robot α 

  • 8/19/2019 FullText Ais

    15/108

    1-1

    Chapter 1

    Introduction

    One major motivation to develop autonomous robotic solutions is to reduce the need

    for human presence in hazardous environments and to relieve humans from

     performing highly repetitive manoeuvres (Matarić, 2007). Especially after several

    serious disasters, such as the World Trade Centre Collapse and Tohoku Fukushima

    nuclear event, even more attention has been focused on the robot system which can be

    deployed and work in extremely dangerous and unpredictable situations (Murphy2012, NHK 2013).

    Most traditional approaches to build autonomous robotic solutions involve creating a

    single “mighty” robot with capabilities necessary to accomplish the entire mission on

    its own. For small-scale applications, this is by far the most efficient means of system

    design; yet as task complexity increases, such as real world applications that are

    inherently distributed in time, space and functionality, single robot design has

    significant limitations as well as a single point of failure if the robot is disabled for

    any reasons. A multi-agent concept has therefore emerged as the solution to this

     problem (Parker 2003).

    In a multi-agent based robot system, also known as a multi-robot system, a large-scale

    mission is decomposed and sub-missions are allocated to a team of robots. These

    robots inter-coordinate to support and complement each other, performing tasks and

    interchanging duties to achieve a common goal for system-wide mission

    accomplishment. Many studies show that this modelling concept of multi-agent

    systems (MAS) is very useful and efficient for complex robotic operations such as

    firefighting (Cohen et al. 1990, Liao 2005, Yan-Tao et al. 2009), hazardous waste

    clean-up (Kurabayashi et al. 1996, Parker 1998, Jager and Nebel 2002), search and

    rescue (Jennings et al. 1997, Nourbakhsh, Sycara et al. 2005, Baxter, Burke et al.

    2007), planetary exploration (Mataric and Sukhatme 2001, Huntsberger et al. 2003)

    and military reconnaissance (Rybski et al. 2000, Wagner and Arkin 2004, Kolling and

    Carpin 2008). However, in spite of all the advantages MAS has to offer, the

  • 8/19/2019 FullText Ais

    16/108

    1-2

    effectiveness of the multi-robot system is still hard to control if its robots cannot

    function. The presence of unanticipated circumstances and dangers in the many real

    world environments can always deviates the system from its expected behaviour,

    disabling the robots or even rendering the system to a complete failure. This poses a

    very important problem of effective and efficient failure management among the

    multi-robot system designs to cope with real world applications.

    Many studies have been done on the recovery of robot system in past decades. Some

    works focused on fault tolerance and task reallocation after robot failure occurs (Toye

    and Leifer 1994, Visinsky et al. 1994, Parker 1998); some studies even suggested

    integrations of automatic recovery (Gini and Gini 1984, Portugal and Rocha 2013),

    including robots with ability to repair and reconfigure itself (Mahdavi and Bentley

    2003, Bongard and Lipson 2004). However, the area of automatic cooperative failure

    recovery, such as having one of the robots actually helps in the action of recovering

    other failed robots, is yet to be fully explored.

    There are two main problems in developing a cooperative failure recovery-enabled

    robot system. First, the robotic mechanism, which allows the actual failure recovery

    interaction among robots, would greatly increase the complexity of the robot design.

    This mean the cost invested on developing the robot would be increased. Second,

    even if the mechanism is in place, it is hard to coordinate the robot and make the

    system cost-effective. The overall system productivity can be further reduced due to

    the introduction of those recovery mechanisms, as the system production rate which

    already suffering from robot failure would be further slowed down when having a

    functioning to participant in the recovery of a failed robot. A good system control

    strategy is required in order to make the automatic cooperative recovery failure

    favourable.

    This thesis focuses on the highly tolerant and adaptive cooperative failure recovery

    models, which are inspired by the biological immune system. The theoretical

    underpinning of the scheme is based on the formalism of the immune network theory

    inspired from artificial immune systems (AIS), which is an engineering analogy of the

    human immune system and a powerful artificial intelligence (AI) methodology.

    Previous studies (Dasgupta 1998, Lau and Wong 2006, Lu and Lau 2009) have shown

    that AIS offer the favourable features of self-regulation and self-organization,

  • 8/19/2019 FullText Ais

    17/108

    1-3

    adaptation and coordination for providing homeostatic control to a robotic system and

     producing the best solution to a problem in a distributed manner.

    The proposed methodology helps to maintain multi-robot system sustainability and

    dependability in a cost-effective manner; also examine how the incorporating of

    cooperative recovery may influence the future of multi-robot system, unleashing

    overall system potential in stochastic environments.

    1.1.  Goal of Thesis

    The objective of this research is to provide a decent motivation and advocate the

    development of diverse cooperative recovery strategies in future multi-robot system

    through the establishment of the universal cooperative failure recovery models.

    1.2.  Contributions

    The contributions of this thesis are:

      The general and unique recovery model for cooperative recovery of

    multi-robot system

      A control framework for balancing the benefits and overhead which

    introduced recovery strategies

      The construction of scarifying model for integration sophisticated

    cooperative recovery methodology

      The demonstration of possible performance boost in respectable

    confidence levels

  • 8/19/2019 FullText Ais

    18/108

    1-4

    1.3. 

    The Outline

    The thesis proceeds as follows:

    Chapter 2  provides background for this study by looking into the potential robot

    recovery mechanisms, artificial immune system and the previous work done on multi-

    robot system using immunity inspired intelligence.

    Chapter 3 describes the proposed Basic Cooperative Recovery Model (BCRM) and

    how cooperative recovery is achievable, as well as evaluating its possible potential.

    Chapter 4  proposes the modified version of the cooperative recovery model which

    incorporated with the concept of sacrifice. This Scarifying Cooperative Recovery

    Model (SCRM) allows good of sacrifice

    Chapter 5 introduce a modified version of the cooperative recovery model which

    specially designed to maximize the amount time a robot can stay on the field before

    each returning.

    Chapter 6 summarizes and concludes the findings of this research and future works.

  • 8/19/2019 FullText Ais

    19/108

    2-1

    Chapter 2

    Literature Review

    This chapter reviews the projects and studies which influence the model proposed in

    this thesis. This section can be separated into two sections: the first one is to reviews

    the development of robot recovery and related mechanisms which can be applied to

    the proposed new recovery model; the second one is to review several related

     biological immunity inspired artificial intelligence which provide inspirations to this

    study.

    As the focus of this study is failure recovery, the development failure detection would

    not be reviewed here. Section 2.1would begin with the automatic recovery mechanism

    in single robots, and then followed with additional automatic recovery mechanism

    only multi-robot system can perform.

    Artificial immunity is the key inspiration of this study, Section 2.2 would show some

     background of immunity algorithms and related studies of robot systems which

    applying the artificial immunity.

  • 8/19/2019 FullText Ais

    20/108

    2-2

    2.1. 

     Automatic Recovery on Robot Systems

    Failure recovery of robot system is complicated and system dependent. A single

    recovery method may involve multiple problem domains. Not every system failure

    has an existing recovery mechanism, yet the recovery mechanisms are slowly building

    up.

    The followings are collection of research studies with possible mechanism and related

    works which maybe applicable to implement cooperative recovery.

    2.1.1. 

    Single Robot System and Recovery

    Srinivas (1977) may have been the first study on error recovery of robot

    system, he proposed a method to diagnose the system situation using a pre-

    defined decision tree. After that, research studies on automatic recovery start

    getting more attention since 80’s and 90’s. 

    2.1.1.1.  Assembly Robots

    The earliest research studies on automatic robot recovery mainly focus on

    error detection and error diagnosis of internal system failure in single robot

    systems for manufacturing and assembly production line. Their motivations

    are to reduce the production defections and maximize the profit. The failure

    recovery operations can be done on programming level (Gini and Gini 1984,

    Kao 1995) along with real time monitoring (Smith and Gini 1986, Kim and

    Welch 1989).

  • 8/19/2019 FullText Ais

    21/108

    2-3

    Later on, attention starts drawing on the automatic generation of recovery

    knowledge with machine learning, such as the framework being introduced to

    generating automatic recovery plan by Evans and Lee (1994). The framework

    offers quickly correction of known errors and automatically learns to recover

    new errors without human intervention. However, that was designed for stasis

    environment and not much detail about actual recovery strategies has been

    discussed there.

    Bavdar & Saitou (2004) proposed the first off-line virtual diagnosis and

    recovery method using Bayesian reasoning and genetic programming which

    helps to reduce the downtime of assembly robot systems. 

    Table  1  shows the five recovery strategies being applied in its

    experiments. Although the strategies are simple, it still gives examples of

     possible recovery strategies for assembly robot.

    Table 1 Recovery Strategies used in (Baydar and Saitou 2004) 

    Problem Strategy

    Grasping error Try grasp again or release again

    Collision error Collision recovery

    Sensor failure Call maintenance

    Misplacement error Pickup and replace the part

    Flawed parts Dispose the part

  • 8/19/2019 FullText Ais

    22/108

    2-4

    2.1.1.2. 

    Mobile Robots

    There are more recovery mechanics usable for robot recovery. Figure 1 shows

    a snake robot designed by Mahdavi & Bentley (2003, 2006) which can recover

    from damage on its muscle with the evolutionary algorithm and shape memoryalloy muscle. This robot can continue locomotion by evolving new movement

    strategy even some of its muscle wires are damaged. By using this approach, it

    is feasible to develop robots arms and robot legs which can self-recover from

    damages.

    Figure 1 Photography of the snake robot which recover from damage on muscle wires

    (Mahdavi and Bentley 2003)

    In the study of Bongard & Lipson (2004a, 2004b), there even suggested an

    estimation-exploration algorithm which allows a quadrupedal robot and

    hexapedal robot to continue walking with presence of broken legs, jamming

     joints and failed angle sensors. Figure 2 shows the photography of simulated

    robots used by Bongard & Lipson, where M are the positions for motorized

     joints, A are angle sensors and T are touch sensors.

    There are also electronic circuit for robots which can be recovering from

    damage according to the study from Keymeulen et al. (2000).

  • 8/19/2019 FullText Ais

    23/108

    2-5

    Figure 2  Quadrupedal robot (top) and hexapedal robot (bottom) used in (Bongard and

    Lipson 2004a, 2004b),

  • 8/19/2019 FullText Ais

    24/108

    2-6

    2.1.1.2.1. 

    Robot Recharging Mechanics

    Power failure is another issue which always bring troubles to autonomous

    mobile robot. Besides having backup batteries, Silverman et al. have

    developed a recharging station and corresponding docking mechanism in(Silverman et al. 2002, 2003) which allow Pioneer 2DX robots to recharge

    autonomously. Figure 3 shows a photography of a Pioneer 2DX robot going to

     be docked at the charging station. Another similar vision-based self-docking

    and recharging can be found in studies of Kartoun et al. (2006).

    Figure 3 Photography of the recharging station (top)

    and corresponding docking mechanism (bottom) in Silverman et al. (2002, 2003)

  • 8/19/2019 FullText Ais

    25/108

    2-7

    By integration robotic arm and vision sensors, Meeussen et al. (2010) even can

    make a robot to open doors in office environment and plug itself to into the

    electricity socket for power recharging. Figure 4 shows their experimental

    robot opening office door and searching for electricity slot to recharge itself.

    Figure 4 Robot which autonomously plugging itself into electricity slot (Meeussen et al. 2010)

  • 8/19/2019 FullText Ais

    26/108

    2-8

    2.1.2. 

    Multi-Robot System and Recovery

    Since the advent of distributed robotics in the late 1980s, extensive research

    has been carried out on different aspects of multi-robot systems (Parker 2003).

    The majority of studies include task planning (Alami et al. 1998), inter-robot

    communication and decision making (Molnár and Starke 2000), formation

    control (Gu and Hu 2009), localization (Roumeliotis and Bekey 2002),

    exploration (Howard et al. 2006), motion coordination (Yamaguchi 2003),

    task allocation (Strens and Windelinckx 2005) and learning (Fernandez et al.

    2005). Although not many of these studies directly address the problem of

    failure recovery, there are some related studies carried out under the broad

    topic of fault tolerance.

    2.1.2.1.  Task and Resources Re-allocation

    Parker (1998) proposed a widely adopted software architecture, known as

    ALLIANCE, for fault tolerance control among heterogeneous multi-robot

    systems, with its capabilities evaluated in later works (Parker 2000, 2001).

    Gerkey and Mataric (2002) then discussed the problems of ALLIANCE and

    introduced certain provisions for robot failure with “Pusher -Watcher”

    experiments. Dias et al. (2004) proposed a multi-robot coordination approach

    to minimize the performance degradation for three categories of robot failure:

    communication failure, partial robot malfunctions and robot death. These

    research studies focus mainly on the reallocation of tasks and resources after

    robot failure occurred. Although they proposed different way on failure

    handling, they are not directly related to robot failure recovery strategies.

  • 8/19/2019 FullText Ais

    27/108

    2-9

    2.1.2.2. 

    Replaceable Modules

    A more direct approach for robot failure recovery is building the robot system

    a self-repairing mechanical system (Murata et al. 2001), which can repair itself

    without any human intervention. As illustrated in Figure 5,  this involving

    spare units of the robot being transformed and replacing the damaged units.

    This approach can be realised using modular reconfigurable robots such as

    CONRO (Castano et al. 2000), Polybot (Yim et al. 2000), and lattice type

    reconfigurable robots (McGray and Rus 1998).

    Figure 5 Self-repair process of modular reconfigurable robots (Murata et al. 2001)

    Figure 6 showing a group of FRACTUM reconfigurable robots by Yoshida et

    al. (2003) doing self-repairing, cutting off the failed units and use spare units

    to replace and fill up the original position of the failed one.

    Figure 6 Photography of FRACTUM robot doing Self-repairing (Yoshida et al. 2003)

  • 8/19/2019 FullText Ais

    28/108

    2-10

    Apart from self-repairing, another direct approach to robot failure recovery

    would be cooperative repairing. Bererton and Khosla (2001, 2002) visioning

    the building of a self-sufficient robot colonies on distant plants using robots

    with repair capabilities showing in Figure 7. They proposed a repairable robot

    team, which promote module replacement when failures occur, such as

    scavenging usable modules from failed robots and use them to repair future

    failures.

    Although human operators were still required in their experiment, their studies

     placed one step toward cooperative repairing.

    Figure 7 Vision of self-sufficient robot colonies (Bererton and Khosla 2001)

  • 8/19/2019 FullText Ais

    29/108

    2-11

    2.1.2.3. 

    Mobile Recharging Mechanics

    For the recharging problem, instead of having a single robot returning to a

    charging station and recharge itself, multi-robot system can take other

    approach to tackle the problem of recharging. Zebrowski and Vaughan (2005)

    and Couture-Beil and Vaughan (2009) suggested using a specialised “tanker”

    robot in the field to help recharge other robots in the team. This approach

    allow the robot to stay more time on working, reducing the time spent on

    travelling back and forth from base.

    Another approach is to build the multi-robot system as a marsupial robot team

    (Murphy et al. 1999), where smaller “baby” robots are deployed from a big

    “mother” robot. Drawing on the marsupial concept, Drenner et al. (2013) even

    designed a mobile recharging marsupial robot team for exploring mission.

    Figure 8 showing the photography of the marsupial robots with docking

    mechanics which can recharge multiple robots at the same time. When the

    mother robot moving around in the workplace, the baby robots can dock and

    recharge from it.

    Figure 8 The “baby” robots (top), “mother” robot with recharge and docking station

    mounted on it body (bottom) (Drenner et al. 2013)

  • 8/19/2019 FullText Ais

    30/108

    2-12

    2.1.2.4.  Inter-robot Docking

    The ability for robot to grasp and dock with other robots can also be useful in

    many situations. While developing new reconfigurable robot, several studies

    has introduced docking and manipulating capabilities to mobile multi-robot

    system (Zong et al. 2006, Li et al. 2008, Wang et al. 2010). They designed a

    docking manipulator and grasping modes in JL-2, which allow mobile robots

    to grasp and dock with each other. Figure 9 shows the design of gripper and

    dock in JL-2.

    Figure 9 JL-2 docking manipulator and grasping modules (Wang, et al. 2010)

  • 8/19/2019 FullText Ais

    31/108

    2-13

    In Figure 10 a robot in the middle is being lifted up by robots from two sides,

    where G, NG, θ and M represents the weight of the robot, supporting force

    from ground, angle and torques of motors respectively. This mechanism can

     be useful in retrieve damaged robots or rescuing robots which being flipped

    over or trapped on the ground. 

    Figure 10 JL-2 lifting up the middle robot (Wang, et al. 2010)

  • 8/19/2019 FullText Ais

    32/108

    2-14

    2.1.2.5. 

    Communications Recovery

    In multi-robot systems, if robots are not able to communicate with each other,

    it would be hard for them to coordinate their operations. Therefore, there are

    research stuides focusing on the alternative means of communication, such as

    maintaining line of sight networks in (Arkin and Diaz 2002, Sweeney et al.

    2002, Anderson et al. 2003, Nguyen et al. 2003). This kind of studies tries to

     provide the robot team an alternative communication channel in case the

    global primary communication channel cannot be used.

    Figure 11 Robot team with robot 1 serving as a relay node and maintaining line of sight

    with robot 0 and  2 during a mission where global communication in not available

    (Sweeney et al. 2002)

    However, if there is not backup communication method can be used, Ulam &

    Arkin (2004) proposed a set of communication recovery behaviours, such as

    moving from ground surface to higher location and moving to nearest

    neighbour, for individual robots in the system to reactive when

    communication goes wrong. Table 2 shows the four basic recovery elements

    used to form combination of recovery behaviour.

    Table 2 Communication Recovery Behaviour in (Ulam and Arkin 2004)

    Communication Recovery Behaviour Rationale

    Retro-traverse Moving back to previous location for regaining

    communication

    Move-To-Nearest-Neighbor Maintaining line of sight

    Move-To-Higher-Ground Higher surface trends to have better reception

    Probe Moving away from area with reception blocked

  • 8/19/2019 FullText Ais

    33/108

    2-15

    2.2. 

     Artificial Immunity

    Human immune system (HIS) is a complex, self-protected and self-maintained

    cooperative defense system in human body (Dasgupta 1999, Ishida 2004) that ensures

    the survival of mankind for thousands years by protecting human body from the

    invasion of foreign substances such as bacteria and virus. Behind this well-composed

    system inside our body, there are some sophisticated cooperative mechanisms that

    cooperate the trillions of immune cells to response to unpredictable invasion and

    attack of pathogens rapidly. By extracting the concepts behind HIS, we can adopt

    them as metaphors and engineering paradigms, to solve different problems in real life,

    such as negative selection (Forrest et al. 1994),, clonal selection theory (Burnet 1959)

    and immune network theory (Farmer et al. 1986).

    2.2.1. Human Immune System

    HIS protects our body against pathogens by distinguishing, eliminating and

    memorizing foreign cell and macromolecules, it involves two-layer line of defense,

    the innate defense and the adaptive defense (Sompayrac 1999).

    Innate immune defense serves as the first barrier which tries to stop pathogens

    entering our body and minimizes the chance of being infected; while the adaptive

    immune defense activates if pathogens evade from innate response and produce

    antibodies to fight and recover from infection (Purves et al. 2001). There many type

    immune cells in human body. Figure 12 shows key categories of cells in the two

    defense layer.

  • 8/19/2019 FullText Ais

    34/108

    2-16

    Figure 12 Different Immune Cells in Human Immune System

    Innate

    System

    Adaptive

    System

    Phagocytes

    Macrophages

    Monocytes

    Dendritic Cells

     Neutrophils

    Eosinophil

    Basophils

    Lymphocyte

    T-Cell

    B-Cell

    Memory T

    Killer T

    Helper T

    Memory B

    Plasma Cell

  • 8/19/2019 FullText Ais

    35/108

    2-17

    Innate defense provides a general, non-specific resistance to a variety of invasions,

    throughout their first exposure to human body by eliminating macromolecules which

    identified as foreign pathogen (non-self). The macrophages are cells which eliminate

    and eat pathogen.

    Adaptive defense provides specific response which recognizes any pathogens which

    cannot be removed by the innate defense. The plasma cell would create antibodies

    which target specific pathogens. On the surface of antibodies, there are some special

     binding areas known as paratopes and idiotypes. Paratopes are structures that allow

    the antibodies to identify the non-self and react correspondingly; idiotypes are

    structures that allow co-stimulation between antibodies to regulate immune response.

    Antibodies can circulate through the blood and lymph systems in our body to respond

    with antigens in a distributed cooperative manner (Male et al. 2006). After eliminating

    the pathogens, some T cells and B Cells will become memory cells for rapidly

    reaction in future reencounter of same pathogens. The rapidly response is known as

    the secondary response.

  • 8/19/2019 FullText Ais

    36/108

    2-18

    2.2.2. 

     Artificial Immune System

    The intelligent systems using this immune-based metaphor are known as Artificial

    Immune Systems (AIS).

    2.2.2.1. 

    Immune Inspired Algorithms

    The following sections provide some background on three immune inspired

    algorithms, namely Negative Selection theory, Clonal selection theory and Immune

    network theory.

    2.2.2.1.1.  Negative Selection theory

    To avoid the immune cell attacking other cells inside human body, the lymphocytes

    need to go through a clonal deletion process before they become mature and leave the

    thymus. In the clonal deletion, antigen presenting cell would present self-antigens to

    lymphocytes, if the lymphocytes react to self-antigens, they would be deleted from the

     body, leaving only lymphocytes which would not attack own human body. This

    mechanism is called negative selection. Negative selection algorithms are inspired by

    this mechanism.

  • 8/19/2019 FullText Ais

    37/108

    2-19

    The negative selection algorithm was first proposed by Forrest et al. (1994). It is used

    to detect data modifications caused by computer a virus in and malicious software.

    This algorithm use a set of self-strings, S to express the normal state of a system. then

    generate a set of detectors, D , which only react to the complement of S . By applying

    these detectors to new, this algorithm can classify whether the data has been

    manipulated. Below are the outlines of negative selection algorithm (Aitkin et al.

    2013).

    input : S = set of seen known self-elements

    output : D = set of generated detectors

    begin

    repeat

    Randomly generate potential detectors and place them in a set P

    Determine affinity of each member of P with each member of self-set S

    If any element in S recognises a detector in P with recognition threshold,

    then the detector is rejected, otherwise it is added to D

    until Stopping criteria has been met

    end

    2.2.2.1.2. 

    Clonal selection theory

    Another popular immune inspiration mechanism is clonal selection theory. This

    theory draws on the clonal expansion and affinity maturation of B-cell.

  • 8/19/2019 FullText Ais

    38/108

    2-20

    While ensuring that only lymphocytes specific to the antigen on pathogen are cloned

    and produced in large numbers, there are somatic hypermutations occur during clonal

    expansion to introduce random changes to the specificity of lymphocytes, increases

    the repertoire diversity. The proliferation level of the lymphocytes is directly

     proportional to its affinity with the antigen it binds, and the mutation level of the

     proportional is inversely proportional to this affinity.

    The clonal selection inspired algorithms are often applied together with the concept

    memory cells, making it useful in computational optimization and pattern recognition.

    One popular clonal selection inspired algorithm is CLONALG (Cutello et al. 2005).

    Below are the outlines of CLONALG algorithm (Aitkin et al. 2013).

    input : S = set of patterns to be recognised,n = the number of worst elements to select for removal

    output : M = set of memory detectors capable of classifying unseen patterns

    begin

    Create an initial random set of antibodies, A

    forall patterns in S do

    Determine the affinity with each antibody in A

    Generate clones of a subset of A with the highest affinity.

    The number of clones is proportional to its affinity

    Mutate attributes of these clones to the set A , and place a copy of

    the highest affinity antibodies in A into the memory set, M

    Replace the n lowest affinity antibodies in A with new randomly

    generated antibodies

    end

    end

  • 8/19/2019 FullText Ais

    39/108

    2-21

    2.2.2.1.3. 

    Immune network theory

    The immune network theory is proposed by Jerne (1974), which also known an

    idiotypic network. This theory suggests that instead of being isolated, lymphocytes

    interact with each other and form self-regulated network even in the absence of

    antigens. Lymphocytes can recognize both antigens and other lymphocytes.

    Figure 13 Concept of idiotypic networks

    Stimulation

    Suppression 

    Antigen 

    Epitope 

    Paratope 1 

    Idiotope 1 

    Paratope 2 

    Idiotope 2 

    Idiotope 3 

    Paratope 3 

    B-Cell 2 

    B-Cell 1 

    B-Cell 3 

  • 8/19/2019 FullText Ais

    40/108

    2-22

    The concept of idiotypic network is shown in Figure 13,  where the epitope is the

    antigen element and the paratope is the recognition part of lymphocytes. Idiotope is a

    special part on the lymphocytes which presents the antigenic-like element of an

    lymphocyte and can be recognized by other lymphocytes.

    While epitope of antigen binds and matches with paratope of lymphocyte, the idiotope

    lymphocyte binds and matches with paratope of other lymphocytes. In the Figure 13, 

    the antigen with epitope matching with B-cell’s paratope stimulates B-Cell-1. B-Cell-

    1 stimulates B-Cell-2 as Paratope-2 binds to Idiotope-1. B-Cell-1 suppresses B-Cell-3

    as Paratope-1 binds to Idiotope-3. These suppressions and stimulations allow the

    immune system having interactions with itself even without the existence of antigen.

  • 8/19/2019 FullText Ais

    41/108

    2-23

    The immune network theory display emergent properties, such as self-tolerance,

    learning and memory. Below are the outlines of a pattern recognizing algorithm

    inspired by immune network theory (Aitkin et al. 2013).

    input : S = set of patterns to be recognised,

    nt = network affinity threshold,

    ct = clonal pool threshold,

    h = number of highest affinity clones,

    a = number of new antibodies to introduce

    output : N = set of memory detectors capable of classifying unseen patterns

    begin

    Create an initial random set of network antibodies, N

    repeat

    forall patterns in S do

    Determine the affinity with each antibody in N

    Generate clones of a subset of the antibodies in N with the highest

    affinity. The number of clones for an antibody is proportional

    to its affinity

    Mutate attributes of these clones to the set A , a and place h

    number of the highest affinity clones into clonal memory set, C

    Eliminate all elements of C whose affinity with the antigen is

    less than a predefined threshold ct

    Determine the affinity amongst all the antibodies in C

    and eliminate those antibodies whose affinity with each

    other is less than the threshold ct

    Incorporate the remaining clones of C into N

    end

    Determine the affinity between each pair of antibodies in N and

    eliminate all antibodies whose affinity is less than the threshold nt

    Introduce a random number of randomly generated antibodies

    and place into N

    end until a stopping criteria has been met

    end

  • 8/19/2019 FullText Ais

    42/108

    2-24

    2.2.2.2. 

    Immune-Based Single Robot Control

    Immune network theory is useful in robot behaviours control. Early as 90’s, Ishiguro

    et al. show the possibility of controlling autonomous robot’s behaviours using

    idiotypic network (Ishiguro, Kondo et al. 1995, Ishiguro, Shirai et al. 1996, Ishiguro,

    Watanabe et al. 1996). The actions of robots are formulated and decided by the

    concentration of antibodies. The immune network gives the system capacity to learn

    and let the robot perform better decision after evolution. Figure 14 shows

     photography of the practical robot (Ishiguro, Watanabe et al. 1996). The antibodies

    help the robot to decide the energy level to trigger recharge and avoid over charge.

    Figure 14 Immunoid robot in experiment using immune network based decision making(Ishiguro, Watanabe et al. 1996). 

    Similar methods applied by Vargas et al. (2003a and 2003b). These studies also show

    the idiotypic network can be applied to autonomous robot navigation. The immunenetwork even can help robot to escape from dead-end trap (Luh and Liu 2008) or path

     planning (Duan et al. 2004).

  • 8/19/2019 FullText Ais

    43/108

    2-25

    2.2.2.3. 

    Immune-Based Multi-Agent System Control

    The theoretical framework of AIS has been broadly studied in the field of Artificial

    Intelligence (AI) (Dasgupta 1999, De Castro and Timmis 2002, Ji and Dasgupta 2003,

    Ishida 2004, Timmis 2007, Timmis et al. 2008). Several studies (Dasgupta 1998, Lau

    and Wong 2003, Ko et al. 2004, Lau and Wong 2004, Lau and Wong 2006, Lu and

    Lau 2009) concerning distributed multi-agent system control have been carried out

     previously and shown the AIS-based control framework having desirable performance

    and flexibility for multi-agent coordination. Although the focus of those studies were

    not on failure prevention of MAS, they did provide inspiration and interesting ideas

    for developing AIS based MAS control.

    Being one of the multi-agent systems, multi- robot systems take advantages of on the

    immune-based control. The follow sections are some related immune-based control

    frameworks and immune inspired robot related applications.

    In 1998, Dasgupta (1998) proposed a general framework for multi-agent decision

    support, which is the one of the first frameworks for building MAS with

    immunological design. In his design, three immunological properties: 1) mobility, 2)

    adaptively and 3) collaboration were introduced to the system, and agents were

    classified into different behavioural groups with respect to their functions (Beer et al.

    1990).

  • 8/19/2019 FullText Ais

    44/108

    2-26

    Agent Environment

    Agent Communication Scheme

    Agent Control Mechanism

    Response / Decision Strategies

    Figure 15 Functional Modules Architecture by Dasgupta (1998) 

    Figure 15 shows the functional modules architecture of the general framework

     proposed by Dasgupta, where response and decision strategies were made on top ofthe agent environment by a set of agent communication scheme and agent control

    mechanism. This framework helps to achieve better interaction and communication

    through co-simulations between role-based agents.

  • 8/19/2019 FullText Ais

    45/108

    2-27

    In 2004, Ko et al. (2004) presented a general suppression control framework (GSCF)

    to control a decentralized mechatronic manipulator arm of modular. The framework is

    mainly inspired from the immune suppression hypothesis in the immune

    discrimination theory (Sompayrac 1999).

    Figure 16 The General Suppression Control Framework by Ko et al. (2004)

    Figure 16 shows the design of general suppression control framework where emergent

    group behaviours are generated by suppression mechanism between aggressive and

    tolerant cells according to the local environment changes. This framework helps

    effectively reduce the counter-productive behaviour committed by individual modular

     parts of the robot arm and also helps in self-balancing robot control (Ko et al. 2005).

  • 8/19/2019 FullText Ais

    46/108

    2-28

    Lau & Wong (2003, 2004, and 2006) developed a distributed multi-agent control

    framework to effectively control a group of agents with different capabilities based on

    a number of immune system characteristics such as immune memory (Smith et al.

    1999), specificity (Hofmeyr and Forrest 1999) and diversity (Fukuda et al. 1999).

    Figure 17 Architecture of control framework for individual AIS agents by Lau (2006)

    Figure 17 shows the architecture of the distributed multi-agent control framework

    where individual AIS agents are able to adapt their pre-defined functionalities and

    generate new functionalities based on their own long terms and short terms memory.

    This framework has shown significant improvement on the efficiency of MAS when

    applying in automated material handling (Lau and Wong 2003) and intelligent

    transport system (Lau and Wong 2004).

  • 8/19/2019 FullText Ais

    47/108

    2-29

    Lu & Lau (2009) created a real-time cooperative control framework for networked

    multi-agent systems, namely AIS-based Cooperative Reaction Framework (ACRF).

    This framework was mainly inspired from the lymphocyte repertoires’ meta-dynamics

    (Mehr 2005) and the immune diversity (Tonegawa 1983). A preliminary simulation

    studies has been done demonstrating the effectiveness of ACRF for controlling agent

    team in RobotFlag Drill problem.

    There are also other immunity based multi-robot research studies focusing on specific

    applications, such as, Lee & Sim (1997) use clonal selection and immune network to

    control a Distributed Autonomous Robotic System (DARS) in solving search problem;

    Meshref & Landingham (2000) use immune network to solve a Dog and Sheep

     problem; and the hybrid algorithm Thayer and Signh (2002) used for robot

    exploration and mapping domains. These all shows immune network useful in multi-

    robot system.

    Though those research studies mentioned above are not directly related to robot

    failure, except Khan & Silva (2008) start suggest an immune network based self-

    regulated fault tolerant multi-robot system with some consideration on partial and full

    failure robot. Yet, in their approach, there are only job reallocations when robot

    failures occur, without having consideration on robot recovery.

  • 8/19/2019 FullText Ais

    48/108

    2-30

    2.3. 

    Summary

    In this chapter, a review on the development of robot recovery and a summary of

    related recovery mechanism is presented in Section 2.1. These studies help to move

    towards automatic recovery in multi-robot system. Although some studies are still in

    rudimentary stage, with increasing numbers of research studies focusing on this area,

    more mechanisms will be available and the recovery technology will soon be mature.

    However, those research studies yet look into the effectiveness of automatic

    cooperative failure recovery, such as the worthiness of having one of the robots

    actually helps in the action of recovering other failed robots. It would be worthwhile

    to create a generic automatic cooperative failure recovery model which can coordinate

    and provide effective and dynamic control for those mechanisms.

    The reviews on artificial immunity in Section 2.2 demonstrated the capability,

     principles and properties of AIS on current applications, which can be applied to the

     proposed new recovery model.

  • 8/19/2019 FullText Ais

    49/108

    3-1

    Chapter 3

    Cooperative Recovery Model

    One of the most important concepts in this model is the introduction of cooperative

    failure recovery, the concept of allowing a team of robots to recover and survive in

    situation which individual cannot. By combining the capability of multi-robot, this

    model allowing the system to continue functioning effectively under dangerous and

     poor environment. The implementation gives the system ability to recover from

     problem, which beyond the capability of single robot can do. It is thought-provoking

    to study the influence of cooperative recovery on a multi-robot system and how can it

    affect the system in terms of system productivity, survivability and overhead. This

    chapter is going to look into the concept of cooperative failure recovery, its

    implication as well as the proposed model which helps to control the recovery action

    while adapt the system situation and optimise the overall performance.

    The cooperative recovery model proposed in this chapter also takes account of robot

    self-recovery. The model can be applies to the most of common physical failures of

    robot system. Yet, software failures are not its primary concern.

    3.1.  Cooperative Failure Recovery

    The cooperative failure recovery itself is a generic concept which refers to any

    operations that the robots may performs on the spot to assist malfunctioned robots

    recovering from non-functional status to a better functional status. The cooperative

    failure recovery operations diverse among systems and the degree of recovery can be

    varied.

  • 8/19/2019 FullText Ais

    50/108

    3-2

    For example, if one robot in a landmine removal system accidently has its wheel

    rolled on a mud trap or being locked up for some reasons becoming immobilized; the

    operations which other robots may performs freeing it from the trap, such as pulling

    or pushing, resulting the immobilized robot can able to continue its reconnaissance

    mission, would be considered as cooperative failure recovery operations. If the

     problem is more complicated and the robot still cannot move after leaving the trap,

    having other robots withdrawing it to the safe area in repairing station can also be

    considered as a cooperative failure recovery operation.

    The goal of the concept is to give the system cooperative recovery abilities which can

    help the system keep functioning for a longer period and are considered much useful

    for system deployed in the highly hazardous environments, such as highly radioactive

    area, where human intervention on the spot are not favourable.

    3.2.  Recoverable Mechanics

    There are many ways to implement a cooperative recovery solution, and the

    cooperative recovery concept can be applicable in numerous situations. Yet, the detail

    implementation of recovery mechanics for each particular robotic failure can be very

    specific. As the focus on this study is on the general impact of cooperative recovery to

    the multi-robot system, this thesis would not cover the mechanical details of each

    recovery method. A collection of possible robot recovery mechanics has already

    covered in Chapter 2. Those mechanics can be summarised into three implementationapproaches when developing this model. The recovery operations in this research are

    assumed to be implemented on robot system which built with these three design

    approaches: 1) parts replacing, 2), strength combining, and 3) resources sharing

    In this basic model, only the first two approaches are considered. The resources

    sharing approach will be introduce in Chapter 4.

  • 8/19/2019 FullText Ais

    51/108

    3-3

    3.2.1. 

    Parts Replacing

    This approach allows some failed components and parts on a robot being

    replaced or reconfigured by its own or other robots. Spared parts installed on

    the robot; or a robot can obtain and replace some components from others.

    Examples of “Parts Replacing” design are mining robots with spare drill for

    changes; surveillance robots with spare tire and wheels. Those modular

    reconfigurable robots and other models in section 2.1.2.2 with abilities to swap

    robot module are also belong to this type.

    With this “Parts Replacing” design, robots can carry out cooperative failure

    recovery operations by replacing the malfunctioning components on failed

    robots.

    3.2.2. Strength Combining

    The “Strength Combining” approach allowing robot to be lifted up by another

    robot or being towed to repair station. Multiple robots may combine strength

    to remove obstacles in order to free those robots being trapped. The inter-robot

    docking in section 2.1.2.4 can be useful for this approach.

    This design enables the cooperative failure recovery operations by having

    assistance from other robots. Problems apart from location trapping are all

    assumed to be fixed automatically or with manual in the repair station where is

    the location human interaction is allowed in the environment.

    Given huge effort to develop and invest on the implementations of these mechanism,

    one multi-robots system with all those three recoverable mechanics would be consider

    as an advanced tolerant multi-robot system with very high survivability in different

    situations.

  • 8/19/2019 FullText Ais

    52/108

    3-4

    3.3. 

    Different roles of robots

    For the purpose of this thesis, the robots are labelled as “red” robot, “white” robot and

    “black” robot.  These are the three roles of robots in the recovery model, their roles

    change according to their status in the system.

    Table 3. Colour of role in particular roles for illustration purpose

    Colour Role and Status

    Red Productive, carrying out system primary objective

    White Non-productive, Helps in robot recoveryBlack Non-productive, malfunctioning, failed, need rescue

    Table 3 listed how the colour is used to label the corresponding role of robot, which

    will help explaining the proposed model and robot interaction in the following

    chapters. “Red” robots refer to the robots working on the primary system function;

    “Black” robots refer to the robots malfunctioning; while “White” robots refer to therobots which carrying out recovery operations.

    To maximize the system suitability, in this basic model, only red robots can become

    white robot and the cooperative recovery can only be carried out by white robot.

    Figure 18 illustrates the change of robot from different state.

  • 8/19/2019 FullText Ais

    53/108

    3-5

    Figure 18 The change in robot state and role

    White

    Robot

    Red Robot

    Black

    Robot

    Helps in robot

    recovery

    Malfunctioning,

    Failed,

    need rescue

    Recovered /repaired

  • 8/19/2019 FullText Ais

    54/108

    3-6

    3.4. 

    The Cooperative Recovery Trade-off Problem

    Although cooperative failure recovery can be favourable to many multi-robot systems

    in principle, how to make the operations effective is one big issue. One of the

    intentions in this research is to find out whether it is worthwhile to develop those

    mechanics and whether a system with cooperative recovery can be cost-effective after

    all.

    The process of cooperative failure recovery itself does not favourable to the primary

    system objective. Whenever a robot turns into a “white” robot, it  has to stop

     processing primary system tasks. This leads to a temperate drop in system functioning

     performance until it turns to “red”  again. One typical situation is having the entire

    group of functioning robot turns into “white” robots, leaving no “red” robots

     performing primary functions. Figure 19 shows two troughs in the primary objective

    score of a system due to cooperative recovery operations.

    The system performance would greatly suffer if the priority of the recovery action is

    set too high. However, the priority of recovery action cannot be too low neither,

    otherwise, there will chance where all robots become “black” and having entire

    system failure.

    Therefore, there must be a means to control the entire cooperative recovery operations,

    such that the system recovery mechanics can be worthwhile.

  • 8/19/2019 FullText Ais

    55/108

    3-7

    Figure 19 The performance of system drops when recovery operations in place

    Here are the problems the system should be addressing from time to time:

    1.  Which cooperative recovery operation should be carried out?

    2.  When the operations should be done? What is the priority?

    3.  Which robots should be the “white” robot? 

    4.  Would the operations put the system survivability at risk?

    5.  How would the primary system function being affected?

    6.  Did the operations really resolve the problem?

    A new recovery model inspired by human immunity system is hence proposed to

    address this captioned issue.

    Time

    2 black robot

    + 3 white robot invloed in

    operation

    1 black robot

    + 1 white robot

    invloed in operation

       P   r   i   m   a   r   y   O    b   j   e   c   t   i   v   e   S

       c   o   r   e

  • 8/19/2019 FullText Ais

    56/108

    3-8

    3.5. 

    Immunity Inspired Recovery Model

    In this proposed model, all actions ( ) can be performed by each robot are consideredas two types: one is the  Normal Actions  (), which is the primary action that therobots must carry out in order to fulfil its original purpose; the other is the

    Sustainment Actions (s), which the robot can performs to maximise overall system life,

    including all recovery strategy. Every robot in the system is assumed to have the

    abilities to perform one of these actions at a time.

       

    The requirements and designs for each cooperative failure recovery strategy can be

    very different, it would, however, not affect the application of this generic model.

    Sustainment   Actions () are defined as all the behaviours that are used to preserve andimprove the system sustainability, such that the system can operate, sustain and keep

    on productive for a longer period.  Failure Recovery Actions  ( ) and the Mandatory Maintenance Operations  ( ) are both belongs toSustainment Actions. A strategic sustainment action series is defined as sustainment

    response () and strategic normal action series is defined as normal responses ().

     

     

  • 8/19/2019 FullText Ais

    57/108

    3-9

    3.5.1. 

     A Two-level Response Architecture

    Inspired by two-line defence mechanism of human immune systems (Sompayrac 1999,

    Purves et al. 2001, Male et al. 2006), there are two levels of sustainment response:

    Self-Sustainment Response () and Cooperative Sustainment Response (). Thedesign of the two-level sustainment responses shares the same characteristics with the

    innate and adaptive response in human immune systems. These two responses form

    an essential protection for multi-robot system to recover from different degrees of

    failure and help to preserve a balance between system performance and system

    sustainability.

    In the two levels sustainment model, the Self-Sustainment Response () refers to allgeneral sustainment actions which individual robot can performs to make precaution

    and prevent itself failing, such as essential maintenance and replacing of replaceable

    components; while Cooperative Sustainment Response () refers to all sustainmentactions which individual robot can performs to recovery failure robots and prevent

    entire system failure.

    The primary reason of having two levels architecture of responses is to avoid common

    robot failures if possible and minimize the chance of triggering cooperative response.

    Although the cooperative sustainment response is useful to maintain system

    sustainability, it is however not very favourable to the system performance due to its

    implantations are rather specific and usually more time consuming. This architecturehelps to control the trade-offs.

  • 8/19/2019 FullText Ais

    58/108

    3-10

    Innate Immune

    Response

    Adaptive Immune

    Response

    Self-Sustaining

    Response (R ss)

    Cooperative Sustaining

    Response (R sc)

     Human Immune System :

     Proposed Recovery Model :

    1° line of defence 2

    ° line of defence

    If pathogens

    evade

    If failures

    occur

    Figure 20 The two-level defence mechanism in human immune system and proposed model

    Figure 20 shows how the two-level sustainment response model was inspired from

    human immune system. In human immune system, innate immune response functions

    as the first general barrier which attempts to stop pathogens from entering the body,

    helps minimizing the chance of infections; while adaptive immune response triggers if

     pathogens escape from innate response, it produces specify antibodies to stop

    infections (Purves et al. 2001). In this new model, the two-level sustainment

    responses work in a similar manner. The Self-sustainment response () works as thefirst barrier to reduce the chance for common robot failure while cooperative

    sustainment response () activates when there robot failures occurred and helps thesystem to recover.

  • 8/19/2019 FullText Ais

    59/108

    3-11

    Table 4. Relationships between the proposed recovery model and the immune system

    Immune System Proposed Recovery Framework

    Self Red robots, White robots

     Non-self Tasks, Black robot

    Anti-bodyPair of Conditions with Strategic Behaviours

    (Response)

    Antigen Set of Conditions (Conception)

    Antigen Presenting Cell (APC) Sustainability Evaluator

    Innate Response Self-Sustainment Behaviour

    Adaptive Response Cooperative Sustainment Behaviour

    Co-Stimulus Relations between Strategic Behaviours

    Table 4 shows the mappings between immune system and the proposed recovery

    model. The “Red” Robot and “White” Robot are regarding as self, which is good and

    no harm to the system; while tasks and “Black” robot are r egarding as non-self, which

    should be clear and reduced whenever possible. System conditions are the antigen; the

    strategic behaviours the anti-body which trying to match with the system condition.

    Sustainability evaluator serves as the antigen presenting cell, which evaluate the latest

    system conditions. The relations between behaviours are control through the

    suppression and stimulation mechanism inspired by immune network theory (Jerne

    1974).

  • 8/19/2019 FullText Ais

    60/108

    3-12

    3.5.2. 

    The Control System

    Design of the control system of individual robot is shown in Figure 21. The functions

    of each component are described as follows:

    1.  Sustainability Evaluator 

    During system operation, the condition of the system situation is perceived as

     perceptions through different detectors and sensors. The functions of the

    Sustainability Evaluator (APC) is to evaluate the current system situation base

    on the perceptions, creates the conception (antigen) of the current system

    condition and present it to the Response Reactor.

    2.  Response Reactor

    The Response Reactor processes the condition and tries matching the

    condition with potential behaviours (antibodies). Its functions are to calculate

    the favourability of the potential behaviour, mutating their attributes and find

    out the most suitable candidate. Suppression and stimulation among

     behaviours also take place inside the Response Reactor.

    3.  Response Evaluator

    In general, the behaviours with the highest favourability would be carried out

     by the robot. The response evaluator is use to justify if the behaviours

    favourable as expected, which give feedback and adjust the potential

     behaviours in the Response Reactor for next decision.

    4.  System Environment

    The system environment is where the interactions between robots take place.

  • 8/19/2019 FullText Ais

    61/108

    3-13

    Figure 21 Flow of operation from perception to behaviours

    Response

    Reactor

    Sustainability

    Evaluator

    Response

    Evaluator

    System Environment

    Perceptions

    (Detected information)

    Conceptions

    (System condition)

    Selected Behaviors

    Result of Behaviors 

    Feedback

  • 8/19/2019 FullText Ais

    62/108

    3-14

    3.5.3. 

    Design of Antibodies

    There are two types of anti-body, the Normal Response Antibody,  N-Cell,  and

    Sustainment Response Antibody, S-Cell. The  N-Cells  are corresponding to normal

    actions, responsible for the primary system objectives; while the S-Cells  are

    corresponding sustainment actions, responsible for the system sustainability. The

    structures on these antibodies are shown in Figure 22. 

    Conceptions

    Paratope

     Normal Response Co-stimulus

    Idiotype

     N

    Cell

    - Detection of Obstacles

    - Detection of Tasks

    - Concentration of Agents- Distance from stations- Signal Strength

    - Type of Signal- etc ….. 

    - Approaching Tasks

    - Avoid Obstacles

    - Continue Explore- Completing Tasks

    - Response Attributes

    Conceptions

    Paratope

    Sustainment Response Co-stimulus

    Idiotype

    S

    Cell

    - Existence of Failed Agent

    - Concentration of Agents- Agent Health Level- Location of Failed Agent

    - Distance from Base- Sustainment Index

    - Self Sustain Level- Cooperative Sustain Level

    - Self Sustain Action

    Sets- Cooperate Sustain

    Action Sets

    - Response Attributes 

    - Suppressed

     N Cell Actions- Suppressed

    S Cell Actions- Stimulated

     N Cell Actions

    -Suppressed

     N Cell Actions- Stimulated

     N Cell Actions

     Normal

    Response

    Antibod

    Sustainment

    ResponseAntibody

    Function

    - Productive

    Function

    Function

    - Cost Function

    - Recovery

    Function

    Figure 22 Design of N-Cell and S-Cell

    Each antibody corresponds to a particular action or response. The set of conditions

    triggering the antibody are modelled as paratopes, which are the required system

    conditions to carry the corresponding action or response. The size of paratopes and

    the conditions are vary depending on the system objectives and the recovery

    mechanics applied. If the response is to replace the damaged drill by obtaining a spare

    drill from nearest robot, having one robot with spare drill nearby would be one of the

    condition in the paratope. In order to trigger an antibody, all the conditions of

     paratope must be matched.

  • 8/19/2019 FullText Ais

    63/108

    3-15

    The middle parts of the antibody are the action response action series, which is the

    actually action the robot have to involve in the response and the related attitudes, such

    as direction, amount and type of components to be involved. The body of the antibody

    also contains the favourability functions for that response, which describe the way of

    how the favourability of that antibody being formulated.

    Some actions or response may have special relationship to others. For example, if a

    robot is low in health and the nearest repair station is at the east, moving to carry out

    tasks at east would be more favourable than moving to west for other tasks. Similarly,

    if a robot having communication problem with its radio receiver, moving toward other

    robot and keeping line of sight distance would be more favourable than moving to

    unexplored area. These relations are modelled as the idiotype for stimulating or

    suppressing related antibodies.

    To conform to the two levels defence mechanism, the S-Cell   carrying Self-

    Sustainment Response would suppress those carrying Cooperative Sustainment

    Response; and S-Cell would suppress N-Cell in general.

  • 8/19/2019 FullText Ais

    64/108

    3-16

    3.5.4. 

    Classification of Recovery Operations

    Based on the complexity and the effectiveness of each recovery operation, each

    recovery operation can be classified by four criteria.

    1)  Would the operation put the “white” robot at risk? 

    2)  Would the operation involve more than one “white” robot? 

    3)  Would the operation fully resolve the malfunction for a reasonable period?

    4)  Would the operation put the “white” robot at risk?

    If the operation involves only itself, it is denoted as “Self recovery”; if it involves

    only one “white” robot, it is denoted as “Simple” recovery; on the other hand,

    “Complex” recovery refers to recovery operations which involve multiple “white”

    robots. If an operation would be able to completely resolve the problem for a

    reasonable long period, it would be a “Full” recovery, else, it denoted as “Partial”

    recovery. If the operation would not introduce potential problem to the “white” robot,

    the operation would denoted as an “Assist” recovery operation, otherwise, which

    would be denoted as “Sacrifice” recovery. “Sacrifice” recovery operation would not

     be consider in this basic model. This will be discussed and introduce in chapter 4

    inside the Sacrificing Recovery Model.

  • 8/19/2019 FullText Ais

    65/108

    3-17

    3.5.5. 

    Behavioural Decision Making Algorithm

    Inside the Response Reactor, a decision making algorithm shown as below is applied.

    Input : T = System condition from Sustainability Evaluator

    P = processing power of individual robot

    Sr = sustainability ratio,

    h = number of highest favourability clones

    ft = favourable threshold

    Output : B = highest favourable antibody

    Begin

    Create a set of p randomly selected antibodies from Rs and Rn, P

    The numbers of antibodies from O and N are proportional to Sr

    forall behavioural decision making do

    Eliminate antibodies not matching with antigen T from P

    Generate clones of the antibodies in P, A

    Mutate attributes of these clones A

    Determine the favourability amongst all the antibodies

    Eliminate the antibodies with favourability lower than ft

    If antibodies remains in P

    execute highest favourable antibody, B

    increase favourable threshold ft value 

    otherwise

    reduce value of favourable threshold ft

    repeat regenerating randomly selected antibodies step 

    Get feedback from the Response Evaluator

    If returns less than ft,

    place a copy of the B into

    the unfavourable memory set, UM

    eliminate B from P

    otherwise place B into favourable memory set, FM

    place the h highest

    favourability antibodies into the P

    Introduce randomly selected antibodies from Rs and Rn

    and place into P following the ratio Sr

    end

    end 

  • 8/19/2019 FullText Ais

    66/108

    3-18

    The algorithm is built upon the emergent learning and memory properties of the

    immune network theory, while having a scalable consideration on the processing

     power limit of individual robots. As the complexity of system and recovery

    mechanics increase, the number of antibodies would increase in scale. Indeed of

    considering all possible antibodies in every behavioural decision, a sample population

    is used instead. The population size (p)  can be setup according to the actual

     processing power of individual robot. A sustainability ratio () is used to control theamount of normal response and sustainment response to be considered at a time.

    The ratio of N-cell and S-cell is defined as sustainability ratio () computed based onthe sustainability level of the robot. Equation (1) shows the formula of the ratio,

    where n is the number of N-cell being selected while s is the number of S-cell being

    selected.

      (1)

    The sustainability level () of a robot   is determined by the its estimated healthstatus of the robot and the ratio of functioning robots around at that particular moment,

    which is show as equation (2) below, where    .

     

        (2) 

     is the estimated health of the robot and the is the maximum possible healthof the robot . The total number of robot in the system is denoted by  and  refersto the number of functioning robots. When the sustainability level of a robot is high,

    the value approaches 1 and it drops as sustainability drops.

        (3)

  • 8/19/2019 FullText Ais

    67/108

    3-19

    In this model, the sustainability level ( ) is equal to sustainability ratio ( )multiplied by constant justifier   , depending on the importance of systemsustainability. The justifier define the minimum percentage of S-cell response would

     be taken into consideration every decision circle. The smaller it is the higher

     percentage of S-cell would be selected for decision making. If   is equals to three,there will be at least 25% of decision antibodies taken from S-cell; if   equals to one,the minimum percentage of S-cell for consideration would be 50%. In a goal-

    orientated system, such as rescue robots in disaster, where rescue mission itself is

    more important than the life of robot, the justifier would be bigger. In planetary

    exploration system, which system sustainability is crucial, the justifier would be

    smaller for maximizing the system survivability.

    The favourability () of an antibody x is determined by the favourability function ofthe antibodies. For N-cells, the favourability function is defined as equation (4), its

    value is affected by the expected productive value (   for completing thecorresponding normal response, number of favourable memory (   ) andunfavourable memory () of robot decision and the influence from other triggeredantibodies (). Equation (5) shows the influence value where the first term isthe stimulation from other antibodies and the second term is the suppression among

    antibodies.  denoted as the stimulation from antibody y to antibody x,  denotedas the suppression of antibody z to antibody x.

      (4) 

    ∑   ∑   (5)

  • 8/19/2019 FullText Ais

    68/108

    3-20

    The favourability function of S-cell is defined as equation (6), the favourability is

    determined by the expected gain in production value after the response ( ), theexpected loss of production during the recovery (), the current sustainability level(

    ) of the robot, as well as memory and co-stimulations factors.

      (6)

    The favourable memory ( ) and unfavourable memory () give merits to theantibodies which bring benefits to the system and penalties to those not performingwell as expected. Th