Evolving Solutions to Graph Theory Problems A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Luke Branson IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF SCIENCE Dr. Andrew Sutton May 2023 © Luke Branson 2023 Acknowledgements First I would like to acknowledge my advisor Dr. Andrew Sutton for his support and advice during my time as a Master’s student at the University of Minnesota-Duluth. Working with him has helped me discover a passion for research that I didn’t know I would have, it has also provided me with the opportunity to travel to Boston where I was able to meet other researchers and present our work at GECCO 2022. I would also like to thank everyone in both the department of Computer Science and the department of Mathematics and Statistics at the University of Minnesota- Duluth. The staff, faculty, and student body has given me support and provided me with a great learning environment. i Abstract Randomized search heuristics such as evolutionary algorithms are general-purpose techniques that have been shown to be effective at solving combinatorial optimiza- tion problems. In this thesis, we use both empirical and theoretical techniques to examine the use of a simple evolutionary algorithm, the (1+1) EA, applied to com- binatorial optimization problems from graph theory. First, we investigate the evo- lutionary algorithm on problems k-VertexCover, k-FeedbackVertexSet, and k-OddCycleTransversal, and present Fixed-Parameter Tractable performance guarantees for the (1+1) EA equipped with a tailored jump-and-repair operator. We then follow up with empirical results of the (1+1) EA applied to the k-VertexCover problem. Next we move on to investigate the (1+1) EA applied to graceful labeling. We present the first rigorous runtime analysis of an evolutionary algorithm finding graceful labelings of graphs, and prove polynomial time upper bounds for paths, stars, and complete bipartite graphs with a constant sized partition. To supplement the theoretical results, we also empirically compare the performance of the (1+1) EA to that of a complete constraint solver. ii Contents Contents iii List of Tables v List of Figures vi 1 Introduction 1 2 Background 3 2.1 Combinatorial Optimization . . . . . . . . . . . . . . . . . . . . . . . 3 2.2 Evolutionary Algorithms . . . . . . . . . . . . . . . . . . . . . . . . . 4 2.2.1 Evolutionary Algorithms . . . . . . . . . . . . . . . . . . . . . 4 2.2.2 Operators . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 2.3 Runtime Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10 2.3.1 Fixed-parameter tractable EAs . . . . . . . . . . . . . . . . . 11 2.3.2 Iterative compression . . . . . . . . . . . . . . . . . . . . . . . 12 2.4 Problems From Graph Theory . . . . . . . . . . . . . . . . . . . . . 13 2.4.1 Vertex Cover . . . . . . . . . . . . . . . . . . . . . . . . . . . 13 2.4.2 Odd Cycle Transversal . . . . . . . . . . . . . . . . . . . . . . 14 2.4.3 Feedback Vertex Set . . . . . . . . . . . . . . . . . . . . . . . 15 iii 2.4.4 Graph Labeling . . . . . . . . . . . . . . . . . . . . . . . . . . 16 3 Results 20 3.1 Jump and Repair Operators . . . . . . . . . . . . . . . . . . . . . . . 21 3.1.1 k-Vertex Cover . . . . . . . . . . . . . . . . . . . . . . . . . . 27 3.1.2 Feedback Vertex Sets in Tournaments . . . . . . . . . . . . . . 31 3.1.3 Odd Cycle Transversals . . . . . . . . . . . . . . . . . . . . . 37 3.1.4 Experiments . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 3.2 Evolving Graph Labelings . . . . . . . . . . . . . . . . . . . . . . . . 51 3.2.1 Star Graphs . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 3.2.2 Bicliques . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56 3.2.3 Paths . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 3.2.4 Experiments . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69 4 Conclusions 73 iv List of Tables v List of Figures 2.1 Examples of graceful graphs with their graceful labelings. . . . . . . . 17 3.5 Mean running time of (1+1) EAk on random planted vertex cover instances. Shaded bands represent standard deviation. . . . . . . . . 45 3.6 Mean running time of (1+1) EAk on vertex cover instances. Shaded bands represent standard deviation. . . . . . . . . . . . . . . . . . . . 46 3.7 Mean running time of (1+1) EAk divided by n2 lnn on random planted vertex cover instances. . . . . . . . . . . . . . . . . . . . . . . . . . . 47 3.11 Star graph K1,7 with a graceful labeling. . . . . . . . . . . . . . . . . 55 3.12 Path graph P10 with two graceful labelings (top and middle) and a non-graceful labeling (bottom) that contains local symmetries from each. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61 3.13 Genotype-phenotype mapping from permutations of vertex labels into selection of unique edges. This is composed with a linear function on phenotype space. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 3.14 The top path is a baseline, the second path shows the operation R(x, i+ 1, j), the third path shows the operation R(x, i, j − 1), paths four through six demonstrate Case 3 subcase 1. . . . . . . . . . . . . . . 68 vi 3.15 Timing data for constraint solver, (1+1) EA and (1+1) EAℓ′ on path graphs Pn. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70 3.16 Timing data for constraint solver, (1+1) EA and (1+1) EAℓ′ on bi- cliques K2,n−2. For (1+1) EAℓ′ we also compare size 1 and size 2 partial labelings. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71 3.17 Timing data for EAs and constraint solver on cycle graphs Cn (left) and wheel graphs Wn (right). . . . . . . . . . . . . . . . . . . . . . . 72 3.18 Timing data for (1+1) EAℓ′ and constraint solver on bicliques with partition size 3 and 4. . . . . . . . . . . . . . . . . . . . . . . . . . . 72 vii 1 Introduction Evolutionary Algorithms (EAs) also referred to as Genetic Algorithms (GAs) are algorithms inspired by biological evolution, and use things like reproduction, muta- tion, recombination, and selection to search for solutions to optimization problems. They have been used as optimization tools in an industrial setting and studied aca- demically for decades. EAs are a popular method for solving optimization problems since they can optimize a function with little information about it. Many EAs are considered to be black-box models where they only have access to the problem through the use of their fitness functions. In this thesis we will examine the use of EAs for solving combinatorial optimization problems, specifically graph theory problems. First we present parameterized results, defined in 2.3.1, using repair operators, defined in 2.2.2, for finding a k-VertexCover and k-OddCycleTransversal in undirected graphs and finding a k-FeedbackVertexSet in tournaments in 3.1. All of these are all considered NP-Hard graph problems closed under induced sub- graphs. We prove that a simple EA, the (1+1) EA, can find a k-VertexCover in O(2kn2 log n) time, if one exists. This is an exponential (in k) improvement over the best-known parameterized bound for evolutionary algorithms on VertexCover given by Kratsch and Neumann [31]. For the k-FeedbackVertexSet problem in tournaments, we prove that the EA finds a feasible feedback set in O(2kk!n2 log n) iterations in expectation, and for k-OddCycleTransversal, we prove the EA re- quires at most O(3kkmn2 log n) time. To our knowledge, this is the first run time analysis of an evolutionary algorithm on both FeedbackVertexSet and OddCy- 1 cleTransversal. In 3.1.4 we present empirical results on the k-VertexCover problem. If we let G = (V,E) be a graph where V is the set of vertices and E is the set of edges, we call G a graceful graph if there exists a graceful labeling. A graceful labeling is a vertex labeling of G with an injective mapping ℓ : V → X where X is a finite set of labels. This vertex labeling induces a natural edge labeling where an edge e = uv is assigned the label |ℓ(u) − ℓ(v)|. Suppose G has n vertices and m edges. A vertex labeling is called a graceful labeling if X = {0, 1, . . . ,m} and ℓ induces an edge labeling bijective to the set {1, 2, . . . ,m} := [m]. This implies the induced edge labels under graceful labelings are unique. Graceful labeling was first introduced by A. Rosa in 1967 [49] under the name of β−valuation. Since then graceful labeling has become an important problem in pure mathematics [22, 8], and has several real-world applications such as in coding theory, determining ambiguities in X-ray crystallography, design of communication networks, optimizing circuit layout [7], the configuration of antennas in radio astronomy [6] and link-labeling algorithms in multi-protocol label switching (MPLS) networks [3]. The performance of the (1+1) EA when searching for a graceful labeling of a graph can change depending on the structure of the graph, the fitness function, and the mutation operator. We provide the first runtime analysis results of evolutionary algorithms on the graceful labeling problem, providing bounds for the (1+1) EA equipped with different mutation operators and fitness functions, on different classes of graphs in 3.2. We present the empirical results and discussion in 3.2.4. 2 2 Background 2.1 Combinatorial Optimization We call the area of mathematics that is primarily concerned with counting Com- binatorics. This is both used for obtaining results and defining properties of finite structures, being closely related to other branches of mathematics and applied in many other fields. In this paper when we talk about optimization, or optimization problems, what we mean is that we are looking for the best possible solution in the set of all feasible solutions. A feasible solution is a point in the set of solutions that satisfies all the constraints to the problem. When solving these types of problems there is usually some value that we are trying to minimize or maximize. Many of the problems of this type suffer from a combinatorial explosion: when a problem exhibits a rapid grown of complexity based on the problems inputs, con- straints, and bounds. This is relevant because as the problem grows in size, the solution space grows rapidly, and with it the difficulty of solving the problem. A combinatorial optimiztion problem is an optimization problem in which it is necessary to search a discrete but large solution space to try to find an optimal solu- tion. Usually these problems are inside a space so large that an exhaustive search is not possible. More formally, given a finite but intractably large set X and a function f , we want to find an element x ∈ X that maximizes (or minimizes) f(x). The Travel- ing Salesman Problem, Integer Linear Programming, and Minimum Vertex Cover are 3 some examples of well-studied combinatorial optimization problems. Many of these problems are important due to their real world applications, and are considered to be NP-Hard. Some traditional methods of solving combinatorial optimization problems are local search, constraint optimization, branch and bound. We call a problem a constraint satisfaction problem (CSP) when there exists a set of restrictions on what a solution can have, or a set of constraints or limitations that must be satisfied for the problem to be considered solved. There has been extensive research into different types of CSPs. These problems can modeled as optimization problems. GECODE is a complete constraint solver written in C++. It is open source and provides a state-of-the-art performance while being modular and extensible [23]. It can be used to find solutions for combinatorial optimization problems. In 3.2.4 we compare the perfomance GECODE to the performance of a evolutionary algorithms, defined in 2.2, for finding graph labelings 2.4.4. 2.2 Evolutionary Algorithms 2.2.1 Evolutionary Algorithms As the name suggests, Evolutionary Algorithms (EAs), sometimes called Genetic Algorithms (GAs), are randomized algorithms that use mechanisms inspired by bio- logical evolution. In biological evolution, results become noticeable after many small mutations occur over a long period of time, where only the fittest offspring are able to survive and reproduce. Evolutionary algorithms take advantage of mechanisms like reproduction, mutation, recombination, and selection. We call a possible solution to a problem a candidate solution. We use a fitness 4 function to measure how well the candidate solution performs. Similar to biologi- cal evolution, only the fittest are selected to reproduce. Each generation, candidate solutions go through a process that changes the offspring, this process is called re- combination or crossover if there are multiple candidate solutions being put together to create more, or called mutation if only one parent is required. The hope is that after generations of mutations we will find an optimal solution. For many problems it is not feasible to check all possible solutions to find the best solutions, and an approximate solution would be sufficient. In these types of problems, EAs usually perform well. Due to how they are designed, evolutionary algorithms can find many solutions to problems with multiple solutions or find approximate solutions to problems that cannot be easily solved using other techniques. When working with EAs and GAs, there are single objective optimization algo- rithms and multi-objective optimization algorithms. As the name suggests, single objective optimization algorithms have a fitness function defined to optimize a sin- gle objective. Multi-objective optimization problems are generally seen to be more difficult to solve than single objective optimization problems [42]. The (1+1) EA Algorithm 1 and (µ + λ)−EA Algorithm 2 are both examples of single objective algorithms, while Global SEMO Algorithm 3 [31] is an example of multi-objective EA. Algorithm 1: (1+1) EA-Generic 1 x← randomly generated Candidate Solution; 2 while stopping condition not met do 3 y ← x; 4 mutate(y); 5 if f(y) ≥ f(x) then x← y; The (1+1) EA is a very simple single-objective optimization algorithm that has 5 been applied to many problems. It has a population of size one, and in each generation there is only one offspring, with the better of the parent and the offspring moving on to reproduce in the following generation. It has also been the target of runtime analysis research [43], [20]. Algorithm 2: (µ+ λ)EA 1 Create P from µ uniformly at random generated candidate solutions; 2 while stopping condition not met do 3 P ′ ← P ; for i in range(λ) do 4 Choose x ∈ P uniformly at random; 5 y ← mutate(x) ; 6 P ′ ← P ′ ∪ {y} ; 7 Remove the λ candidate solutions with the lowest fitness from P ′; P ← P ′; The (µ+λ)−EA has a population of size µ and an offspring population size λ. This algorithm is simple and similar to the (1+1) EA, but allows for a larger population size, which can help with diversity. The (1+1) EA is a specific case of the (µ+λ)−EA where µ = 1 and λ = 1. Algorithm 3: Global SEMO 1 x← randomly generated Candidate Solution; 2 Find f(x); 3 P ← {x}; 4 while stopping condition not met do 5 Choose x ∈ P uniformly at random y ← x; 6 mutate(y); 7 if If y is not dominated by x, for all x ∈ P then P ← {y} and P\{Z}, where Z is the set of all points weakly dominated by y; Some problems benefit from trying to optimize not just one function, but multiple. In multiobjective optimization problems the objectives can conflict, where one cannot improve without the other getting worse. 6 In single objective optimization it is straightforward to compare the fitness of two points, just choose the point that has a greater or a lesser fitness depending on if we are trying to minimize or maximize. However, when trying to solve a multi- objective optimization problems, we have k fitness functions f1, . . . , fk. These k fitness function form a vector. So a candidate solution x has f : x → Rk. We also could have conflicting fitness functions, where fi can only improve if fj gets worse. To get around this it helps to define dominance as [16]: A ⪰ B ⇐⇒ ∀i ∈ {1, . . . , n}ai ≥ bi, and∃i ∈ {1, . . . , n}, ai > bi Where A,B are candidate solutions and A dominates B, and fi(A) = ai. When the objectives are conflicting, there is no single solution that will dominate all of the others, we call A nondominated if it is not dominated by any other. We hold onto all of these nondominated solutions which we call Pareto Optimal in a set F that we call the Pareto Front. 2.2.2 Operators The seminal paper by Scharnow, Tinnefeld, and Wegener [51] introduced a version of the (1+1) EA adapted to permutations using different mutation operators for reordering the permutation. The simplest mutation operator is swap(i), in which positions i and i+1 of the permutation are swapped. They also introduced three other more general operations: exchange(i, j) which exchanges the elements at positions i and j, jump(i, j) which makes the element at position i jump to position j shifting all the elements between in the appropriate direction, and finally reverse(i, j), where i < j, reverses the ordering of the elements in positions i, i+ 1, . . . , j. More recently Doerr, Ghannane, and Ibn Brahim introduced the scramble muta- 7 tion operation [12]. This operation selects a random subset of the permutation and shuffles, or scrambles, these positions by reordering with a random permutation. In many problems, mutating the candidate solution will result in a new candidate solution that is infeasible, or doesn’t satisfy basic constraints that any solution must satisfy. When this happens many times the infeasible solution can be corrected using a problem specific repair operator [10]. Using a repair operator allows the EA to freely generate infeasible offspring that can be then made feasible at a low computational cost. Techniques for constraint-handling in evolutionary algorithms include penalty functions, multi-objective optimization in both fitness and constraint space, special operators (such as random keys) and repair mechanisms. When mutation or crossover produce a solution that is infeasible, a repair operator will apply a transformation to the infeasible solution to make if feasible again. Early work on repair mechanisms in the context of evolutionary computation at- tempted to reconcile the use of standard bit string representations for more restrictive combinatorial structures. For example, Hamiltonian cycles in a graph can be easily encoded as bit strings, but classical mutation and crossover operators are very un- likely to result in offspring that are legal Hamiltonian cycles. This was an obstacle for traditional genetic algorithms tasked to solve problems over permutations such as the Traveling Salesperson Problem (TSP), and one proposed expedient was to repair infeasible offspring by applying a greedy algorithm [34]. Repair-based crossover oper- ators that work directly on permutation encodings have also been investigated [24, 40, 39]. For cardinality constraints on bit strings, the genetic fix and restricted search ap- proaches ensure that offspring always have fixed Hamming weight after crossover [50]. Repair techniques for minimum vertex cover were examined empirically for hier- archical Bayesian optimization (hBOA) and the simple genetic algorithm (SGA) [44]. 8 The repair mechanism employed was a local search algorithm that transformed in- feasible offspring into valid vertex covers. The authors found that these approaches (along with simulated annealing) discovered optimal vertex covers on Erdo˝s-Re´nyi random graphs significantly faster than complete branch-and-bound search. One drawback to repair mechanisms (including the one we present in this thesis) is that they are often problem-dependent, and require domain-specific knowledge [10]. Of course, it should be no surprise that extra domain knowledge can positively in- fluence the efficiency of an evolutionary algorithm. For example, in the context of VertexCover, He, Yao and Li [27] empirically demonstrated that allowing the mu- tation probability for each bit to depend on the corresponding vertex degree resulted in a significant performance gain over a pure black-box EA (i.e., access to the instance only via the fitness function). In the 2012 paper by Kratsch and Neumann [31] they find that Global SEMO (Algorithm 3) equipped with the Alternative Mutation Operator in Algorithm 4 can find a minimum vertex cover of size k in O(n2 + log n + k · n2 + n · 4k) time, giving us a parameterized result, defined in Section 2.3.1. Algorithm 4: Global SEMO Alternative Mutation Operator 1 Choose b ∈ {0, 1} uniformly at random; 2 if b=1 then 3 for i ∈ {1, . . . , n} do 4 If there exists an edge {vi, vj} ∈ E E(x), flip xi with probability 12 . Otherwise flip xi with probability 1 n 5 else 6 for i ∈ {1, . . . , n} do 7 Flip xi with probability 1 2 . 9 2.3 Runtime Analysis The amount of time needed to execute an algorithm, or how the size of an algo- rithm’s input related to the number of steps needed to complete the algorithm is the time complexity of an algorithm. The process of finding the time complexity of an algorithm is called runtime analysis. Best, worst, and average case runtime are all studied, since problems of the same size input may differ in performance. Randomized search heuristics are computational procedured that generate stochas- tic sequences of elements x(1), x(2), x(3), . . . ∈ X where the only access to f is through evaluating these points. We define runtime of a randomized search heuristic as T = inf{t : f(x(t)) is optimal} where T is a random variable, and runtime analy- sis seeks to understand its expectation (and concentration). To analyze the runtime of algorithms in this thesis we define runtime as the number of calls to the fitness function. We let x(t) be the best candidate solution seen by generation t. This allows the runtime T to be the first generation where f(x(T )) is optimal. The following theorems are commonly used for analyzing the runtime of evolu- tionary algorithms. Specifically they are used to relate the expected change in one step of the process to the time that it will take for the process to find a solution. Theorem 1 (Additive Drift Theorem [26]). Let (Xt)t∈N be a sequence of nonnegative random variables with finite support S ⊆ R≥0 such that 0 ∈ S. Let T := inf{t ≥ 0 : Xt = 0}. Then, if there exists δ > 0 such that for all s ∈ S \ {0}, E[Xt−Xt+1 | Xt = s] ≥ δ, then E[T ] ≤ E[X0]/δ. Theorem 2 (Multiplicative Drift Theorem [13]). Let (Xt)t∈N be a sequence of non- negative random variables with finite support S ⊆ R≥0 such that 0 ∈ S. Let smin = min(S \ {0}) and let T := inf{t ≥ 0 : Xt = 0}. Then, if there exists δ > 0 such that 10 for all s ∈ S \ {0}, E[Xt −Xt+1 | Xt = s] ≥ δs, then E[T ] ≤ 1+E[ln(X0/smin)]δ . 2.3.1 Fixed-parameter tractable EAs The theory of parameterized complexity [15, 19] allows the analysis of the run- ning time of an algorithm to be decomposed into multiple parameters of the input. The motivation is that many large intractable problems can be solved in practice because real-world problem instances usually exhibit some kind of restriction over their structure. Parameterized complexity aims to distill the source of hardness in a problem class by isolating the superpolynomial contribution to the running time to a parameter independent of the problem size. Formally, a parameterized problem is a language L ⊆ Σ∗×N for a finite alphabet Σ. A problem L is fixed-parameter tractable if (x, k) ∈ L can be decided in time g(k)|x|O(1) for some function g that depends only on k. The complexity class of fixed-parameter tractable problems is FPT. A problem L is slice-wise polynomial if (x, k) ∈ L can be decided in time |x|g(k). The complexity class of slice-wise polynomial problems is denoted XP. Note that for a problem in FPT, each fixed parameter value determines (via g) the size of a leading constant of a polynomial running time, whereas a slice-wise polynomial problem also has polynomial running time for each fixed- parameter value, but each fixed parameter value governs the degree of the polynomial. Applying parameterized complexity analysis to the run time analysis of evolution- ary algorithms is useful when one would like to gain direct insight into how problem instance structure influences run time on NP-hard problems [41, 54]. For a random- ized search heuristic, the optimization time is characterized as a random variable T that measures the number of fitness function evaluations until an optimal solution is first visited. With a suitable fitness function, it is possible to apply randomized 11 search heuristics for optimization problems to decision problems as follows. An algo- rithm is a Monte Carlo FPT algorithm for a parameterized problem L if it accepts (x, k) ∈ L with probability at least 1/2 in time g(k)|x|O(1) and accepts x ̸∈ L with probability zero. Any randomized search heuristic with a bound E[T ] ≤ g(k)|x|O(1) on L can be trivially transformed into a Monte Carlo FPT algorithm by stopping its execution after 2g(k)|x|O(1) iterations. It is therefore convenient to say a randomized search heuristic runs in randomized FPT time on a parameterized problem of size n when E[T ] ≤ g(k)nO(1) (similarly for randomized XP time). 2.3.2 Iterative compression The technique of iterative compression was first presented by Reed, Smith and Vetta for finding odd cycle transversals in a graph [48]. The main idea is to employ a compression routine that takes as input a problem instance together with a solution S and calculates a smaller solution or proves S is already of minimum size. The running time of the compression routine depends exponentially on the size |S| of the solution to compress, so a good way to utilize it is to build a graph up one vertex at a time while always keeping the minimum possible solution. In other words, starting with an empty graph V ′ = ∅ and S = ∅, since the problems we consider are closed under induced subgraphs, S would be a valid solution of G[V ′]. We then iterate over the vertex set V adding each vertex v to both V ′ and S (see Algorithm 5). In each step, since S would again be a solution for G[V ′], we call the compression routine on G[V ′] and S to compute a smaller solution for G[V ′], or certify none exists. Thus if the compression routine runs in O(g(k) · nc) time for a constant c > 0, iterative compression correctly solves the parameterized problem in O(g(k) · nc+1) time. The iterative compression technique motivates the (1+1) EAk framework pre- 12 Algorithm 5: IterativeCompression input: A graph G = (V,E) 1 V ′ ← ∅; 2 S ← ∅; 3 for v ∈ V do 4 V ′ ← V ′ ∪ {v}; 5 S ← S ∪ {v}; 6 S ← Compress(G[V ′], S); 7 return S; sented in this paper. In the remainder of the paper, we will prove that the jump-and- repair approach enables the (1+1) EA to simulate iterative compression, resulting in the solution of certain NP-hard graph problems in randomized FPT time. 2.4 Problems From Graph Theory 2.4.1 Vertex Cover Let G = (V,E) be a graph with V as the set of vertices and E as the set of edges, where an edge connects two vertices. A graph can either be directed or undirected. In an undirected graph, the edges go both ways, while in a directed graph, edges have a specific direction, in this case the edges are often referred to as Arcs. A Vertex Cover is a set of vertices of a graph such that an endpoint of every edge is adjacent to a vertex of this set. The vertex cover problem is an optimization problem in which we want to find the minimum number of vertices that form a vertex cover. Given an undirected graph G = (V,E) where |V | = n and |E| = m we want a subset V ′ ⊆ V of minimum cardinality such that for each e ∈ E, e ∩ V ′ ̸= ∅ holds. This problem is both NP-Complete and fixed-parameter tractable. Due to this, it has been the target for research [31, 43, 44, 30] . 13 A Matching is a set of edges of a graph that have no vertices in common. A maximal matching is any matching of G that is not a proper subset of any other matching. A maximum matching is a matching containing the largest possible number of edges. There could be multiple maximum matchings. Each maximum matching is also a maximal matching, however a maximal matching might not be a maximum matching. The endpoints of a maximal matching form a vertex cover. 2.4.2 Odd Cycle Transversal In an undirected graph G = (V,E) an odd cycle transversal is a set S ⊆ V such that G[V \S] is bipartite. So S, a set of vertices, is an odd cycle transversal if the graph G′ = ([V \S], E) is bipartite. Finding an odd cycle transversal is called a node- deletion problem for a nontrivial hereditary property, and is NP-complete [33]. The parameterized version for finding an odd cycle transversal of at most k, a k−odd cycle transversal, is fixed-parameter tractable. The k−odd cycle transversal was also the first problem to have an iterative compression technique developed, this was done in the seminal paper of Reed, Smith, and Vetta [48]. In the paper the authors presented an iterative compression algorithm for solving k-odd cycle transversal that was O(4kkmn). Since then [29] improved the analysis further to O(3kkmn) and [35] further simplified it. Currently the fastest parameterized algorithm for the problem is O(2.3146k·poly(n)) and is due to Lokshtanov et al. [36]. A tournament is an orientation of a complete graph, that is, a directed graph G = (V,E) such that for every distinct u, v ∈ V , either (u, v) ∈ E or (v, u) ∈ E. A tournament G is transitive when (u, v), (v, w) ∈ E =⇒ (u,w) ∈ E. 14 We call a tournament transitive if and only if it contains no directed cycles. More- over, all cycles contain a directed triangle, yielding the following basic proposition. Proposition 3. A tournament G is transitive if and only if it contains no directed triangles. Proof. If G contains a directed triangle, then it is not transitive. If G is not transitive, it must contain some u, v, w ∈ V such that (u, v), (v, w) ∈ E but (u,w) ̸∈ E (otherwise it is transitive). Since G is a tournament, it follows that (w, u) ∈ E so it contains the directed triangle u→ v → w → u. 2.4.3 Feedback Vertex Set In a directed graph G = (V,E) a feedback vertex set S ⊆ V is a set that intersects every directed cycle in G. So an induced subgraph G′ = ([V \S], E) is transitive. Finding a feedback vertex set of minimum size corresponds to the problem of making an antisymmetric relation transitive by eliminating the smallest possible number of elements. The problem is NP-hard [53, 9], however the parameterized version is in FPT [14]. The parameterized version of the problem, k-FVST, has us start with a tournament G and we search for a feedback vertex set of size at most k. Since every transitive tournament has a unique topological sort by starting from the vertex with zero in-degree, removing that vertex and recursively sorting the re- mainder of the graph we have the following proposition. 15 Proposition 4. If G = (V,E) is a tournament and S ⊆ V is a feedback vertex set of G, then there is a unique sequence on V \ S Sort(G,S) := (v1, v2, . . . , v|V \S|), where (vi, vj) ∈ E ⇐⇒ i < j. 2.4.4 Graph Labeling Graph labeling is an assignment of labels to the vertices, edges, or faces of the graph. The labels are generally represented by integers, but the problem of graph coloring can be thought of as a graph labeling problem. A labeling can be considered a function of the parts of the graph to a set of labels L. For example a function from V onto L would be called a vertex-labeling, similarly an edge-labeling would be a function from E onto L and a face-labeling would be a function from F onto L, where F is the set of faces in the graph. In 1967 Rosa introduced the β-valuation, a function f on a graph G = (V,E) with m edges, where f maps V → {0, 1, . . . ,m} where each edge xy is given the label |f(x) − f(y)|, and each of the edge labels is unique. The since the vertex labels are unique, function |f(x) − f(y)| forces the edge labels to be in the set {1, 2, . . . ,m}. The graceful labeling was introduced as a tool to be used to decompose graphs into isomorphic subgraphs. [22] Later on Golomb referred to the β-valuation as a graceful labeling, which is now the popular term and will be referred to as such for the rest of this paper [22]. Several classes of graphs have been proved to be graceful (or not). For exam- ple, the complete graph Kn is graceful if and only if n ≤ 4 [25, 7]. The cycle Cn is graceful if and only if n ≡ 0 or 3 (mod 4) [49]. All paths, wheels [21], double 16 07 3 1 6 3 2 8 0 7 42 5 3 1 6 (a) Cycle graph C8 2 14 13 3 11 5 9 0 2 2 14 12 13 1 3 10 118 5 6 9 4 (b) Wheel graph W8 211 0 3 9 1 8 12 2 9 11 3 6 8 7 412 12 5 (c) Crown graph S04 0 3 1 6 2 9 3 6 9 2 5 8 1 4 7 (d) Bipartite graph K3,3 Figure 2.1: Examples of graceful graphs with their graceful labelings. wheels [32], helms [4], crowns and complete bipartite graphs [25] are graceful. We refer to Figure 2.1 for an example of graceful labelings of some of these graphs. There are many classes of trees known to be graceful including symmetrical trees, trees with diameter at most 5, and trees with at most 4 end vertices. More generally there exists the Kotzig-Ringel conjecture [22], also known as the Graceful Tree Con- jecture, claims that all trees are graceful, for each tree there exists a graceful labeling. Since this conjecture came into existence, many papers have been focused on trying to prove partial results related to the Graceful Tree Conjecture. Many direct labeling algorithms developed so far are computational approaches for verifying the graceful tree conjecture on finite classes of trees. This conjecture, sometimes called the Ringel-Kotzig conjecture, is that every tree admits a graceful labeling. The typical approach to this verification is to systematically generate each tree with n vertices using the algorithm of Wright et al. [57] for small n, and then applying some kind of search for a graceful labeling of each tree. An early example of this was due to Aldred and McKay [2] who used this approach to verify that every tree with at most 27 vertices is graceful. To find a graceful labeling for a given tree, they employed an algorithm that searches the space of vertex label 17 permutations with a random vertex label swap operator and tabu list to hill climb on the number of distinct edge labels induced by the vertex labeling. Another approach by Horton [28] parallelized the search over trees and used greedy backtracking search on each tree to show that trees with at most 29 vertices are graceful. Fang [18] used a similar deterministic backtracking search together with a stochastic local search heuristic with a tabu list and Metropolis-like acceptance condition to verify all trees of to size 35 are graceful. This backtracking approach labels a tree by first picking a “root” to label with 0, and then systematically assigning legal labels until backtracking is necessary. If no labeling is found, a new vertex is selected as root. Eshghi and Azimi [17] posed the graceful labeling problem as an instance of integer programming, and developed a branch-and-bound search algorithm. They experimen- tally investigated the running time of this approach, and compared the performance of different branching rules. Similarly, Redl [47] described an integer programming approach, as well as a constraint programming approach to generating graceful label- ings. Mahmoudzadeh and Eshghi [37] developed an ant colony optimization (ACO) metaheuristic for finding graceful labelings. In their approach, a vertex labeling is characterized as a matching on the complete bipartite graph Kn,m+1 between the n vertices and the m + 1 possible vertex labels. The cost of a vertex labeling is the count of repeated induced edge labels. The ants are tasked at finding matchings that minimize this cost, since a zero cost matching corresponds to a graceful labeling. They empirically tested the running time on small graphs (up to 25 vertices) and performed an experimental comparison between the ACO approach and the integer programming approach of Eshghi and Azimi [17]. Graceful Labelings are important in pure mathematics, being the focus of many 18 mathematicians since first being introduced [22, 8]. Graceful Labelings have several real-world applications such as in coding theory, determining ambiguities in X-ray crystallography, design of communication networks, optimizing circuit layout [7], the configuration of antennas in radio astronomy [6] and link-labeling algorithms in multi- protocol label switching (MPLS) networks [3]. 19 3 Results In this chapter we will be discussing results related to different combinatorial opti- mization problems. First we will consider parameterized graphs problems finding run- time results for k-VertexCover, k-FeedbackVertexSet, and k−OddCycleTransversal using the (1+1) EA and a tailored jump and repair operator. Then we will go on to analyze the performance of the (1+1) EA when applied to the graceful labeling problem, finding runtime results for different classes of graphs. To collect our experimental results we used GNU parallel [56], a free software package that allows users to execute multiple jobs in parallel using one or more computers. We ran these experiments on a heterogeneous HPE cluster with AMD ROME compute nodes provided by the Minnesota Supercomuting Institute (MSI) at the University of Minnesota [38]. 20 3.1 Jump and Repair Operators In this section we will be considering multiple parameterized graph problems in which we are given a graph G = (V,E), either directed or undirected, and a nat- ural number k with the goal to find a set S ⊆ V with |S| ≤ k such that S is in some sense feasible with respect to G. The problems we will consider are k- VertexCover, where we we insist each edge of E is incident to at least one vertex in S, k-FeedbackVertexSet, where deleting S from the graph leaves us with a cycle-free graph, and k-OddCycleTransversal, where the graph obtained by deleting S is bipartite. Given a subset V ′ ⊆ V , the induced subgraph of G with respect to G, denoted G[V ′], is the graph obtained from G by including only vertices from V ′ and edges from E with both endpoints in V ′. Note that feasible solutions of the above listed problems are closed under induced subgraphs, i.e., if S is feasible with respect to G, then for any V ′ ⊆ V , S ∩ V ′ is feasible with respect to G[V ′]. The (open) neighborhood of a vertex v ∈ V in G is defined as the set N(v) = {u : (v, u) ∈ E}. The neighborhood of a set S ⊆ V is denoted by N(S) = ⋃v∈S N(v). Evolutionary algorithms tasked with solving problems like VertexCover typi- cally operate by representing a solution as a bitstring in {0, 1}n where |V | = n, which selects elements from V to include in S. The task is then to minimize the Hamming weight of the bitstring (and hence the cardinality of the chosen set) subject to the constraint that it must be a feasible solution, e.g., a valid vertex cover. The hope is that the algorithm would eventually find a feasible solution of Hamming weight k. We will take a different perspective to parameterized graph problems by two new insights 21 1. We expand the search space to both candidate solutions S and induced sub- graphs of G. 2. We employ a focused jump-and-repair step that has the potential to efficiently repair an offspring made infeasible by mutation. Thus, instead of searching for S-sets in G where |S| ≤ k, we design the EA to search for induced subgraphs of G such that |S| ≤ k and S is already a feasible solution for the subgraph. The fitness of a solution is the size of the induced subgraph. In this way, we systematically build up induced subgraphs together with already-feasible sets until the entire graph is recovered. We argue that this approach is useful on parameterized graph problems that are closed under induced subgraphs, such as the ones we investigate in this paper. To characterize both a set S and an induced subgraph of G, we define the search space over {0, 1}2n so that a candidate length-2n bit string x has a length-n prefix corresponding to vertices selected for the solution set and a length-n suffix corre- sponding to vertices selected for the subgraph. It is convenient from an analytical perspective to factor such a string into an ordered pair x := (xS, xV ) where xS corre- sponds to a candidate solution set in the induced subgraph G[xV ] 1. Ignoring vertices outside the induced subgraph, given a parameterized graph problem, a candidate string x = (xS, xV ) has two feasibility constraints: Solution constraint: xS must be a valid solution for G[xV ]. Cardinality constraint: |xS| ≤ k. It is therefore convenient to use the following definitions. 1Here, and throughout the paper, we will often abuse notation by directly interpreting length-n bitstrings as sets on n elements and vice-versa. 22 Definition 5. We say a string x = (xS, xV ) is solution feasible for G when xS is a a valid solution for the induced subgraph G[xV ], that is, the vertex set S = xS ∩ xV is feasible in the induced subgraph. We say x is cardinality feasible when |xS| ≤ k. We say a x is infeasible if at least one of these properties is violated. We define a general fitness function for a parameterized graph problems as follows. fk(x) =  −(|xS|+ |xV |) if x is infeasible, |xV | otherwise. (3.1) The function fk is designed to penalize infeasible solutions by attaining a negative value that pressures solutions toward smaller sets and graphs, while feasible solutions are rewarded for their size. In Algorithm 6 we define the (1+1) EA equipped with an additional JumpAn- dRepair operation as (1+1) EAj+rk . This is a rather general framework that can be used to apply an evolutionary algorithm to parameterized graph problems, and we will use for all of the problems that we consider. Algorithm 6: (1+1) EAj+rk input: A graph G on n vertices 1 choose x from {0, 1}2n uniformly at random; 2 while fk(x) < n do 3 Obtain y from x by flipping each bit with probability 1 2n ; 4 if y is solution feasible but not cardinality feasible then 5 y′ ← JumpAndRepair(y,G); 6 y ←{y,y′} fk; 7 if fk(y) ≥ fk(x) then x← y; This outlines a rather general framework for applying an evolutionary algorithm to a parameterized graph problem. Apart from the fitness function, there is only 23 one additional module that requires problem-specific knowledge to attempt to repair solutions. Moreover, there is also a just-in-time appeal to this approach, as at any point during the execution of the (1+1) EAj+rk , a feasible solution is a valid k-solution to some induced subgraph. In Algorithm 7 we give a general framework for the focused jump-and-repair op- eration. It starts with taking a solution-feasible offspring, not necessarily cardinality feasible, and executes a focused jump by selecting a random subset of the elements of xS (in line 3) and then calling a problem-specific repair procedure if the set of se- lected elements is not solution feasible (in line 5). This operation is successful when it kicks out enough elements for the repaired string to be both cardinality and solution feasible (see Figure 3.1). Algorithm 7: JumpAndRepair input: A pair of strings x = (xS, xV ) and a graph G /* Jump */ 1 S ′ ← ∅; 2 for i ∈ {1, . . . , n : xS[i] = 1} do 3 with probability 1/2 do S ′ ← S ′ ∪ {i}; 4 if S ′ is solution feasible for G[xV ] then return (xS′ , xV ); /* Repair */ 5 Find a minimal set T ⊆ xV \ xS s.t. S ′ ∪ T is solution feasible for G[xV ]; 6 return (xS′∪T , xV ); Like the alternative mutation operator (Algorithm 4) for Global SEMO (Algo- rithm 3) discussed earlier on page 9, the goal of the jump operation is to focus the search on components of the solution that need attention. Unlike the alternative mutation operator, which focuses on elements yet to be included in the solution, the jump operation tries to find a promising subset of the solution that can then be repaired. The repair part of the operation is problem specific and later we will define different 24 offspring focused jump repair solution-feasible region cardinality-feasible region feasible region Figure 3.1: A successful jump-and-repair operation transforms a cardinality- infeasible offspring by focusing on a random subset of xS and applying a repair oper- ation on the result to make it solution feasible again. repair operators for each of the parameterized graph problems that we consider. It is important that the repair operation runs in polynomial time for n for any meaningful FPT results, however it could more generally run in FPT time. According to Equation (3.1), any infeasible solution has a negative fitness, whereas every feasible solution has nonnegative fitness. This immediately yields the following lemma. Lemma 6. If the underlying parameterized graph problem is closed under induced subgraphs, then the (1+1) EAj+rk produces a feasible solution after O(n log n) steps in expectation. After this point, it does not accept infeasible solutions. Proof. We consider the potential function ϕ(x) := max{−fk(x), 0} and let (Xt)t≥0 be the stochastic process corresponding to the potential of the solution 25 generated by (1+1) EAk in the t-th iteration. The elitist nature of the (1+1) EAk ensures that this potential is nonincreasing, and since feasible points always attain nonnegative fk-values, ϕ(x) = 0 ⇐⇒ x is feasible. Thus it suffices to bound the drift E[Xt −Xt+1 | Xt = s] and apply Theorem 2. Assume that the point x in the t-th iteration is infeasible. It follows that Xt > 0, and in fact, Xt = |x| by Equation (3.1). We argue that there is a reasonably good chance that the (1+1) EAk can reduce the potential by at least one. For any index i such that x[i] = 1, the probability of producing an offspring y by flipping only the i-th bit of x to zero in the mutation step, leaving the remaining bits unchanged, is at least 1 2n ( 1− 1 2n )2n−1 ≥ 1 2en . After this, if the offspring y happens to be solution feasible but not cardinality feasible, then JumpAndRepair is called (line 5 of Algorithm 6) to produce an intermediate point y′. We now argue that y′ is copied back to y only if the potential of y′ is no larger than the potential of y. Note that y′ is guaranteed to be solution feasible, since the repair procedure always returns a cover. In the case that y′ is still not cardinality feasible, then in line 6, y′ is copied back to y only if fk(y′) ≥ fk(y) (and hence ϕ(y′) ≤ ϕ(y) = Xt − 1). Otherwise, if y′ is also cardinality feasible, then it must be a feasible point. Since all feasible points attain nonnegative fk-values, we have fk(y ′) > fk(x) and y′ becomes the new offspring in line 6. We would then have Xt+1 = 0 < Xt. In any case, under this mutation event, we have Xt+1 ≤ Xt − 1, and the mutation event occurs with probability at least 1/(2en). Summing over all one-bits in x, we have E[Xt−Xt+1 | Xt = s] ≥ s2en , and applying 26 Theorem 2, the expected time until a feasible solution z is first generated is O(n log n). At this point, since fk(z) ≥ 0 and every infeasible solution has a negative fitness, no infeasible solution is subsequently accepted. 3.1.1 k-Vertex Cover The first FPT problem that we consider is the k-Vertex Cover Problem. In this problem we are given an undirected graph G = (V,E) with |V | = n vertices and |E| = m edges, a positive integer k, with the goal of finding a set S ⊆ V such that |S| ≤ k and, for each {u, v} ∈ E, {u, v} ∩ S ̸= ∅, i.e., at least one endpoint of every edge in E is in S. Since vertex covers are closed under induced subgraphs, our framework introduced in Algorithm 7 is applicable. In the following lemma we introduce a compression routine for k-Vertex Cover. Lemma 7 (Compression for k-VertexCover). Suppose that S is a vertex cover of a graph G = (V,E) with |S| > k, and that G has a vertex k-cover. Then a cover of size at most k can be found by removing some subset R ⊆ S and repairing each uncovered edge by adding N(u) for each u ∈ R. Proof. Assume there is a vertex k-cover S∗ in G and let R := (S \ S∗) ⊆ S. Suppose removing the vertices of R leaves a set F ⊆ E edges uncovered. Let {u, v} ∈ F be an arbitrary uncovered edge, and w.l.o.g., assume u ∈ R. It follows that v ∈ S∗ since S∗ is a cover for G. Thus, N(R) ∪ (S \ R) = S∗, and thus adding the vertices N(u) for each u ∈ R obtains a k-cover for G. In Algorithm 8 we present a jump-and-repair operator for vertex covers that we were able to design using the result of Lemma 7. This operator takes a set selected by xS and executes a focused jump by flipping each element in {i : xS[i] = 1} with 27 probability 1/2, hence choosing a random subset S ′ of the set selected by xS. If S ′ is already a vertex cover, this is returned. Otherwise, the solution S ′ is repaired to be solution feasible by covering all of the uncovered edges with the neighbors of the “removed” elements from xS \S ′. The resulting set is guaranteed to be a vertex cover, which is then returned by the operator. Algorithm 8: JumpAndRepairVC input: A pair of strings x = (xS, xV ) and a graph G /* Jump */ 1 S ′ ← ∅; 2 for i ∈ {1, . . . , n : xS[i] = 1} do 3 with probability 1/2 do S ′ ← S ′ ∪ {i}; 4 if S ′ is a vertex cover for G[xV ] then return (xS′ , xV ); /* Repair */ 5 T ← ∅; 6 for v ∈ xS \ S ′ do 7 T ← T ∪N(v) 8 return (xS′∪T , xV ); Now in the next theorem we are able to show what the expected optimization time is of the (1+1) EAj+rk . Theorem 8. Let G = (V,E) be a graph with a k-vertex cover. Then the expected optimization time of the (1+1) EAj+rk applied to G is bounded by O(2 kn2 log n). Proof. By Lemma 6, a feasible solution is generated after O(n log n) iterations in expectation and infeasible solutions are not accepted thereafter. It remains to bound the time spent on feasible solutions. We consider the potential function ϕ(x) = n− fk(x) and consider the stochastic process (Xt)t≥0 on {0, . . . , n} corresponding to the potential ϕ of the solution generated by the execution of the (1+1) EAj+rk after the first feasible solution is found. 28 Let x = (xS, xV ) be the solution in the t-th feasible iteration of (1+1) EA j+r k . Since we assume x is feasible, |xS| ≤ k and corresponds to a cover on G[xV ]. Denote by Ei the event that mutation changes xV [i] from 0 to 1 and that, after mutation, xS[i] = 1. We argue that Pr(Ei) ≥ (2en)−2. It is sufficient that mutation flips bit i of xV , which happens with probability ( 1− 1 2n )2n−1 ≥ 1 2en . To derive a lower bound on the probability of Ei, we pessimistically assume that xS[i] = 0 which also flips with probability at least 1 2en . After event Ei has occurred, the intermediate offspring corresponds to a new graph G[xV ∪ {i}] and xS ∪ {i} must be a cover, since only vertex i was introduced, and all edges in G[xV ∪{i}] incident to i are covered as i is in the set corresponding to xS∪{i}. Therefore, this string is solution feasible (but not necessarily cardinality feasible). Let J denote the event that the subsequent call to JumpAndRepairVC produces an offspring y that is a valid k-cover of G[yV ]. Since x is feasible, then |xS| ≤ k, and it holds that |xS ∪ {i}| ≤ k + 1. By Lemma 7, there is a k-cover that can be obtained by removing a subset of elements in xS and ensuring that any uncovered edges are covered by neighbors of xS. JumpAndRepairVC selects exactly this subset with probability 2−|S| and so we have Pr(J | Ei) ≥ 2−|S| ≥ 2−(k+1). We now argue that the joint occurrence of events Ei and J results in a feasible offspring y with a strictly larger fitness value. In particular, |xV ∪{i}| = |xV |+1 and so f(y) = f(x)+1 since J ensures that y is feasible. Therefore, the drift conditioned on J ∩ Ei is 1. We can bound the total drift as follows. E[Xt −Xt+1 | Xt = s] ≥ ∑ i:xV [i]=0 Pr(J ∩ Ei) = ∑ i:xV [i]=0 Pr(J | Ei) Pr(Ei) ≥ |{i : xV [i] = 0}| 4e22k+1n2 = Ω ( s 2kn2 ) . 29 Applying Theorem 2 completes the proof. It follows that the (1+1) EAk can be characterized as a Monte Carlo FPT al- gorithm for the k-VertexCover problem on graphs. This also allows for one to develop a strategy to finding the minimum vertex cover of a graph in FPT time. In particular, consider the restart strategy for the (1+1) EAk listed in Algorithm 9. Algorithm 9: Restart framework input: A graph G = (V,E) 1 k ← 1; 2 while a vertex cover for G has not been found do 3 Run the (1+1) EAj+rk on G for 6e 22kn2 lnn steps; 4 k ← k + 1; Theorem 9. Let G = (V,E) be a graph with an optimal vertex cover of size OPT . Then with probability 1 − o(1), the restart framework of the (1+1) EAj+rk in Algo- rithm 9 finds an optimal vertex cover for G within O(2OPTn2 log n) function calls. Proof. Since there are no vertex covers for G of size k < OPT , Algorithm 9 is successful when the run in which k = OPT finds a vertex cover for G. Thus, it suffices to bound the probability of success in this run. Observe that this is equivalent to the probability that an unbounded run would be successful before the cutoff time. Let T denote the random variable that measures the optimization time on an unbounded run of (1+1) EAj+rk with k = OPT , and let T = T1+T2 where T1 measures the steps spent on infeasible solutions and T2 the steps spent on feasible solutions. We thus seek to bound Pr(T1 + T2 < 6e 22OPTn2 lnn). In the proof of Theorem 8, we have bounded the drift in the feasible phase from below as δ ≥ 1/(e22k+1n2). Thus, setting r = 1 2 lnn in the tail bound of Theorem 2, 30 we get Pr(T2 > 6e 22OPT−1n2 lnn) < e−r = 1√ n . Similarly, the drift in the infeasible phase, obtained in the proof of Lemma 6, is δ = Ω(n), and so Pr(T1 > 6e 22OPT−1n2 lnn) < e−Ω(n), and it follows that Pr(T1 + T2 < 6e 22OPTn2 lnn) = 1− o(1). If Algorithm 9 is successful in its k-th run where k = OPT , it spends only a constant fraction of time on too-small vertex covers, and the total optimization time is OPT∑ k=1 6e22kn2 lnn = O(2OPTn2 log n), which completes the proof. Ignoring polynomial factors, this constitutes an upper bound that is smaller by an exponential factor (in OPT ) than the bound presented for global SEMO solving minimum VertexCover, which requires O(n2 log n + OPT · n2 + 4OPTn) fitness evaluations in expectation [31]. 3.1.2 Feedback Vertex Sets in Tournaments A feedback vertex set is a set S ⊆ V that intersects every directed cycle of G (and thus G \ S is transitive). Finding a feedback vertex set of minimum size corresponds to the problem of making an antisymmetric relation transitive by eliminating the smallest possible number of elements. The problem is NP-hard [53, 9], however the 31 parameterized version is in FPT [14]. In the parameterized version of the problem, k-FVST, we are given a tournament G, and the task is to find a feedback vertex set of size at most k. Figure 3.2: A transitive tournament on 8 vertices. Every transitive tournament has a unique topological sort by starting from the vertex with zero in-degree, removing that vertex and recursively sorting the remainder of the graph. Thus we have the following proposition. Proposition 10. If G = (V,E) is a tournament and S ⊆ V is a feedback vertex set of G, then there is a unique sequence on V \ S Sort(G,S) := (v1, v2, . . . , v|V \S|), where (vi, vj) ∈ E ⇐⇒ i < j. Similar to k-VertexCover, we can construct the following compression result for k-FVST. Lemma 11 (Compression for k-FVST). Let G = (V,E) be a tournament. Suppose that S ⊆ V is a feedback vertex set of G (that is, G − S is transitive), and |S| > k. Suppose there exists a feedback vertex set S∗ of G with |S∗| ≤ k. Then a feedback vertex set of size at most k can be found by 32 1. removing a set R of vertices from S, and 2. guessing the correct relative ordering on R in Sort(G,S∗). Proof. Set S ′ = S ∩ S∗ and R = S \ S ′. Let T denote the set of vertices v ̸∈ S that appear in a directed triangle u, v, w where u,w ∈ R. It follows that T ⊆ S∗ because otherwise G− S∗ would not be cycle-free. We seek to find the smallest set that extends S ′ ∪ T but does not overlap with R (note that S∗ is one such extension). Pick an arbitrary W where W ⊇ (S ∪ T ). Note that W is a feedback vertex set since G− S is transitive and W ⊇ S. We construct a sequence σW on V \W . Define a labeling p : V \W → |π| as follows. For each v ∈ V \W , p(v) = min ( {i : (v, πi) ∈ E} ∪ {|π|+ 1} ) . Then for all v, w ∈ V \W we set v ≺σW w ⇐⇒  p(v) < p(w) or, p(v) = p(w),Sort(G,W )[v] < Sort(G,W )[w]. We claim that σW = Sort(G,W ) if and only if W \R is a feedback vertex set for G. First, assume σW = Sort(G,W ). Suppose for contradiction that there is a cycle in G− (W \ R). Then by Proposition 3, there must be a directed triangle u, v, w in G− (W \R). Note that G−W is triangle-free (since W is a feedback vertex set), so there must be at least one vertex of this triangle in R. Furthermore, there can be at most one vertex of this triangle in R since if there were two vertices appearing in R, then the third would be in S ′∪T which do not appear in G− (W \R). Obviously, all three triangle endpoints cannot be in R since S∗ is a feedback vertex set. Without 33 loss of generality, suppose u ∈ R and v, w ̸∈ R with (v, w) ∈ E. Note that u = πi for some i and the triangle is completed by edges (w, πi) ∈ E and (πi, v) ∈ E. Since (w, πi) ∈ E, it follows that i ≥ p(w), and since (πi, v) ∈ E, it follows that i < p(v) and we have p(w) < p(v) and so w appears before v in σW . However, (v, w) ∈ E so v appears before w in Sort(G,W ), which contradicts the assumption the σW = Sort(G,W ). Now suppose σW ̸= Sort(G,W ). Then there must be a pair v, w ∈ V \W with (v, w) ∈ E but p(v) > p(w). This means for i = p(w), we have (w, πi) ∈ E and (v, πi) ̸∈ E. But since G is a tournament, (v, πi) ̸∈ E =⇒ (πi, v) ∈ E. Since πi ∈ R, there is a triangle v, w, πi in G− (W \R). Thus, given S ′ and π, it suffices to find the minimal suchW so that σW is equal to Sort(G,W ). This can be done by computing the sequences for σS∪T and Sort(G,S∪ T ) and adding to S ∪ T the vertices not in a longest common subsequence. Since we know that there exists at least one feedback vertex set S∗ disjoint from R with size at most k, it follows that this method computes a feedback vertex cover with size at most k. T S ′ R S S∗ u vw Figure 3.3: Illustration of possible directed triangles in the proof of the compression lemma for k-FVST (Lemma 11). Using Lemma 11, we are able to design a focused jump-and-repair operation for 34 Algorithm 10: JumpAndRepairFVST input: A pair of strings x = (xS, xV ) and a tournament G /* Jump */ 1 S ′ ← ∅; 2 for i ∈ {1, . . . , n : xS[i] = 1} do 3 with probability 1/2 do S ′ ← S ′ ∪ {i}; 4 R← xS \ S ′; 5 π ← a random permutation of R; 6 if G[xV \ S ′] is transitive then return (xS′ , xV ); /* Repair */ 7 T ← ∅; 8 while ∃ a directed triangle u, v, w s.t. u,w ∈ R and v ̸∈ T do 9 T ← T ∪ {v} 10 for v ∈ xV \ (xS ∪ T ) do 11 p[v] = min({i : (v, πi]) ∈ E} ∪ {|π|+ 1}) 12 σ ← Sort(G, xS ∪ T ); 13 σ′ ← the sequence obtained by sorting xV \ (xS ∪ T ) by p and breaking ties by σ; 14 Z ← vertices in longest common subsequence of σ and σ′; 15 return (xS′∪T∪(xV \Z), xV ); the k-FVST problem (Presented in Algorithm 10). This operation is slightly more complicated than that of the k-VertexCover, since we also guess the correct or- dering on the vertices removed from the current feedback set, and it is necessary to find the longest subsequence common the two arrangements of the remaining ver- tices. The former slows the optimization time by a factor of k!. The latter is not directly reflected in the optimization time, but would incur an extra factor of O(n2) in each repair operation for solving the LongestCommonSubsequence problem using, e.g., dynamic programming [11]. Using this we are able to prove the following Theorem. Theorem 12. Let G = (V,E) be a tournament with a feedback vertex set of size at most k. Then the expected optimization time of the (1+1) EAj+rk applied to G is 35 bounded by O(2kk!n2 log n). Proof. We focus only on the phase of execution that consists of feasible solutions, which occurs after O(n log n) iterations in expectation by Lemma 6. As with the proof of Theorem 8, we bound the drift of the potential function ϕ(x) = n − fk(x) modeled by the stochastic process (Xt)t≥0 which starts after the first feasible iterations, and hits the absorbing state when a feedback vertex set of size at most k on the entire graph G is discovered. Again, assume that x = (xS, xV ) is the candidate solution in the t-th feasible iteration of the (1+1) EAj+rk , and denote Ei as the event that changes xV [i] from 0 to 1 and xS[i] = 1 after mutation. The resulting string is solution feasible, but not necessarily cardinality feasible. Let J be the event conditioned on Ei that the subsequent call to JumpAndRepairVC results in a solution y that is a feedback vertex set of size at most k for the induced subgraph G[yV ]. Since G has a k feedback vertex set, the induced subgraph G[xV ∪ {i}] must also have a k feedback vertex set S∗ ⊆ xV ∪ {i}. With probability 2−(|xS |+1) the jump phase of Algorithm 10 selects S ′ = (xS ∪ {i}) ∩ S∗, and with probability 1(|xS |+1−|S′|)! , it selects the permutation π that corresponds to the order of the vertices in xS \ S ′ in Sort(G[xV ∪ {i}, S∗]. By Lemma 11, if this jump operation was successful, the subsequent repair step produces a feedback vertex set for G[xv ∪ {i}] with size at most k. Since |xS| ≤ k, and |S ′| > 0, we have Pr(J ) ≥ 2 −|xS | (|xS|+ 1− |S ′|)! ≥ 1 2k+1k! . As the resulting offspring y is feasible and |yV | = |xV |+1, it follows that f(y) > f(x). By the same argumentation as in the proof of Theorem 8, Pr(Ei) ≥ (2en)−2, and 36 we bound the total drift as E[Xt −Xt+1 | Xt = s] ≥ ∑ i:xV [i]=0 Pr(J ∩ Ei) = ∑ i:xV [i]=0 Pr(J | Ei) Pr(Ei) ≥ |{i : xV [i] = 0}| 4e22k+1k!n2 = Ω ( s 2kk!n2 ) , and the claim is proved by applying Theorem 2. 3.1.3 Odd Cycle Transversals The final FPT graph problem that we will consider is the k-OddCycleTransversal. This problem has us search an undirected graph G = (V,E) for a set S ⊆ V such that G[V \ S] is bipartite. We layout a jump-and-repair operation in Algorithm 11. Given a solution feasible odd cycle transversal that is not cardinality feasible, the jump procedure randomly removes some vertices from the transversal, and then guesses a bipartition of these removed vertices. The repair procedure, outlined in Algorithm 12, takes advantage of a result presented by Lokshtanov et al. in [35] that relates two odd cycle transversals in a graph to a vertex cut separating certain sets. After the jump, the repair attempts to judiciously remove any odd cycles exposed by calculating a vertex cut in the remainder of the graph. The following lemma demonstrates the repair performed by Algorithm 12 is suc- cessful if the jump is successful. The proof is similar to the proofs of Lemmas 3.2 and 3.3 of Lokshtanov et al. [35]. Lemma 13 (Compression for k-OddCycleTransversal). Let S be an odd cycle 37 Algorithm 11: JumpAndRepairOCT input: A pair of strings x = (xS, xV ) and a graph G /* Jump */ 1 S ′ ← ∅; 2 for i ∈ {1, . . . , n : xS[i] = 1} do 3 with probability 1/3 do S ′ ← S ′ ∪ {i}; 4 if S ′ is an odd cycle transversal for G[xV ] then return (xS′ , xV ); /* Repair */ 5 R← xS \ S ′; 6 Select a bipartition A,B of R uniformly at random; 7 Calculate T using repair Algorithm 12 called with inputs G[xV ], S,R, A, B; 8 return (xS′∪T , xV ); Algorithm 12: Repair odd cycles that are exposed by removing R from S input: A graph G = (V,E) input: An odd cycle transversal S of G input: A set R ⊆ S vertices to remove from S input: A partition A,B of R 1 Let C and D be the bipartition of G[V \ S]; 2 Find a minimal vertex cut T in G[V \ S] separating (C ∩N(A)) ∪ (D ∩N(B)) and (C ∩N(B)) ∪ (D ∩N(A)); 3 return T ; A B S ′ C ∩N(A) C ∩N(B) D ∩N(A) D ∩N(B) S S∗ T C D Figure 3.4: Illustration of partitions of G = (V,E) in the proof of the compression lemma for k-OddCycleTransversal (Lemma 13). 38 transversal of a graph G = (V,E) with |S| > k, and suppose there exists an odd cycle transversal S∗ of G with |S∗| < k. Then an odd cycle transversal of size at most k can be found by 1. removing a set R of vertices from S, 2. guessing the correct partition A ⊎ B = R that splits R correctly into the bipar- tition of G induced by S∗, and 3. running Algorithm 12 on these sets to repair any odd cycles exposed by removing R. Proof. Let S∗ be an odd cycle transversal of size at most k in G. Then G[V \ S∗] is bipartite with bipartition VA, VB. We prove that if we remove the vertex set R := S \ S∗, then calling Algorithm 12 with sets R, A := R ∩ VA and B := R ∩ VB results in an odd cycle transversal of size at most k. Let S ′ = S ∩ S∗ and C,D ⊆ V \ S be the bipartition formed by S (see Figure 3.4 for an illustration). We first argue that T := S∗ \ S is a vertex cut that separates (C ∩N(A)) ∪ (D ∩ N(B)) and (C ∩N(B)) ∪ (D ∩N(A)) in the graph G[V \ S]. Let P be a path in G[V \ (S ∪T )]. We show that P cannot connect (C ∩N(A))∪ (D ∩ N(B)) and (C ∩ N(B)) ∪ (D ∩ N(A)) in G[V \ S] by ruling out the possible endpoints of P . Let (u, . . . , v) be the sequence of vertices associated with P . Case 1: Suppose u ∈ C ∩ N(A) and v ∈ C ∩ N(B). Since u, v ∈ C and C is an independent set in G[V \ (S ∪ T )], P must contain an odd number of vertices. Furthermore, u has a neighbor a ∈ A and v has a neighbor b ∈ B. Thus replacing the vertices in S \ S∗ would allow us to construct the path P ′ = (a, u, . . . , v, b) in G[V \ S∗], also with an odd number of vertices. Since P ′ also has an odd vertex count, there is no proper 2-coloring of P ′ that places its 39 endpoints in opposite color classes. But a ∈ A ⊆ VA and b ∈ B ⊆ VB, so this contradicts the fact that G[V \ S∗] is bipartite with bipartition VA and VB. Case 2: Suppose u ∈ C ∩ N(A), v ∈ D ∩ N(A). Since u ∈ C and v ∈ D, then P has an even count of vertices. Note that both u and v have neighbors a, a′ ∈ A (respectively), so replacing the vertices in S \S∗ would allow us to construct an even-length path P ′ = (a, u, . . . , v, a′) in G[V \S∗]. Since P ′ has an even vertex count, there is no proper 2-coloring of P ′ that places its endpoints in the same color class. Again, this contradicts the fact that S∗ is an odd-cycle transversal as described above. Case 3: If u ∈ D ∩ N(B), v ∈ C ∩ N(B), then P cannot exist by a symmetric argument to Case 2, swapping C with D and A with B. Case 4: If u ∈ D ∩ N(B), v ∈ D ∩ N(A), then P cannot exist by a symmetric argument to Case 1, swapping C with D and A with B. Thus, no path exists in G[V \ (S ∪ T )] between the sets (C ∩ N(A)) ∪ (D ∩ N(B)) and (C ∩ N(B)) ∪ (D ∩ N(A)) and therefore T is a vertex cut separating these two sets in G[V \ S]. Finally, we argue that if T ′ is any vertex cut separating (C ∩N(A))∪ (D∩N(B)) and (C ∩N(B)) ∪ (D ∩N(A)) in G[V \ S], then T ′ ∪ S ′ is an odd cycle transversal in G. Let Q be an arbitrary cycle in G[V \ (T ′ ∪ S ′)] and denote as (e1, e2, . . . , eq) the sequence of edges in the order they appear in Q. It suffices to show that q must be even. If Q has no vertices in A ∪B, then q must be even since S = A ∪B ∪ S ′ is an odd cycle transversal. On the other hand, suppose Q has at least one point in A∪B. We call an edge ei internal if ei ∩ (A ∪B) = ei, and external if ei ∩ (A ∪B) = ∅. 40 Similarly, we call a path an internal path when it is comprised only of internal edges. An external path is an edge sequence (ei, . . . , ej) where |ei∩(A∪B)| = |ej∩(A∪B)| = 1, and the remaining edges are external. Note that Q can always be decomposed into a sequence of internal and external paths where |e1 ∩ eq| = 1. SinceG[A] andG[B] are independent sets, every internal path with both endpoints in A or both endpoints in B has an even number of edges, and every internal path with one endpoint in A and one endpoint in B has an odd number of edges. Now consider an external path P in G[V \ (S ∪ T ′)] with endpoints u and v. If u, v ∈ A then the vertex sequence for P is (u,w, . . . , y, v) for some w, y ∈ N(A). But T ′ is a vertex cut separating C ∩N(A) and D∩N(A), so there are no paths between these sets, so either w, y ∈ C or w, y ∈ D. In either case, the subpath from w to y has an even count of edges in G[V \ (S ∪ T ′)] since C and D is a bipartition induced by the odd cycle transversal S. A similar argument shows that external paths with both endpoints in B must have even length, and one endpoint in A and one in B must have odd length. Now let Q = (P1, P2, . . . , Pℓ) be a decomposition of Q into internal and external paths. Each path Pi with endpoints in A and B must have a return path Pi with endpoints in B and A. Therefore, there must be an even number of paths Pi with an odd count of edges, and so Q must contain an even count of edges. To complete the proof, we see that Algorithm 12 finds a minimal vertex cut T ′ that separates (C∩N(A))∪(D∩N(B)) and (C∩N(B))∪(D∩N(A)), and |T ′| ≤ |T | since T must also be such a vertex cut. As |S ′ ∪ T | ≤ k, it follows that S ′ ∪ T ′ is an odd cycle transversal of G with size at most k. The repair procedure of Algorithm 12 finds a minimum vertex separator between two vertex sets, which can be done in O(nm) time by computing the maximum flow 41 in the appropriate network, e.g., by the Ford-Fulkerson method [11]. If we insist on a vertex separator on size at most k, the running time bound can be improved to O(km), as the existence of a separator of this size can be decided, and subsequently constructed, within this bound. In the case that a vertex separator of size at most k cannot be found, the repair procedure can return an arbitrary set (e.g., R), as the repair would have failed. We also point out a small deviation from the general JumpAndRepair procedure listed in Algorithm 7. In line 3 of Algorithm 11, we only select a vertex for S ′ with probability 1/3 rather than 1/2, which ultimately improves the bound by a (1.33)k factor. Theorem 14. Let G = (V,E) be a graph where |V | = n, |E| = m and G con- tains an odd cycle transversal of size k. Then the expected optimization time of the (1+1) EAj+rk applied to G is bounded by O(3 kkmn2 log n). Proof. Again by Lemma 6, after O(n log n) iterations in expectation, all subsequent solutions are feasible. Let x = (xS, xV ) be a feasible solution, that is, |xS| ≤ k and corresponds to an odd cycle transversal of G[xV ]. Let i ∈ {j : xV [j] = 0} and let Ei denote the event that after mutation xS[i] = xV [i] = 1 and no other bits have changed. Clearly, xS ∪ {i} is also an odd cycle transversal of G[xV ∪ {i}] so the resulting offspring conditioned on Ei must be solution feasible, but not necessarily cardinality feasible. In the latter case, note that we have assumed that G admits an odd cycle transversal S∗ ⊆ V where |S∗| ≤ k. Since odd cycle transversals are closed under induced subgraphs, (xV ∪ {i}) ∩ S∗ is an odd cycle transversal of G[xV ∪ {i}]. Let J1 be the event that in the jump phase of Algorithm 11, the process chooses to flip to zero the set of bits R = (xS ∪ {i}) \ S∗, hence keeping the set S ′ = 42 ((xV ∪ {i}) ∩ S∗) ∩ (xS ∪ {i}) set to 1. Let J2 be the event that R is partitioned exactly into the sets A,B which are each respectively a subset of a bipartition induced by (xV ∪ {i}) ∩ S∗. Then Pr(J1) = (2/3)|R|(1/3)|S′| and Pr(J2 | J1) = (1/2)|R|, so we have Pr(J1 ∩ J2) = (1/3)|R|+|S′| ≥ 3−(k+1). Under this joint event, by Lemma 13, Algorithm 12 must return an odd cycle transversal for xV ∪ {i} of size at most k. Arguing in the same way as with the proofs of Theorems 8 and 12, we may set up a potential function with multiplicative drift E[Xt −Xt+1 | Xt = s] ≥ ∑ i:xV [i]=0 Pr(J1 ∩ J2 ∩ Ei) ≥ |{i : xV [i] = 0}| 4e23k+1n2 = Ω ( s 3kn2 ) . Applying Theorem 2, the expected number of iterations of the (1+1) EAj+rk until an odd cycle transversal is generated for the entire graph G is O(3kn2 log n). Since the repair procedure of Algorithm 12 costs O(km), we obtain the claimed result. 3.1.4 Experiments To interpret the concrete running time of (1+1) EAj+rk on k-VertexCover instances as a function of both n and k and to estimate the magnitude of hid- den constants, we performed a number of experiments on different instances of k- VertexCover. In order to maintain experimental control over both n and k, we created three graph classes: random planted, clique/anticlique and biclique. In the random planted class, instances are randomly generated by drawing each graph from a planted version of the standard Erdo˝s-Re´nyi random graph model in which a k-cover is “planted” into the graph and edges are selected for inclusion with fixed probability p subject to having an end point in the planted cover. In particular, each random graph on n vertices with a planted k-cover was generated by first randomly choosing k vertices for the vertex cover and then iterating over each vertex pair vi, vj, such that 43 i ∈ {1, . . . , k} and j ∈ {k+1, . . . , n}, and adding the edge (vi, vj) with probability p. In the limiting case of p = 1 we obtain a (nonrandom) clique/anticlique instance comprised of a k-clique fully connected to an n − k anticlique, that is, every vertex in the k-clique is connected to every other vertex in the graph, and the remaining vertices are connected to every vertex in the k-clique (but not each other). Finally, we also investigate bicliques Kk,n−k, that is, complete bipartite graphs with k vertices in one of the partitions. We generated graph instances using values of n ∈ {20, 30, . . . , 100} and k ∈ {3, 4, . . . , 8}. For the nonrandom graphs (clique/anticlique and biclique) we generated an instance for each n and k, resulting in 54 instances each of the two classes. For the random graphs, we also controlled for edge density using p ∈ {0.1, 0.25, 0.5, 0.75}, and for each value of n, k and p, we generated 10 separate random graph instances, resulting in 2160 total random graph instances. On each instance, we measured the mean (and standard deviation) of the iterations required for the (1+1) EAk to find a k-cover for the entire graph over 100 runs per instance. In Figure 3.5, we plot the mean running time as a function of n for the random planted instances grouped by edge density p. The shaded bands represent the standard deviation from the mean. To compare the empirical running time to the bound proved in Theorem 8 and assess the magnitude of hidden constants, we plot the mean running time normalized by n2 lnn in Figures 3.7 and 3.8. The results here suggest that not only is the normalized running time is bounded above by a fixed value that depends only on k, as predicted, but also that the bound in Theorem 8 may be too large, at least for these instances. In order to observe the dependence of the running time on k for fixed n, we plot the mean running time (again normalized by n2 lnn) as a function of k in Figures 3.9 44 20 40 60 80 100 0 2 4 6 8 ·104 n m ea n ru n ti m e k = 8 k = 6 k = 4 (a) p = 0.1 20 40 60 80 100 0 2 4 6 8 ·104 n m ea n ru n ti m e k = 8 k = 6 k = 4 (b) p = 0.25 20 40 60 80 100 0 2 4 6 8 ·104 n m ea n ru n ti m e k = 8 k = 6 k = 4 (c) p = 0.5 20 40 60 80 100 0 2 4 6 8 ·104 n m ea n ru n ti m e k = 8 k = 6 k = 4 (d) p = 0.75 Figure 3.5: Mean running time of (1+1) EAk on random planted vertex cover in- stances. Shaded bands represent standard deviation. and 3.10. We compare these results with a plot of 2k/100 and observe that the growth with k appears to be even subexponential. The constant 1/100 is used as a scaling factor for the comparison. We conjecture that these graph instances are relatively easy to solve once a k-cover in an induced subgraph has been identified. 45 20 40 60 80 100 0 2 4 6 8 ·104 n m ea n ru n ti m e k = 8 k = 6 k = 4 (a) clique/anticlique 20 40 60 80 100 0 2 4 6 8 ·104 n m ea n ru n ti m e k = 8 k = 6 k = 4 (b) biclique Figure 3.6: Mean running time of (1+1) EAk on vertex cover instances. Shaded bands represent standard deviation. 46 20 40 60 80 100 0 1 2 3 n m ea n ru n ti m e/ (n 2 ln n ) k = 8 k = 7 k = 6 k = 5 k = 4 k = 3 (a) p = 0.1 20 40 60 80 100 0 1 2 3 n k = 8 k = 7 k = 6 k = 5 k = 4 k = 3 (b) p = 0.25 20 40 60 80 100 0 1 2 3 n m ea n ru n ti m e/ (n 2 ln n ) k = 8 k = 7 k = 6 k = 5 k = 4 k = 3 (c) p = 0.5 20 40 60 80 100 0 1 2 3 n k = 8 k = 7 k = 6 k = 5 k = 4 k = 3 (d) p = 0.75 Figure 3.7: Mean running time of (1+1) EAk divided by n2 lnn on random planted vertex cover instances. 47 20 40 60 80 100 0 1 2 3 n m ea n ru n ti m e/ (n 2 ln n ) k = 8 k = 7 k = 6 k = 5 k = 4 k = 3 (a) clique/anticlique 20 40 60 80 100 0 1 2 3 n k = 8 k = 7 k = 6 k = 5 k = 4 k = 3 (b) biclique Figure 3.8: Mean run time divided by n lnn of (1+1) EAk on vertex cover instances. 48 3 4 5 6 7 8 0 0.5 1 1.5 2 2.5 k m ea n ru n ti m e/ (n 2 ln n ) n = 100 n = 90 n = 80 n = 70 n = 60 n = 50 2k/100 (a) p = 0.1 3 4 5 6 7 8 0 0.5 1 1.5 2 2.5 k n = 100 n = 90 n = 80 n = 70 n = 60 n = 50 2k/100 (b) p = 0.25 3 4 5 6 7 8 0 0.5 1 1.5 2 2.5 k m ea n ru n ti m e/ (n ln n ) n = 100 n = 90 n = 80 n = 70 n = 60 n = 50 2k/100 (c) p = 0.5 3 4 5 6 7 8 0 0.5 1 1.5 2 2.5 k n = 100 n = 90 n = 80 n = 70 n = 60 n = 50 2k/100 (d) p = 0.75 Figure 3.9: Mean run time of (1+1) EAk divided by n2 lnn on random planted vertex cover instances as a function of k. 49 3 4 5 6 7 8 0 0.5 1 1.5 2 2.5 k m ea n ru n ti m e/ (n ln n ) n = 100 n = 90 n = 80 n = 70 n = 60 n = 50 2k/100 (a) clique/anticlique 3 4 5 6 7 8 0 0.5 1 1.5 2 2.5 k n = 100 n = 90 n = 80 n = 70 n = 60 n = 50 2k/100 (b) biclique Figure 3.10: Mean run time of (1+1) EAk divided by n2 lnn on vertex cover instances as a function of k. 50 3.2 Evolving Graph Labelings Similar to the work on local search for graceful tree labelings [2], we pose the problem of finding graceful labelings as an instance of combinatorial optimization in which the permutations of vertex labelings are searched to maximize the number of unique induced edge labels. Let G = (V,E) be a graph on n vertices and m edges. We represent a candidate labeling for V as a linear ordering on [m] ∪ {0}, i.e., an ordered sequence x = (x1x2 . . . xm+1) of distinct integers. A linear ordering x induces a labeling on V by assigning the label xi to vertex vi ∈ V . When m > n − 1, the labels in xk for k > n are unused, and do not contribute to the labeling. For a given candidate solution x we define the sequence of label counts Li(x), i = 1, 2 . . . ,m as Li(x) = |{uv ∈ E : |xu − xv| = i}|. We define the following sequence of indicator functions χi(x) =  1 if Li(x) = 1, 0 otherwise. By the definition of a graceful labeling we know that the candidate solution x corre- sponds to a graceful labeling if and only if for all i ∈ [m], Li(x) = χi(x) = 1. We define a weighted fitness function f(x) = m∑ i=1 wiχi(x). (3.2) We will consider the following three weight variants, funit, in which wi = 1 for all i ∈ [m], flin, in which wi = i for all i ∈ [m] and fexp, in which wi = 2i−1 for all 51 i ∈ [m]. We point out that flin is similar to the evaluation used in the local search algorithm of Fang [18], but our version further penalizes non-unique induced edge labelings. Algorithm 13: (1+1) EA input: A graph G = (V,E) on n vertices and m edges; A fitness function f ∈ {funit, flin, fexp}; 1 x← (0, 1, . . . ,m) 2 randomly shuffle x 3 while stopping condition not met do 4 y ← x 5 Draw r from a Poisson distribution with λ = 1 6 for i← 1 to r do 7 randomly exchange two elements of y 8 if f(y) ≥ f(x) then x← y Due to local graph symmetries, the fitness landscape of the graceful labeling prob- lem can contain local optima that are difficult to escape. One possible way of over- coming these local symmetries is by fixing a small partial labeling and searching over labelings of the remaining graph. In fact, this procedure can be used as a subroutine for systematic search by exhaustively iterating over small partial labelings. For ex- ample, this is exactly the approach taken for graceful labeling of trees by Fang [18] in the context of backtracking search. For backtracking methods, this is a method of symmetry breaking in which one would like to constrain the search to a subset of the search space by adding a symmetry-breaking constraint that forces a vertex to a certain label. This and other symmetry-breaking techniques have been shown to be effective for constraint programming search with graceful labeling by Petrie and Smith [45]. This kind of constraint symmetry breaking can also be useful for hill climbing algorithms because it can result in a more favorably structured fitness landscape [46]. 52 We implement such a framework in Algorithm 14, which takes a small constant set of labels, and systematically iterates over all partial labelings and calls as a subroutine a slightly modified EA (the (1+1) EAℓ′ , listed in Algorithm 15) with a predefined budget to search over the remaining labels. In line 8 of Algorithm 15, we denote ℓ′⊙ x as the vertex labeling of V obtained by joining the partial labeling ℓ′ of V ′ and the partial labeling x of V \ V ′. Algorithm 14: Symmetry-breaking restart framework for the (1+1) EA input: A graph G = (V,E); A fitness function f ∈ {funit, flin, fexp}; A set L of c = O(1) labels; A runtime budget b; 1 for each subset S ⊂ V of size c do 2 for each partial labeling ℓ′ : S → L do 3 Run the (1+1) EAℓ′ on G for b steps with partial labeling ℓ ′ Algorithm 15: (1+1) EAℓ′ input: A graph G = (V,E) on n vertices and m edges; A fitness function f ∈ {funit, flin, fexp}; A partial labeling ℓ′ assigning labels to V ′ ⊂ V ; // x is a partial labeling of the induced subgraph G[V \ V ′] 1 x← a sequence on {0, 1, . . . ,m} \ ℓ′[V ′]; 2 randomly shuffle x; 3 while stopping condition not met do 4 y ← x; 5 Draw r from a Poisson distribution with λ = 1; 6 for i← 1 to r do 7 randomly exchange two elements of y; 8 if f(ℓ′ ⊙ y) ≥ f(ℓ′ ⊙ x) then x← y; For the following classes of graphs that we consider, there is a known partial labeling that will result in a graceful labeling, so it is enough to prove runtime re- sults for Algorithm 15. In the general case of the problem, a partial labeling may 53 not be known. We use the following proposition to connect the runtime results on Algorithm 15 to the general framework of Algorithm 14. Proposition 15. Let g(n) be an asymptotically positive function of n, c > 0 be a constant, and G = (V,E) be a graceful graph with |V | = n vertices. Let L be a list of at most c labels from {0, . . . ,m}. Suppose that the (1+1) EAℓ′ has an expected runtime of at most T on G using some partial labeling ℓ′ : V ′ → L. Then Algorithm 14 with a budget of b = g(n) · T finds a graceful labeling for G with probability 1 − 1/g(n) using at most nc · g(n) · T fitness evaluations. Proof. By Markov’s inequality, the (1+1) EAℓ′ with partial labeling ℓ ′ is successful within g(n)·T iterations with probability at least 1−1/g(n). Pessimistically assuming that Algorithm 14 fails when checking every other n!/(n−c)!−1 ≤ nc partial labeling from L. The total number of fitness evaluations is thus at most b · nc. When proved rigorous runtime results on the running time of the (1+1) EA when ran on various classes of graceful graphs. We find polynomial time upper bounds for the (1+1) EA (or Algorithm 15) for all the classes of graphs that we consider. 3.2.1 Star Graphs The first graph that we consider is the star graph (see Figure 3.11), or the complete bipartite graph K1,m, a tree on n nodes and m edges with a single internal node of degree m and m leaves. Graceful Labelings of stars are simple to find since any time the graph has the internal node labeled with 0 orm, it is graceful. Because of this fact every non-graceful vertex labeling neighbors a global optimum, making it particularly easy for the (1+1) EA to find a solution. 54 12 3 4 5 6 7 0 1 2 3 4 5 6 7 Figure 3.11: Star graph K1,7 with a graceful labeling. Theorem 16. Using any of the fitness functions funit, flin or fexp, the expected number of fitness evaluations until the (1+1) EA finds a graceful labeling for K1,n−1 is at most e(n2 − n)/4. Proof. Given an arbitrary permutation x of vertex labels from [n − 1] ∪ {0}, we assume without loss of generality, that x1 corresponds to the label of the vertex of degree n− 1 (the center vertex). The induced label on each of the n− 1 edges of the graph is |xi − x1| for i ∈ [n] \ {1}. If 0 < x1 < n − 1, then there will be at least two edges with the induced label z = |(x1 + 1) − x1| = |(x1 − 1) − x1|, and hence χz(x) = 0. Otherwise, each edge label will be unique. Hence, the labeling corresponding to x is graceful if and only if x1 = 0 or x1 = n− 1. For any f ∈ {funit, flin, fexp}, f(x) = ∑m i=1wi if and only if x corresponds to a graceful labeling, otherwise, f(x) < ∑m i=1wi. Thus, if mutation produces a graceful labeling y in line 7 of Algorithm 13, it will always be accepted. We bound the probability that mutation produces a graceful labeling. If x is not graceful, there is a pair of indices k, ℓ ∈ [n] \ {1} where xk = 0 and xℓ = n− 1. Let E denote the event that r = 1 in line 5 of Algorithm 13 and the single swap exchanges 1 with k or ℓ. Then, Pr(E) = 4Pr (Pois(λ = 1) = 1) n(n− 1) = 4 en(n− 1) . 55 Event E is sufficient to produce a graceful labeling from an arbitrary labeling x, so the probability that a graceful labeling is produced in any iteration from a non- is at least Pr(E). The waiting time until a graceful labeling is found is geometrically distributed with expectation at most Pr(E)−1, completing the proof. 3.2.2 Bicliques A biclique is a bipartite graph G = (V,E) where A,B is a partition of V such that A and B are independent sets, and ab ∈ E for every a ∈ A and b ∈ B. We consider highly unbalanced bicliques that have one partition with a small constant size. It is likely that there are many different graceful labelings for bicliques, but in the following lemma we prove that a partial labeling of the small partition gives us a fitness landscape with a unique local optimum. We start by considering the biclique K2,n−2 where |A| = 2 and |B| = n− 2. Lemma 17. Consider the biclique K2,n−2 with independent sets A,B. Let ℓ be any labeling such that a1, a2 ∈ A are ℓ(a1) = 0, ℓ(a2) = n−2. Then ℓ is a graceful labeling if and only if ℓ[B] = {m 2 + 1, m 2 + 2, . . . ,m } . Proof. Let ℓ be such a labeling. Denote bi ∈ B such that ℓ(bi) = m/2 + i. An edge incident on a1 has induced label |0−ℓ(bi)| = ℓ(bi) = m/2+i, whereas an edge incident on a2 has induced label |(n − 2) − ℓ(bi)| = m/2 − (m/2 + i) = i since m/2 = n − 1. Thus each of the m edges has a unique induced label from {1, 2, . . . ,m}. On the other hand, let ℓ be a graceful labeling with ℓ(a1) = 0 and ℓ(a2) = n− 2. Suppose for contradiction that there exists a j ∈ ℓ[B] where j ≤ m/2. Let b ∈ B be the vertex such that ℓ(b) = j. Note that under these conditions 0 < j < m/2, otherwise the uniqueness of vertex labels is violated. Since the induced label on edge 56 a1b is j, there cannot exist a b ′ ∈ B with ℓ(b′) = m/2+ j, otherwise the induced edge label on a2b ′ would be |n − 2 − (m/2 + j)| = j, and therefore not unique. But this implies no edge has induced label m/2 + j, contradicting the assumption that ℓ is a graceful labeling. Using this lemma we prove the following bound on the runtime of the (1+1) EAℓ′ on K2,n−2 when we give the smaller partition a partial labeling. Theorem 18. Consider K2,n−2 with independent sets A and B such that |A| = 2. Let ℓ′(a1) = 0, ℓ′(a2) = n− 2 be a partial labeling of the vertices in A. Then the expected time of the (1+1) EAℓ′ to find a graceful labeling of K2,n−2 using any of the fitness functions {funit, flin, fexp} is O(n2 log n). Proof. We define a potential function on sequences (x1, x2, . . . , xm−1) of the label set {0, 1, . . . ,m} \ ℓ′[A] = {1, 2, . . . , n− 3, n− 1, . . . ,m} as follows. For a given sequence x, define ϕ(x) = m− m∑ i=1 χi(x) = m− flin(x). Clearly ϕ(x) = 0 if and only if x corresponds to a graceful labeling. Note that ϕ counts the number of edges where the induced label either (1) does not appear in the labeling corresponding to x, or (2) appears more than once in the labeling corresponding to x. We call an edge with a nonunique label bad. Let ϕ(x) > 0. Since every edge has an induced label, there must exist bad edges in the labeling corresponding to x. We argue that every bad edge either 1. Is incident to a vertex b ∈ B with ℓ(b) < m/2, or 2. Implies the existence of another bad edge incident to a vertex b ∈ B with ℓ(b) < m/2. 57 To see this, consider the latter case in which a bad edge e is incident to a vertex b′ ∈ B such that ℓ(b′) > m/2. Let k := m/2− ℓ(b′). It follows that e must be incident to a2 since the induced label on the edge a1b is m/2 + k, which cannot be doubled (and so χm/2+k(x) = 1). Therefore, since e is bad and incident to a2, the induced edge label on e is |(n− 2)− (m/2 + k)| = k and it must be the case that χk(x) = 0. But if e is bad, there must be another edge e′ = a′b′ with induced label k, and this would imply ℓ(b′) = k < m/2. We now show that for each bad edge, we can either repair it or its pair. Let e be a bad edge incident to a vertex b ∈ B, and by the above argument, suppose that ℓ(b) < m/2. Let e1 = a1b and e2 = a2b (hence e ∈ {e1, e2}). Furthermore, let xi = ℓ(b) be the component of x corresponding to b’s label. We make the following case distinction. Case 1. Only one of the induced labels on e1 and e2 is unique. Let the unique induced label be k, i.e., χk(x) = 1. This implies that there is no vertex b ′ such that ℓ(b′) = m/2 + k, otherwise a2b′ would also have induced label k. So the label xj = m/2 + k lies at index j > n − 2. Swapping xi and xj would thus increase χm/2+k(x) from zero to one, whereas χk(x) would remain at one. Since only the edge labeled k was unique, it follows that this swap gains a unique edge, and thus decreases the potential ϕ by one. Case 2. Neither of the induced labels on e1 and e2 are unique. This implies the existence of an xj with j > n− 2 and xj > m/2. Since no edge can have label xj, swapping xi and xj would gain at least one unique induced label, hence decreasing the potential by at least one. Consider the stochastic process (Xt)t∈N where Xt corresponds to the value of the potential function ϕ applied to the candidate solution in the t-th step of the algorithm. 58 As argued above, if a candidate solution x produces ϕ(x) bad edges, at least ϕ(x)/2 are incident to vertex b ∈ B such that ℓ(b) < m/2, and this vertex corresponds to a unique element xi = ℓ(b) where i ≤ n− 2. In either case above, there exists at least one element xj with j > n− 2 such that swapping xi and xj reduces the potential ϕ by at least one. Furthermore, since each fitness function {funit, flin, fexp} is inverseely proportional to ϕ, a strict reduction in ϕ corresponds to a strict increase in the fitness, and the resulting point is accepted. We therefore have E[Xt−Xt+1 | Xt] ≥ 2n(n−1) · Xt2 , and the claimed runtime bound follows by Theorem 2. Now that we have seen a runtime bound when we the partial labeling completely labels the smaller partiton, it is natural to ask if it is possible to find a similar bound when we run the (1+1) EAℓ′ with only a single labeled vertex. The following theorem shows that the O(n2 log n) bound of Theorem 18 is still possible, but now only as a constant-probability tail bound. We may still bound the expectation by a cubic term. Theorem 19. Consider the biclique K2,n−2 with independent sets A and B such that |A| = 2. Let ℓ′(a1) = n − 2 be a partial labeling of the vertices in A. Then the (1+1) EAℓ′ using partial labeling ℓ ′ has a runtime of O(n2 log n) with constant probability, and O(n3) in expectation. Proof. Let X be the set of all sequences on {0, 1, . . . ,m} \ {n− 2} and let X ′ ⊂ X be the set of sequences {x ∈ X : χm(x) = 1}. Clearly, for all x, y ∈ X, if x ∈ X ′ and y ∈ X \X ′, then fexp(x) ≥ 2m > 2m − 1 ≥ fexp(y). Thus, once a sequence from X ′ is generated by the (1+1) EAℓ′ , no sequence from X \ X ′ will be subsequently selected. After this, we may apply the same drift arguments from Theorem 18 (using bad edges adjacent to labels larger than m/2 if ℓ(a2) = m instead of zero). A sequence x ∈ X ′ if and only if the indexes containing 0 and m are both at most n − 1. Since in this case, |x| = m = 2(n − 2), with probability at least 1/4 59 the initial solution is already in x′, and applying Markov’s inequality to the result of Theorem 18 yields the claimed tail bound. Otherwise, we bound the time until a sequence from X ′ is first generated. Let x ∈ X \X ′ be an arbitrary point, and let xi = 0 and xj = m. Without loss of generality, assume that x1 corresponds to the label on vertex a1. We also pessimistically assume that both i, j > n− 1. To produce an offspring y ∈ X ′, it suffices to swap xi with x1 and xj with xj′ for any j ′ ≤ n− 1. The probability of such a mutation is at least Pr (Pois(λ = 1) = 2)( n−1 2 ) · n− 1(n−1 2 ) = 4 e2n2(n− 1) = Ω ( 1 n3 ) . The waiting time until such a mutation occurs is distributed geometrically with ex- pectation O(n3), after which the proof of Theorem 18 can be applied. The cubic term dominates the expectation. Theorem 18 can be generalized to all bicliques given a fixed partial labeling of one of the partitions. Theorem 20. Let Kc,n−c be a biclique with partitions A,B such that a1, a2, . . . , ac ∈ A and ℓ′(ai) = (i− 1)(n− c) for all i ∈ {1, . . . , c} be the partial labeling of the partition A. Then using any of the fitness functions funit, flin, or fexp, the expected runtime of the (1+1) EAℓ′ is bounded above by O(n 2 log n). 3.2.3 Paths A path Pn is a graph on V = v1, v2, . . . , vn where vi and vi+1 are adjacent for all i ∈ {1, 2, . . . , n − 1}. Unlike star graphs, the structure of paths causes local sym- metries that do not translate to global symmetries. This leads to synchronization problems where all the local symmetries need to agree on an optimal solution. Syn- 60 0 9 1 8 2 7 3 6 4 5 9 8 7 6 5 4 3 2 1 8 1 9 0 6 3 7 2 4 5 7 8 9 6 3 4 5 2 1 0 9 1 8 6 3 7 2 4 5 9 8 7 2 3 4 5 2 1 Figure 3.12: Path graph P10 with two graceful labelings (top and middle) and a non-graceful labeling (bottom) that contains local symmetries from each. chronization problems are well-studied for pseudo-Boolean problems, particularly in the case of so-called spin-flip symmetry DBLP:journals/ec/HoyweghenNG02. Fig- ure 3.12 demonstrates these local symmetries. To counter this, we fix a single end vertex with the label zero in a partial labeling and run the (1+1) EAℓ′ . This breaks the symmetry and guarantees that there is only a single global optimum, proven in Lemma 21. Note that in general, the number of optimal solutions for Pn is Ω(2.37 n) [1]. It simplifies the proofs greatly to fix a degree-one vertex, but we point out that Proposition 15 ensures that Algorithm 14 adds at most a linear factor overhead to systematically search each vertex to fix to zero. Lemma 21. Let Pn be the path on n vertices v1, v2, . . . , vn where vi is adjacent to vi+1 for all i ∈ [n − 1]. Then there is exactly one unique graceful labeling with ℓ(v1) = 0. Moreover, this labeling corresponds to the vertex label sequence x∗ = (0,m, 1,m− 1, 2,m− 2, . . . , ⌈m/2⌉), where m = n− 1 is the number of edges in Pn. Proof. Let e1, e2, . . . , em be the sequence of edges in Pn such that ei = vivi+1. The induced edge label of ei under x ∗ is m+1− i, so each edge label in 1, 2, . . . ,m appears exactly once. Thus, x∗ corresponds to a graceful labeling. Uniqueness is proved by 61 the following inductive argument. If ℓ(v1) = 0 is fixed, then there is only one valid position for m, and that is adjacent to the zero in the first position (otherwise, the induced edge label m would not appear). Therefore, m must appear in this position in every graceful labeling. Consider now the subsequence x∗[1 . . k] where k ≥ 1. Suppose for induction that every global optimum is identical to x∗ in the first k elements. By construction, the element in the k-th position of x∗ is either ⌊k−1 2 ⌋ or m− ⌊k−1 2 ⌋. Again, the only way for the induced edge label m− 2⌊(k − 1)/2⌋ − 1 to appear is for the vertex label m− (⌊k−1 2 ⌋+ 1), or respectively, (⌊k−1 2 ⌋+ 1) to appear in the next position of the sequence. But this is equal to x∗k+1. Thus x ∗ is unique. Unfortunately, partial labeling is not enough to overcome problems that arise from symmetry. We now prove the existence of local optima under the funit function that are difficult to escape. Proposition 22. Let Pn be the path on n vertices with m = n−1 edges. The sequence xloc := (0,m− 1, 1,m− 2, 2, . . . , ⌊m/2⌋,m) is a (nonstrict) local optimum in the for the (1+1) EAℓ′ using the fitness function funit. Furthermore, x loc is at distance Ω(n) from the global optimum. Proof. Let e1, e2, . . . , em denote the sequence of adjacent edges of Pn. The sequence of induced edge labels produced by xloc is m− 1,m− 2,m− 3, . . . , 3, 2, 1, ⌈m/2⌉. Therefore, all the induced edge labels are unique except for ⌈m/2⌉, which appears exactly twice. Thus funit(x loc) = m− 2. Note that funit(x) ≤ funit(xloc) for all x ̸= x∗ where x∗ is the unique global optimum for Pn described in Lemma 21. 62 Note that |{i : xloci ̸= x∗i } ≥ ⌊m/2⌋ since the labels do not match in every other position. Since x∗ is the unique global optimum of Pn, it is necessary to move at least ⌊m/2⌋ elements. Since each swap can move at most two elements into the correct position, we conclude that at least ⌊m/2⌋/2 ≥ (m− 2)/4 = Ω(n) swaps are necessary to transform xloc into x∗. In order to overcome local optima of the type presented in Proposition 22, we de- compose Equation (3.2) into two components. Similar to the genotype-phenotype maps from early work in genetic programming [5], we consider permutations of {0, 1, . . . ,m} isomorphic to the symmetric group Sm+1, as the genotype space, and the sets of unique edges in the corresponding labeling, isomorphic to (a subset) of {0, 1}m as the phenotype space. Equation (3.1) can be written as the composition of a linear function pseudo-Boolean function f : {0, 1}m → N and a genotype-phenotype mapping g : Sm+1 → {0, 1}m. See Figure 3.13 for an illustration. genotype space phenotype space Sm+1 {0, 1}m fi tn es s unique edges linear function Figure 3.13: Genotype-phenotype mapping from permutations of vertex labels into selection of unique edges. This is composed with a linear function on phenotype space. We use Domino Convergence to our advantage. Employing the exponentially weighted fitness function fexp forces the phenotype function into domino convergence. fexp causes unique edges to become fixed one-by-one, preventing the (1+1) EAℓ′ from becoming stuck in a local optimum. Doing this we guarantee a polynomial time expected performance of the (1+1) EAℓ′ on paths, proven in the next theorem. 63 Theorem 23. Let Pn be a path on n vertices and ℓ ′(v) = 0 be the partial labeling of a single degree-one vertex of Pn. Then using the fitness function fexp, the expected runtime of the (1+1) EAℓ′ is bounded above by en3 4 . Proof. Denote the vertices of Pn as v0, v1, . . . , vm, and the edges as e1, e2, . . . , em such that ei = vi−1vi. Obviously, m = n− 1, and without loss of generality, let ℓ′(v0) = 0 in the (1+1) EAℓ′ . Let x ∗ = (m, 1,m − 1, 2, . . .) be the unique optimum described in Lemma 21 with the zero element removed, and let x = (x1, x2, . . . , xn−1) be an arbitrary permutation of (1, . . . , n − 1). We define the potential function over such permutations as follows ϕ(x) := n−min{{i ∈ [n− 1] : xi ̸= x∗i } ∪ n}. Thus x is a graceful labeling (and fexp is maximized) if and only if ϕ(x) = 0. We first argue that the potential cannot increase during a run of the (1+1) EAℓ′ . For any sequence x, fexp(x) ≥ m−ϕ(x)∑ k=0 2m−k = 2m+1 − 2ϕ(x), since at least the induced edge labels e1, e2, . . . , em−ϕ(x) are unique. Moreover, we have fexp(x) ≤ ( m∑ k=0 2k ) − 2m−(m−ϕ(x)+1) = 2m+1 − 2ϕ(x)−1 − 1, because we know at least the induced edge label for em−ϕ(x)+1 is incorrect. 64 Thus, if y is produced from x during the mutation step of the (1+1) EAℓ′ and ϕ(y) > ϕ(x), fexp(x)− fexp(y) ≥ 2ϕ(y)−1 − 2ϕ(x) + 1 > 0, and thus y would not be accepted. Now let (Xt)t≥0 be the sequence of random variables corresponding to the values of the potential function ϕ during a run of the (1+1) EAℓ′ . Let x be the current labeling at time t (so Xt = ϕ(x)). Then the induced edge labels on edges e1, e2, . . . , em−ϕ(x) are unique, since xn−ϕ(x) = xm−ϕ(x)+1 ̸= x∗m−ϕ(x)+1, there must be an element xj of the sequence where j > n− ϕ(x) and xj = x∗m−ϕ(x)+1. A strictly improving move can be produced by exchanging xj and xn−ϕ(x). This occurs with probability at least 2 Pr (Pois(λ = 1) = 1)( n−1 2 ) = 4 e(n− 1)(n− 2) . Since such a move decreases the potential by at least one, the drift of the potential is E[Xt−Xt+1 | Xt] ≥ 4e(n−1)(n−2) , and the claim follows by application of Theorem 1. Naturally we wonder how the (1+1) EAℓ′ would perform when equipped with the Jump or Reverse operations introduced in [51]. Using the same ideas as Theorem 23 we can prove the following Theorem. Theorem 24. Let Pn be a path on n vertices and ℓ ′(v) = 0 be the partial labeling of a single degree-one vertex of Pn. Then using the fitness function fexp, the expected runtime of the (1+1) EAℓ′ equipped with the Jump or Reverse mutation operator is bounded above by en 3 4 . When we equip the (1+1) EAℓ′ with the fexp fitness function and the Reverse mutation operator we are able to find a graceful labeling without fixing any of the 65 vertices and we do not need to rely on Domino Convergence. But first we need to note the following. Proposition 25. There are k+ 1 vertex pairs that can be selected for Reverse to get the edge weight w(uv) = |xu−xv| = m−k such that xu > xv. These pairs are exactly: (xu, xv) ∈ {(m, k), (m− 1, k − 1), . . . , (m− k, 0)} = Ck. Let Lk(α) be a labeling of Pn where the k largest edge weights are unique and part of α subpaths. We call a vertex label locked if it is an internal vertex on one of the α subpaths. Proposition 26. There are k − α locked vertices vertex labels in Lk(α). Proof. Call the i-th subpath pi, and say |pi| is the number of vertices in the subpath. There are then |pi| − 1 edges in the i-th subpath and |pi| − 2 interior locked vertices. That means ∑α i=1(|pi| − 1) = k so the total number of vertices in the subpaths is∑k i=1 |pi| = k+ α and ∑α i=1(|pi| − 2) = ∑α i=1 |pi| − 2α = (k+ α)− 2α = k− α locked vertices. For a weight m− k, we call a vertex combination (xu, xv) infeasible if xu or xv is locked in Lk(α). If we have the candidate solution x let R(x, i, j) be the Reverse oper- ator that omits the candidate solution x′ = (x1x2 . . . , xj, xj−1, . . . , xi, xj+1, . . . , xm+1), reversing xi, xi+1, . . . , xj. We can now prove the following lemma. Lemma 27. Given Lk(α) we can use at most two Reverse operations to get Lk+1(α ′) until k ≥ m 2 Proof. The largest weight not in Lk(α) is m − k, so to get Lk+1(α′) we need to add this weight. Since there are k−α locked vertices 26 and there are k+1 combinations 66 to get edge weight m− k we get that there are (k+1)− (k−α) = α+1 vertex pairs (xu, xv) ∈ {(m, k), (m− 1, k− 1), . . . , (m− k, 0)} such that neither xu or xv is locked. This is because {m, k} ∪ {m − 1, k − 1} ∪ . . . ∪ {m − k, 0} = ∅, if k ≥ m 2 this is not true. Now we know that there are α subpaths, so if each vertex is an end vertex in one of the subpaths, there are at least two vertex labels not in these α subpaths. Let (xi, xj) be vertex labels not in the same subpath giving edge weight m− k. Case 1: Neither xi or xj are labels of end vertices of a subpath. The reverse operations R(x, i+ 1, j) and R(x, i, j − 1) emit Lk+1(α + 1). Case 2: Exactly one of xi or xj is the labeling of an end vertex of some subpath. Without loss of generality, suppose this vertex is xi. Subcase 2.1: Pi = vi, vi+1, . . . , vh is the subpath. We can use R(x, i, j − 1) to get Lk+1(α). Subcase 2.2: Pi = vh, vh+1, . . . , vi is the subpath. We can use R(x, i+ 1, j) to get Lk+1(α). Case 3: Both xi and xj are labelings of subpaths Pi and Pj respectively. Subcase 3.1: xi and xj labelings of the left (or right) end vertices of their sub- paths. Without loss of generality let them the the right end vertices. Then Pi = vi, vi+1, . . . , vh and Pj = vj, vj+1, . . . , vh′ . We can use the operation R(x, i, j − 1) to get Lk+1(α− 1). Subcase 3.2.1: Pi = vi, vi+1, . . . , vh and Pj = vh′ , vh′+1, . . . , vj. We can use the two operations R(x, h′, j) = x′ to reverse Pj and R(x′, i, j − 1) to get Lk+1(α− 1). Subcase 3.2.2: Pi = vh, vh+1, . . . , vi and Pj = vj, vj+1, . . . , vh′ . We can use the two operations R(x, j, h′) = x to reverse Pj and R(x′, i+ 1, j) to get Lk+1(α− 1). Now that we know that there is always at least a pair of operations that will give us a candidate solution with the next largest weight, we prove the following Theorem. 67 · · · xi−2 xi−1 xi xi+1 xi+2 · · · xj−2 xj−1 xj xj+1 xj+2 · · · · · · xi−2 xi−1 xi xj xj−1 xj−2 · · · xi+2 xi+1 xj+1 xj+2 · · ·m− k · · · xi−2 xi−1 xj−1 xj−2 · · · xi+2 xi+1 xi xj xj+1 xj+2 · · ·m− k · · · xi−1 xi xi+1 · · · xh · · · xh′−1 xh′ · · · xj−1 xj · · · · · · xi−1 xi xi+1 · · · xh · · · xh′−1 xj xj−1 · · · xh′ · · · · · · xi−1 xh′−1 · · · xh · · · xi+1 xi xj xj−1 · · · xh′ · · ·m− k Figure 3.14: The top path is a baseline, the second path shows the operation R(x, i + 1, j), the third path shows the operation R(x, i, j − 1), paths four through six demonstrate Case 3 subcase 1. Theorem 28. Let Pn be a path on n vertices, using the fitness function fexp, the expected runtime of the (1+1) EAℓ′ equipped with the Reverse mutation operator to find a labeling with the largest m 2 edges unique is bounded by O(n5). Proof. Let x = (x1, x2, . . . , xn−1) be an arbitrary permuation of (1, 2, . . . , n − 1). Define the potential function over these permutations as ϕ(x) = max{{i ∈ [n− 1] : χi(x) = 0} ∪ {0}} ϕ(x) = 0 if an only if fexp(x) is maximized and x is a graceful labeling. If we have a candidate solution x with the k largest induced edge weights being unique, and a candidate solution x′ with the k+1 largest induced edge weights being unique, we know that fexp(x ′) ≥ ∑ki=0 2m−i > ∑k−1i=0 2m−i ≥ fexp(x). Because of this once a solution with the k largest induced edge weights being unique, all following candidate solutions will have at least the k largest induced edge weights unique. Also because of this ϕ will nver increase during a run of the (1+1) EAℓ′ . 68 Let (Xt)t≥0 be the sequence of random variables corresponding to ϕ during a run of the (1+1) EAℓ′ . Also let x be the current labeling at time t (so Xt = ϕ(x)). We know from 27 that we can find a strictly improving move using at most two Reverse operations. This occurs with probability at least Pr (Pois(λ = 1) = 2)( n−1 2 )( n−1 2 ) = 1 e2(n− 1)2(n− 2)2 = Ω ( 1 n4 ) . Since the potential is decreased by at least one when this move occurs, the drift of the potential is E[Xt−Xt+1 | Xt] = O(n5), and the claim follows by application of Theorem 1. 3.2.4 Experiments To empirically investigate the runtime of the (1+1) EA and (1+1) EAℓ′ and com- pare their performance against a complete constraint solver, we ran experiments on a heterogeneous HPE cluster with AMD ROME compute nodes, distributing runs using GNU Parallel [55]. We collected data on paths Pn, wheel graphs, cycles of length 0, 3 (mod 4) and various bicliques. The problem of finding a graceful labeling can be characterized as a constraint satisfaction problem [52] in which both the vertex and edge labels are variables and there is a ternary constraint on each edge that enforces the edge variable is the absolute difference of the variables corresponding to its incident vertices. Moreover, the global AllDiff constraint is enforced over the set of vertex label variables and edge label variables. To compare the evolutionary algorithms to a complete constraint solver for this setting, we contrasted the (1+1) EA and the (1+1) EAℓ′ using the different fitness functions to the GECODE constraint solver [23]. For each of the different graphs classes and solving methods, we collected data for graph sizes between 69 5 10 15 20 25 30 35 40 45 50 0.0001 0.001 0.01 0.1 1 10 100 funit flin fexp (1+1) EA funit flin fexp (1+1) EAℓ′ CSP solver n ti m e [s ec ] Figure 3.15: Timing data for constraint solver, (1+1) EA and (1+1) EAℓ′ on path graphs Pn. n = 5 and n = 50. For each n, we collected 50 trials of each search algorithm, with a two hour cutoff. In Figure 3.15, we plot for each approach the wall clock time vs. n on path graphs Pn. We see that the EAs outperform the CSP solver for finding graceful labelings, and the advantage increases with n. Nevertheless, even for n > 26, the traditional (1+1) EA is not able to find a graceful labeling within the cutoff. When we fix a degree-one vertex to label zero and use the exponentially weighted function fexp, we see extremely efficient behavior. In fact, this technique was the only one that could solve up to P50, and could do so in a fraction of a second. This supports our theoretical 70 5 10 15 20 25 30 35 40 45 50 0.00001 0.0001 0.001 0.01 0.1 1 10 100 funit flin |ℓ′| = 2 funit flin |ℓ′| = 1 (1+1) EAℓ′ funit flin (1+1) EA CSP solver n ti m e [s ec ] Figure 3.16: Timing data for constraint solver, (1+1) EA and (1+1) EAℓ′ on bicliques K2,n−2. For (1+1) EAℓ′ we also compare size 1 and size 2 partial labelings. analysis that symmetry breaking with partial label-fixing and enforcing phenotypic domino convergence is useful for this problem. Figure 3.17 plots the running time of the algorithms in dependence of n on cycle graphs Cn and wheel graphs Wn. The (1+1) EA outperforms the constraint solver on cycles (the latter does not complete all trials within the cutoff already for n > 16), and the (1+1) EA with the unit fitness function funit outperforms both the constraint solver and the other fitness functions on wheels. For bicliques K2,n−2, we measured the running time of the (1+1) EA and the (1+1) EAℓ′ with a partial labeling assigning a1, a2 ∈ A the labels 0 and n − 2, as in 71 8 10 12 14 16 18 20 22 24 0.00001 0.1 1,000 funit flin fexp (1+1) EA CSP solver n ti m e [s ec ] 6 8 10 12 14 0.0001 0.1 100 funit flin fexp (1+1) EA CSP solver n ti m e [s ec ] Figure 3.17: Timing data for EAs and constraint solver on cycle graphs Cn (left) and wheel graphs Wn (right). 10 20 30 40 50 0.00001 0.01 10 funit flin (1+1) EAℓ′ CSP solver n ti m e [s ec ] (a) K3,n−2. 10 20 30 40 50 0.00001 0.01 10 funit flin (1+1) EAℓ′ CSP solver n ti m e [s ec ] (b) K4,n−4. Figure 3.18: Timing data for (1+1) EAℓ′ and constraint solver on bicliques with partition size 3 and 4. the claim of Theorem 18. We also ran the (1+1) EAℓ′ with a size-one partial labeling by assignment a1 the label n− 2 as in the claim of Theorem 19. Figure 3.16 reports these results. The plot supplies empirical evidence that the expectation bound in Theorem 19 is not tight, and that O(n2 log n), as in Theorem 18 is closer to the truth. However, we note some strange oscillations in the timing of funit when |ℓ′| = 1. This suggests the existence of local optima trapping the process on the funit fitness landscape in this setting. We also plot the running time of the (1+1) EAℓ′ and constraint solver in Figure 3.18 for larger partition sizes. Note that we omit fexp for bicliques due to the fact that m = 2(n − c), and so as n approaches 32 + c, the experiments begin to suffer from integer overflow on 64-bit architecture. 72 4 Conclusions In this thesis we analyzed the performance of variations of the simple evolutionary algorithm, the (1+1) EA, when applied to various combinatorial optimization graph problems. In Section 3.1 we presented (1+1) EAj+rk , a variant of the (1+1) EA, that uses a focused jump-and-repair operation to find feasible vertex sets of size k to solve parameterized variants of graph problems closed under induced subgraphs. Instead of searching for a vertex set that is feasible on the entire graph we search in parallel for an induced subgraph and a vertex set that is feasible on this induced subgraph. When an offspring is found that has a cardinality infeasible vertex set we give it the opportunity to be probabilistically repaired by the focused jump-and-repair operation. Using this approach we proved that the (1+1) EAj+rk is an FPT Monte-Carlo algorithm for the k-VertexCover, the k-FVST, and the k-OddCycleTranversal problems. We then proved that using a simple restarting framework allows the (1+1) EAj+rk can solve the minimum VertexCover problem in O(2OPTn2 log n) where OPT is the size of the optimal vertex cover. This upper bound outperforms the best known upper bound for FPT evolutionary algorithms on the minimum VertexCover by an exponential factor in OPT . We investigate leading constants and small-n behavior empirically in Section 3.1.4. A downside to our approach is the jump-and-repair operation is problem spe- cific and needs to be designed with each problem in mind. In some problems, like the k-VertexCover problem, the repair operation is natural, but in others, like k-FVST and k-OddCycleTransversal, the repair operation is more involved. 73 The problem-tailored jump-and-repair operation and the fitness function are the only problem-specific components required by the (1+1) EAj+rk to solve parameterized problems. Using the general framework presented will make it easier to design repair operators for other parameterized problems. Similarly, in Section 3.2 we present results for the (1+1) EA when finding graceful labelings of certain classes of graphs that have clean structure. We present rigor- ous runtime bounds for stars, paths, and bicliques. We also demonstrate a way to overcome synchronization problems that arise from local symmetries by using partial labelings to force domino convergence to find graceful labelings. The results prove that evolutionary algorithms using these techniques can label some types of graphs in expected polynomial time. These results are the first runtime bounds for evolutionary algorithms on graph labeling problems. In Section 3.2.4 we present empirical results that demonstrate the (1+1) EA typ- ically outperforms a complete constraint solver, even without the use of partial la- belings to break symmetry. This includes results on classes of graphs not covered by our theoretical results. This work leaves us with many directions for future work. First the upper bounds for the (1+1) EA applied to the graceful labeling problem can be tightened. Then there are missing lower bounds on the FPT evolutionary algorithms for Vertex- Cover and the (1+1) EA on the graceful labelings. These missing bounds make it difficult to directly compare algorithms, and without lower bounds it is hard to fully understand the performance of these algorithms. Also experiments comparing the em- pirical runtimes for both the traditional EAs and Global SEMO on VertexCover would be important. It would also be valuable to generalize the performance of the (1+1) EA on the graceful labeling problem to that of a more sophisticated search heuristic and larger graph classes. This could be carried out experimentally. It could 74 also prove interesting to investigate the performance of evolutionary algorithms on other types of graph labelings. 75 Bibliography [1] M. Adamaszek. “Efficient enumeration of graceful permutations”. In: CoRR math/0608513 (2006). url: http://arxiv.org/abs/math/0608513. [2] R. E. L. Aldred and B. D. McKay. “Graceful and harmonious labellings of trees”. In: Bulletin of the Institute of Combinatorics and its Applications 23 (1998), pp. 69–72. [3] I. C. Arkut, R. C. Arkut, and N. Ghani. “Graceful label numbering in op- tical MPLS networks”. In: OptiComm 2000: Optical Networking and Com- munications. Ed. by I. Chlamtac. Vol. 4233. International Society for Optics and Photonics. SPIE, 2000, pp. 1–8. doi: 10.1117/12.401809. url: https: //doi.org/10.1117/12.401809. [4] J. Ayel and O. Favaron. The helms are graceful. Universite´ Paris-Sud, Centre d’Orsay, Laboratoire de recherche en Informatique, 1981. [5] W. Banzhaf. “Genotype-Phenotype-Mapping and Neutral Variation - A Case Study in Genetic Programming”. In: Parallel Problem Solving from Nature - PPSN III, International Conference on Evolutionary Computation. The Third Conference on Parallel Problem Solving from Nature, Jerusalem, Israel, Octo- ber 9-14, 1994, Proceedings. Ed. by Y. Davidor, H. Schwefel, and R. Ma¨nner. Vol. 866. Lecture Notes in Computer Science. Springer, 1994, pp. 322–332. doi: 76 10.1007/3-540-58484-6\_276. url: https://doi.org/10.1007/3-540- 58484-6%5C_276. [6] J.-C. Bermond. “Graceful graphs, radio antennae and French windmills”. In: Proceedings of the One Day Combinatorics Conference. Vol. 34. Research Notes in Mathematics. Pitman, 1979, pp. 18–37. [7] G. S. Bloom and S. W. Golomb. “Applications of Numbered Undirected Graphs”. In: Proceedings of the IEEE 65.4 (Apr. 1977), pp. 562–570. [8] L. Brankovic and I. M. Wanless. “Graceful Labelling: State of the Art, Appli- cations and Future Directions”. In: Math. Comput. Sci. 5.1 (2011), pp. 11–20. doi: 10.1007/s11786-011-0073-6. [9] M. Cai, X. Deng, and W. Zang. “An Approximation Algorithm for Feedback Vertex Sets in Tournaments”. In: SIAM Journal on Computing 30.6 (2001), pp. 1993–2007. doi: 10.1137/s0097539798338163. url: http://dx.doi.org/ 10.1137/S0097539798338163. [10] C. A. Coello Coello. “Theoretical and numerical constraint-handling techniques used with evolutionary algorithms: a survey of the state of the art”. In: Com- puter Methods in Applied Mechanics and Engineering 191.11-12 (Jan. 2002), pp. 1245–1287. issn: 0045-7825. doi: 10.1016/s0045-7825(01)00323-1. url: http://dx.doi.org/10.1016/S0045-7825(01)00323-1. [11] T. H. Cormen et al. Introduction to Algorithms. Third. The MIT Press, 2009. [12] B. Doerr, Y. Ghannane, and M. I. Brahim. “Towards a Stronger Theory for Permutation-Based Evolutionary Algorithms”. In: Proceedings of the Genetic and Evolutionary Computation Conference. GECCO ’22. Boston, Massachusetts: Association for Computing Machinery, 2022, pp. 1390–1398. isbn: 9781450392372. 77 doi: 10.1145/3512290.3528720. url: https://doi.org/10.1145/3512290. 3528720. [13] B. Doerr, D. Johannsen, and C. Winzen. “Multiplicative Drift Analysis”. In: Algorithmica 64 (2012), pp. 673–697. [14] M. Dom et al. “Fixed-parameter tractability results for feedback set problems in tournaments”. In: Journal of Discrete Algorithms 8.1 (2010), pp. 76–86. doi: 10.1016/j.jda.2009.08.001. url: http://dx.doi.org/10.1016/j.jda. 2009.08.001. [15] R. G. Downey and M. R. Fellows. Parameterized Complexiy. Springer, 1999. [16] A. E. Eiben and J. E. Smith. Introduction to Evolutionary Computing. 2nd. Springer Publishing Company, Incorporated, 2015. isbn: 3662448734. [17] K. Eshghi and P. Azimi. “Applications of mathematical programming in grace- ful labeling of graphs”. In: Journal of Applied Mathematics (2004), pp. 1–8. doi: 10.1155/S1110757X04310065. [18] W. Fang. “A Computational Approach to the Graceful Tree Conjecture”. In: CoRR abs/1003.3045 (2010). arXiv: 1003.3045. url: http://arxiv.org/ abs/1003.3045. [19] J. Flum and M. Grohe. Parameterized complexity theory. Springer-Verlag, 2006. [20] T. Friedrich et al. “Approximating covering problems by randomized search heuristics using multi-objective models”. In: Proceedings of the Conference on Genetic and Evolutionary Computation (GECCO). 2007, pp. 797–804. doi: 10. 1145/1276958.1277118. url: http://dx.doi.org/10.1145/1276958. 1277118. 78 [21] R. Frucht. “Graceful Numbering of Wheels and Related Graphs”. In: Annals of the New York Academy of Sciences 319.1 (1979), pp. 219–229. doi: https: //doi.org/10.1111/j.1749-6632.1979.tb32792.x. [22] J. A. Gallian. “A dynamic survey of graph labeling”. In: Electronic Journal of Combinatorics (2021). Dynamic Surveys #DS6. [23] Gecode Team. Gecode: Generic Constraint Development Environment. Avail- able from http://www.gecode.org. 2019. [24] D. E. Goldberg, R. Lingle, et al. “Alleles, loci, and the traveling salesman problem”. In: Proceedings of the First International Conference on Genetic Al- gorithms and their Applications (ICGA). Ed. by J. J. Grefenstette. Vol. 154. Hillsdale, NJ: Lawrence Erlbaum, 1985, pp. 154–159. [25] S. W. Golomb. “How To Number A Graph”. In: Graph Theory and Com- puting. Ed. by R. C. Read. Academic Press, 1972, pp. 23–37. isbn: 978-1- 4832-3187-7. doi: https://doi.org/10.1016/B978- 1- 4832- 3187- 7. 50008-8. url: https://www.sciencedirect.com/science/article/pii/ B9781483231877500088. [26] J. He and X. Yao. “A study of drift analysis for estimating computation time of evolutionary algorithms”. In: Natural Computing 3 (2004), pp. 21–35. [27] J. He, X. Yao, and J. Li. “A comparative study of three evolutionary algo- rithms incorporating different amounts of domain knowledge for node covering problem”. In: IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 35.2 (2005), pp. 266–271. doi: 10.1109/TSMCC. 2004.841903. 79 [28] M. Horton. “Graceful Trees: Statistics and Algorithms”. Bachelor’s Thesis. Uni- versity of Tasmania, 2003. [29] F. Hu¨ffner. “Algorithm Engineering for Optimal Graph Bipartization”. In: J Graph Algorithms Appl 13.2 (2009), pp. 77–98. doi: 10.7155/jgaa.00177. [30] S. Khuri and T. Ba¨ck. “An Evolutionary Heuristic for the Minimum Vertex Cover Problem”. In: Proceedings of the Eighteenth Annual German Conference on Artificial Intelligence (KI-94). Ed. by J. Kunze and H. Stoyan. 1994, pp. 86– 90. [31] S. Kratsch and F. Neumann. “Fixed-Parameter Evolutionary Algorithms and the Vertex Cover Problem”. In: Algorithmica 65.4 (2012), pp. 754–771. doi: 10.1007/s00453-012-9660-4. url: http://dx.doi.org/10.1007/s00453- 012-9660-4. [32] R. Le Bras, C. P. Gomes, and B. Selman. “Double-wheel graphs are grace- ful”. In: Twenty-Third International Joint Conference on Artificial Intelligence. 2013. [33] J. M. Lewis and M. Yannakakis. “The Node-Deletion Problem for Hereditary Properties is NP-Complete”. In: J. Comput. Syst. Sci. 20.2 (1980), pp. 219–230. doi: 10.1016/0022-0000(80)90060-4. [34] M. L. Lidd. Traveling salesman problem domain: Application of a fundamentally new approach to utilizing genetic algorithms. Tech. rep. Air Force Contract F4920-90-G-003. MITRE Coroporation, 1991. [35] D. Lokshtanov, S. Saurabh, and S. Sikdar. “Simpler Parameterized Algorithm for OCT”. In: Combinatorial Algorithms, 20th International Workshop, IWOCA 2009, Hradec nad Moravicı, Czech Republic, June 28-July 2, 2009, Revised Se- 80 lected Papers. Ed. by J. Fiala, J. Kratochvıl, and M. Miller. Vol. 5874. Lecture Notes in Computer Science. Springer, 2009, pp. 380–384. doi: 10.1007/978-3- 642-10217-2\_37. url: https://doi.org/10.1007/978-3-642-10217-2_37. [36] D. Lokshtanov et al. “Faster Parameterized Algorithms Using Linear Program- ming”. In: ACM Trans. Algorithms 11.2 (2014), 15:1–15:31. doi: 10.1145/ 2566616. [37] H. Mahmoudzadeh and K. Eshghi. “A Metaheuristic Approach to the Graceful Labeling Problem of Graphs”. In: 2007 IEEE Swarm Intelligence Symposium. 2007, pp. 84–91. doi: 10.1109/SIS.2007.368030. [38] Minnesota Super COmputing Institute. url: http://www.msi.umn.edu. [39] G. G. Mitchell et al. “GeneRepair - A Repair Operator for Genetic Algorithms”. In: Proceedings of the Genetic and Evolutionary Computation (GECCO). 2003. url: http://mural.maynoothuniversity.ie/10351/. [40] H. Mu¨hlenbein. “Parallel Genetic Algorithms in Combinatorial Optimization”. In: Computer Science and Operations Research: New Developments in their Interfaces. Elsevier, 1992, pp. 441–453. doi: 10.1016/b978-0-08-040806- 4.50034- 4. url: http://dx.doi.org/10.1016/B978- 0- 08- 040806- 4.50034-4. [41] F. Neumann and A. M. Sutton. “Parameterized Complexity Analysis of Ran- domized Search Heuristics”. In: Theory of Evolutionary Computation: Recent Developments in Discrete Optimization. Ed. by B. Doerr and F. Neumann. Natural Computing Series. Springer International Publishing, 2020, pp. 213– 248. isbn: 978-3-030-29414-4. doi: 10.1007/978-3-030-29414-4_4. url: https://doi.org/10.1007/978-3-030-29414-4_4. 81 [42] F. Neumann and C. Witt. “Bioinspired Computation in Combinatorial Opti- mization: Algorithms and Their Computational Complexity”. In: Proceedings of the 14th Annual Conference Companion on Genetic and Evolutionary Com- putation. GECCO ’12. Philadelphia, Pennsylvania, USA: Association for Com- puting Machinery, 2012, pp. 1035–1058. isbn: 9781450311786. doi: 10.1145/ 2330784.2330928. url: https://doi.org/10.1145/2330784.2330928. [43] P. S. Oliveto, J. He, and X. Yao. “Analysis of the (1 + 1)-EA for Finding Approximate Solutions to Vertex Cover Problems”. In: IEEE Transactions on Evolutionary Computation 13.5 (2009), pp. 1006–1029. doi: 10.1109/tevc. 2009.2014362. url: http://dx.doi.org/10.1109/TEVC.2009.2014362. [44] M. Pelikan, R. Kalapala, and A. K. Hartmann. “Hybrid evolutionary algorithms on minimum vertex cover for random graphs”. In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO). ACM, 2007, pp. 547– 554. doi: 10.1145/1276958.1277073. url: https://doi.org/10.1145/ 1276958.1277073. [45] K. E. Petrie and B. M. Smith. “Symmetry Breaking in Graceful Graphs”. In: Principles and Practice of Constraint Programming - CP 2003, 9th Interna- tional Conference, CP 2003, Kinsale, Ireland, September 29 - October 3, 2003, Proceedings. Ed. by F. Rossi. Vol. 2833. Lecture Notes in Computer Science. Springer, 2003, pp. 930–934. doi: 10.1007/978-3-540-45193-8\_81. url: https://doi.org/10.1007/978-3-540-45193-8%5C_81. [46] S. D. Prestwich and A. Roli. “Symmetry Breaking and Local Search Spaces”. In: Integration of AI and OR Techniques in Constraint Programming for Com- binatorial Optimization Problems, Second International Conference, CPAIOR 2005, Prague, Czech Republic, May 30 - June 1, 2005, Proceedings. Ed. by R. 82 Barta´k and M. Milano. Vol. 3524. Lecture Notes in Computer Science. Springer, 2005, pp. 273–287. doi: 10.1007/11493853\_21. url: https://doi.org/10. 1007/11493853_21. [47] T. A. Redl. “Graceful Graphs and Graceful Labelings: Two Mathematical Pro- gramming Formulations and Some Other New Results”. In: Congressus Numer- antium 164 (2003), pp. 17–31. [48] B. Reed, K. Smith, and A. Vetta. “Finding odd cycle transversals”. In: Opera- tions Research Letters 32.4 (2004), pp. 299–301. issn: 0167-6377. doi: https:// doi.org/10.1016/j.orl.2003.10.009. url: https://www.sciencedirect. com/science/article/pii/S0167637703001482. [49] A. Rosa. “On certain valuations of the vertices of a graph”. In: Theory of Graphs, International Symposium. Gordon and Breach; Dunod, 1967, pp. 349– 355. [50] S. Salcedo-Sanz. “A survey of repair methods used as constraint handling tech- niques in evolutionary algorithms”. In: Computer Science Review 3.3 (Aug. 2009), pp. 175–192. issn: 1574-0137. doi: 10.1016/j.cosrev.2009.07.001. url: http://dx.doi.org/10.1016/j.cosrev.2009.07.001. [51] J. Scharnow, K. Tinnefeld, and I. Wegener. “The analysis of evolutionary al- gorithms on sorting and shortest paths problems”. In: J. Math. Model. Algo- rithms 3.4 (2004), pp. 349–366. doi: 10.1007/s10852- 005- 2584- 0. url: https://doi.org/10.1007/s10852-005-2584-0. [52] B. M. Smith and J. Puget. “Constraint models for graceful graphs”. In: Con- straints 15.1 (2010), pp. 64–92. doi: 10.1007/s10601- 009- 9071- 6. url: https://doi.org/10.1007/s10601-009-9071-6. 83 [53] E. Speckenmeyer. “On feedback problems in digraphs”. In: Graph-Theoretic Concepts in Computer Science. Ed. by M. Nagl. Berlin, Heidelberg: Springer Berlin Heidelberg, 1990, pp. 218–231. isbn: 978-3-540-46950-6. [54] A. M. Sutton. “Fixed-Parameter Tractability of Crossover: Steady-State GAs on the Closest String Problem”. In: Algorithmica 83.4 (2021), pp. 1138–1163. doi: 10.1007/s00453-021-00809-8. url: https://doi.org/10.1007/s00453- 021-00809-8. [55] O. Tange. “GNU Parallel - The Command-Line Power Tool”. In: ;login: The USENIX Magazine 36.1 (Feb. 2011), pp. 42–47. doi: http://dx.doi.org/10. 5281/zenodo.16303. url: http://www.gnu.org/s/parallel. [56] O. Tange. GNU Parallel 20210822 (’Kabul’). GNU Parallel is a general paral- lelizer to run multiple serial command line programs in parallel without chang- ing them. Aug. 2021. doi: 10.5281/zenodo.5233953. url: https://doi.org/ 10.5281/zenodo.5233953. [57] R. A. Wright et al. “Constant Time Generation of Free Trees”. In: SIAM Journal on Computing 15.2 (1986), pp. 540–548. doi: 10.1137/0215039. 84