11institutetext: RWTH Aachen, Aachen, Germany
11email: [email protected]
22institutetext: CNRS, LaBRI, Université de Bordeaux, Talence, France
22email: [email protected]
33institutetext: LaBRI, Université de Bordeaux, Talence, France
and University of Augsburg, Augsburg, Germany
33email: [email protected]

Graph Exploration with Edge Weight Estimates

Matthias Gehnen 11 0000-0001-9595-2992    Ralf Klasing
and Émile Naquin
2233 0009-0009-1496-3717
Abstract

In the Travelling Salesman Problem, every vertex of an edge-weighted graph has to be visited by an agent who traverses the edges of the graph. In this problem, it is usually assumed that the costs of each edge are given in advance, making it computationally hard but possible to calculate an optimal tour for the agent.

Also in the Graph Exploration Problem, every vertex of a given graph must be visited, but here the graph is not known in the beginning - at every point, an algorithm only knows about the already visited vertices and their neighbors.

Both however are not necessarily realistic settings: Usually the structure of the graph (for example underlying road network) is known in advance, but the details are not. One usually has a prediction of how long it takes to traverse through a particular road, but due to road conditions or imprecise maps the agent might realize that a road will take slightly longer than expected when arriving on it. To deal with those deviations, it is natural to assume that the agent is able to adapt to the situation: When realizing that taking a particular road is more expensive than expected, recalculating the tour and taking another road instead is possible.

In a sense, this setting lies in between the offline travelling salesman problem, and the online problem of graph exploration, as it allows some computation at the beginning, but remains an online problem during traversal through the graph.

We analyze the competitive ratio of this problem based on the perturbation factor α𝛼\alphaitalic_α of the edge weights. For general graphs we show that for realistic factors smaller than 2222 there is no strategy that achieves a competitive ratio better than α𝛼\alphaitalic_α, which can be matched by a simple algorithm.

In addition, we prove an algorithm which has a competitive ratio of 1+α21𝛼2\frac{1+\alpha}{2}divide start_ARG 1 + italic_α end_ARG start_ARG 2 end_ARG for restricted graph classes like complete graphs with uniform announced edge weights. Here, we present a matching lower bound as well, proving that the strategy for those graph classes is best possible. We conclude with a remark about special graph classes like cycles.

Keywords:
Graph Exploration Travelling Salesman Estimates Online Algorithms

1 Introduction

When planning a tour for a travelling salesman, usually it is assumed that the distances between each two points are known in advance. Therefore, computing an optimal route is possible, although the computation might be hard. However, this assumption might not be realistic in real-world applications. Take a simple example:

Imagine a tourist who wants to visit n𝑛nitalic_n sights across a city. The tourist is aware of the street map, and therefore knows the structure of the city he wants to explore. However, the time that it will take to travel on each road is not exactly known in advance, since unexpected traffic, accidents, or road conditions are not depicted in the tourist’s street map. As soon as the tourist arrives at some street, he will get aware of the actual conditions and therefore of the time traversing the road takes.

We model this by assuming that the agent who explores the graph knows the graph structure together with an interval for each edge weight in advance. The actual, precise weight of each edge gets revealed when the agent is on a vertex incident to the edge. As we focus on the impact of the missing information, we assume that the agent has unbounded computational power and space.

In this article, we start with a formal definition of the graph exploration problem with edge weight estimates. Afterwards, we will give a short overview about the travelling salesman and graph exploration problem and see how the topic of this work lies in between them.

In Section 2, we will see that in general it is not possible to hope for algorithms that achieve a better competitive ratio than the ratio between the upper and lower bound of the edge weight announcement. This also shows that a simple algorithm, that just calculates the shortest tour based on the lower bounds announced is best possible in some cases.

This however is not true for all graph classes, as it is shown in Section 3: We will see that graphs like complete graphs or complete balanced bipartite graphs allow a competitive ratio of 1+α21𝛼2\frac{1+\alpha}{2}divide start_ARG 1 + italic_α end_ARG start_ARG 2 end_ARG, when each edge weight is announced uniformly in an interval of [1,α]1𝛼[1,\alpha][ 1 , italic_α ]. This can be achieved by using an algorithm that recalculates the cheapest possible tour in each step when new edge weights are revealed. We further show that this competitive ratio is best possible for those graph classes, but also that this algorithm is not able to calculate a better competitive ratio than α𝛼\alphaitalic_α for other graph classes, even if they have uniform announced edge weights.

We finally conclude with an outlook to other special graph classes like cycles, and close with some questions that imposed while developing this framework.

1.1 Formal Definition of the Estimates Model

In this article, we consider the travelling salesman problem as the underlying offline problem of our semi-online setting:

Definition 1 (Travelling Salesman Problem (TSP))

Given a graph G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ) with weights w(e)𝑤𝑒w(e)italic_w ( italic_e ) for each edge eE𝑒𝐸e\in Eitalic_e ∈ italic_E, a vertex sV𝑠𝑉s\in Vitalic_s ∈ italic_V as a start-vertex and a vertex tV𝑡𝑉t\in Vitalic_t ∈ italic_V as end-vertex, the Travelling Salesman (Path) Problem is the problem of finding the shortest path visiting all vertices, starting and ending at the designated vertices.

For the sake of this article, the graph does not need to be complete. The distance between two non-adjacent vertices therefore is the length of the shortest path between them. In an online setting, where the graph is not known in advance but is given during the tour by revealing the neighborhood of each vertex, the problem is also known as Graph Exploration Problem. Note that in an online setting, where not every information is revealed at the beginning, the shortest path between two points might not be clear in the beginning.

In this article, we focus on a variant where the structure of the graph is known in advance, but the actual edge weights only get revealed whenever an incident vertex is visited.

Definition 2 (Graph Exploration with Edge Weight Estimates (GEEWE))

Given a graph G=(V,E)𝐺𝑉𝐸G=(V,E)italic_G = ( italic_V , italic_E ), an upper and lower bound u(e)𝑢𝑒u(e)italic_u ( italic_e ) and (e)𝑒\ell(e)roman_ℓ ( italic_e ) for each edge weight w(e)𝑤𝑒w(e)italic_w ( italic_e ), a vertex sV𝑠𝑉s\in Vitalic_s ∈ italic_V as a start-vertex and a distinct vertex tV𝑡𝑉t\in Vitalic_t ∈ italic_V as an end-vertex, the Graph Exploration with Edge Weight Estimates (GEEWE) is the problem of finding the shortest walk visiting all vertices, starting and ending at the designated vertices. The actual weight w(e)[(e),u(e)]𝑤𝑒𝑒𝑢𝑒w(e)\in[\ell(e),u(e)]italic_w ( italic_e ) ∈ [ roman_ℓ ( italic_e ) , italic_u ( italic_e ) ] of an edge e𝑒eitalic_e is revealed as soon as the salesman visits an incident vertex.

While in the literature usually the start- and end-vertex are identical, we will assume that they are distinct from each other as in the travelling salesman path problem. For this sake, a walk will refer to an open walk with distinct start and end vertices throughout the article. This helps avoiding minor technical issues like dealing with edge weights that got revealed but connect two vertices that both need to be visited (one of them being the starting vertex). The results however translate also to the classical travelling salesman problem, as two adjacent vertices can be chosen as start- and end vertex. The competitive ratio proven in the upper and lower bounds then stays almost the same, and only deviates by one potentially optimal chosen edge in the online algorithm’s solution for all of the presented algorithms and lower bound constructions.

1.2 Related Work

1.2.1 Travelling Salesman Problem.

The problem of finding a shortest possible tour that visits a given set of places is basically as old as persons travel. The first explicit mathematical formulation is attributed to Menger in 1930 [29], afterwards it became one of the most influential problems in discrete optimization.

Due to its close relation to the Hamiltonian cycle problem, the travelling salesman problem is one of the first problems for which NP-hardness was known [21]. As an early combinatorial optimization problem, it heavily influenced research in the area of integer programming, and was one of the first problems where the cutting plane method was proven to be valuable by Dantzig et al. [15]. On the approximation side, the Christofides-Serdyukov algorithm achieves a 1.51.51.51.5-approximation ratio with a simple combinatorial argument in 1976 [14]. For a long time this approximation ratio was the best possible for metric TSP, only recently the factor of 1.51.51.51.5 from Christofides was improved to 1.5ε1.5𝜀1.5-\varepsilon1.5 - italic_ε by Karlin et al. [20]. In practice, many real-world problems are solvable close to optimality.

Due to its natural setting, many variants of the travelling salesman problem are known and have real-life applications as well. This includes for example the vehicle routing problem, where more than just one person can traverse throughout the graph, or variants in which additional constraints for the visits of the salesperson at the vertices are added [8, 23].

Due to its nature, many real-life applications in which the travelling salesman problem appears are in fact online settings. One track of the research for the online travelling salesman problem is the setting in which the actual requests are not known in advance, and only become available over time. If there are multiple salespersons, and requests need to be served immediately, this problem is also known as the k-server problem, introduced by Manasse et al. [24]. This problem focuses on which salesperson should serve the request. The question whether there is a k𝑘kitalic_k-competitive algorithm when k𝑘kitalic_k salespersons are available is known as the k𝑘kitalic_k-server conjecture, and is often referred to as the holy grail of online computation.

If however the requests do not have to be fulfilled immediately, the question about the order in which the request should be served becomes important. This line of research was started by Ausiello et al. 25 years ago [2]. In this line of research the impact of predictions is studied as well, making the problem semi-online [7]. This recent article of Bampis et al. also gives a nice overview about this line of research.

The case where the structure of the graph is discovered online is called the graph exploration problem.

1.2.2 Graph Exploration.

The graph exploration setting usually refers to a problem, where an agent needs to visit all vertices of a given graph but is only aware of the graph induced by the so far visited vertices and its neighbors.

The study of this problem was initiated by Kalyanasundaram and Pruhs in 1994 [19], where they proved a 16161616-competitive ratio for planar graphs.

Since then, the graph exploration model attracted interest in various settings. It was shown by Miyazaki et al. [26] that there cannot be an algorithm achieving a better competitive ratio than 2222 on unweighted graphs. The lower bound for the graph exploration problem on general graphs was then improved to 2.52.52.52.5 by Dobrev et al. [16]. Currently, the best lower bound is given by Birx et al. [9] with a competitive ratio of 10/310310/310 / 3 for planar graphs.

On the other hand, no algorithm achieving a constant competitive ratio is known for general graphs. The so far best approach by Rosenkrantz et al. [28] achieves a logarithmic competitive ratio, using a nearest neighbor approach. Another algorithm that achieves a logarithmic competitive ratio is a modification of depth-first search by Megow et al. [25]. However, constant competitive algorithms are known for restricted graph classes.

In Megow et al. [25], an algorithm with constant competitive ratio for bounded genus graphs was presented. They also proved that this algorithm it is not constant competitive for general graphs. Thus, we may wonder whether knowing the structure of the graph in advance might help to construct exploration algorithms with constant competitive ratio.

Their results are generalized by Baligacs et al. [6], who prove that one can achieve constant competitive ratio on any minor-free graph class.

A recent study of the advice complexity of the unweighted case with unknown graph structure is provided in Böckenhauer et al. [10], where they show that an online algorithm can compute an optimal solution on sparse graphs if it is provided with O(m)𝑂𝑚O(m)italic_O ( italic_m ) bits of correct advice, where m𝑚mitalic_m is the number of edges.

In the same year, Eberle et al. [17] studied the graph exploration problem using algorithms with predictions. Here, the authors present an algorithm that achieves a constant competitive ratio if the predictions are correct, but degrades the worse the predictions turn out to be. In real-world scenarios, these predictions could come from machine learning models trained on the kind of graphs that are expected.

Another research direction of the graph exploration problem is given by the setting where instead of one agent several can traverse through the graph simultaneously. Here, the goal is either to visit every vertex as soon as possible, or to minimize the total distance traversed by all agents. A recent overview about the different results in this direction can be found in van den Akker et al. [1].

1.2.3 Online Algorithms with Estimates.

In the research of online algorithms, one of the key questions is how to deal with the lack of information about the yet unknown parts of the problem. Therefore, a natural question that arises when thinking about an online problem is: “What information is missing exactly?”

One could for example assume that basically nothing is known about the future input. This may be seen as the somewhat most intuitive point of view, indeed online problems were first introduced in this setting by Sleator and Tarjan [30]. For a general overview about online problems, we refer to the text books of Komm [22] or Borodin and El-Yaniv [12].

Otherwise, one might assume that an algorithm does not have to be completely blind when attacking an online problem. Therefore, the concept of advice was introduced by Böckenhauer et al. [11], where an algorithm may ask for some bits of arbitrary knowledge about the future input. This has lead to various theoretical insights, but also raised the question about the availability of such advice bits. One answer to this issue is the introduction of online algorithms with predictions (sometimes called machine-learned advice) by Purohit et al. [27], where the algorithm cannot ask for particular advice bits, but usually gets a hint on what to do with each part of the input. However, those hints may be wrong, and the algorithm’s task is to balance the belief in the predictions but also keep some kind of backup if the hints turn out to be incorrect.

This however assumes that the hint can be given in a local version. However, the prediction-giver will most likely deduce the hints of his belief from the global structure of the upcoming input - for example from some underlying machine learning program.

To model this setting, a variant of the prediction model was introduced that we will call online algorithms with estimates. Here, a rough overview about the instance is given in advance, but the details like actual values of parts of the input are revealed in an online fashion.

A setting of this kind of estimates where parts of the instance are given but can be distorted at later points is natural for most problems that model a setting over time, for example scheduling. Here, the actual duration of each job might not be known at the beginning of those, like studied by Azar et al. [3, 4, 5] for different scheduling settings.

Another sensible setting that was modeled using our model of online algorithms with estimates are packing problems. Here, Gehnen et al. [18] studied the online knapsack problem, where an overview about the instance is given at the beginning, but each items size can deviate by a factor of α𝛼\alphaitalic_α. The actual weight of each item will be revealed when the algorithm needs to make a decision whether to take the item or not.

2 Unrestricted Graph Classes

The graph exploration with edge weight estimates (GEEWE) problem is a generalization of the graph exploration problem. Therefore, lower bounds for the graph exploration problem also apply for the GEEWE problem, when allowing the range between upper and lower bound to be sufficiently large.

On the other hand, every upper bound for the graph exploration problem is an upper bound for the GEEWE problem, no matter how small the ratio between upper and lower bound of each edge is.

However, the lower bound constructions used for the graph exploration model also hide the structure of the graph, for example by presenting dead ends. As those structures can only be hidden in the GEEWE problem when the factor between upper and lower bound for some edges is very large, we focus on instances where this factor is rather small.

In this section, we will see that no algorithm can achieve a better competitive ratio than the largest factor α𝛼\alphaitalic_α between the upper and lower edge weight (u(e)α(e)𝑢𝑒𝛼𝑒u(e)\leq\alpha\ell(e)italic_u ( italic_e ) ≤ italic_α roman_ℓ ( italic_e ) for all edges e𝑒eitalic_e), with α2𝛼2\alpha\leq 2italic_α ≤ 2 . Furthermore, we show that this ratio can easily be achieved by a rather simple algorithm.

Theorem 2.1

No algorithm for the GEEWE problem can achieve a better competitive ratio then the maximum factor α𝛼\alphaitalic_α between an upper and lower bound of an announced edge weight when α2𝛼2\alpha\leq 2italic_α ≤ 2.

Proof

Construction of the Instance.

The following instance is constructed recursively, where a component of recursion depth i𝑖iitalic_i consists of k𝑘kitalic_k connected components of recursion depth i1𝑖1i-1italic_i - 1 for each i1𝑖1i\geq 1italic_i ≥ 1 and a constant k𝑘kitalic_k.

A component of depth 00 just consists of a path with k+1𝑘1k+1italic_k + 1 vertices, where each edge is announced in the interval [1,α]1𝛼[1,\alpha][ 1 , italic_α ] and will get presented with the actual weight 1111. We will refer to the first vertex of the path as the starting vertex s0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, and to the last one as end vertex t0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

Each component of recursion depth i1𝑖1i\geq 1italic_i ≥ 1 is constructed with two distinguished vertices as a start- and end vertex sisubscript𝑠𝑖s_{i}italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, and k𝑘kitalic_k components of depth i1𝑖1i-1italic_i - 1 connected in a row between them. The start vertex sisubscript𝑠𝑖s_{i}italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is connected to the vertices si1subscript𝑠𝑖1s_{i-1}italic_s start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT and ti1subscript𝑡𝑖1t_{i-1}italic_t start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT of the first component of depth i1𝑖1i-1italic_i - 1 with an edge of announced weight [ki,αki]superscript𝑘𝑖𝛼superscript𝑘𝑖[k^{i},\alpha k^{i}][ italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_α italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ]. Both will be revealed with the weight of kisuperscript𝑘𝑖k^{i}italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT.

Also, two adjacent components are connected with four edges of announced weight [ki,αki]superscript𝑘𝑖𝛼superscript𝑘𝑖[k^{i},\alpha k^{i}][ italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_α italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ], connecting both the start- and end vertex of one component to the start- and end vertex of the following component. The two edges connecting the start-vertex of the first visited component to the neighboring component will be revealed with weight kisuperscript𝑘𝑖k^{i}italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT, the two edges incident to the end vertex have the actual weight αki𝛼superscript𝑘𝑖\alpha k^{i}italic_α italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT.

Finally, the start- and end vertices of the last component of depth i1𝑖1i-1italic_i - 1 are connected to the end vertex tisubscript𝑡𝑖t_{i}italic_t start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT with edges of announced weight [ki,αki]superscript𝑘𝑖𝛼superscript𝑘𝑖[k^{i},\alpha k^{i}][ italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_α italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ] as well. Again, the edge incident to the start vertex si1subscript𝑠𝑖1s_{i-1}italic_s start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT is revealed as kisuperscript𝑘𝑖k^{i}italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT, the edge attached to the end vertex ti1subscript𝑡𝑖1t_{i-1}italic_t start_POSTSUBSCRIPT italic_i - 1 end_POSTSUBSCRIPT has actual size αki𝛼superscript𝑘𝑖\alpha k^{i}italic_α italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT.

We will call the edges that are announced with weights in the interval [ki,αki]superscript𝑘𝑖𝛼superscript𝑘𝑖[k^{i},\alpha k^{i}][ italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT , italic_α italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ] edges of level i𝑖iitalic_i. Note that this construction is symmetric, so based on the announcements the start- and end vertices of each component are not distinguishable from each other.

Figure 1 depicts an example for an instance of recursion depth 2222.

s2subscript𝑠2s_{2}italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPTs1subscript𝑠1s_{1}italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTt1subscript𝑡1t_{1}italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTs0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTs0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTs0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTs1subscript𝑠1s_{1}italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTt1subscript𝑡1t_{1}italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTs0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTs0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTs0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTs1subscript𝑠1s_{1}italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTt1subscript𝑡1t_{1}italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTs0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTs0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTs0subscript𝑠0s_{0}italic_s start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt0subscript𝑡0t_{0}italic_t start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTt2subscript𝑡2t_{2}italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT111111111111111111111111111[3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][3,6][9,18][9,18][9,18][9,18][9,18][9,18][9,18][9,18][9,18][9,18][9,18][9,18]
Figure 1: An example instance with the announcements of recursion depth 2222 with k=3𝑘3k=3italic_k = 3 and α=2𝛼2\alpha=2italic_α = 2. An online algorithm needs to traverse from s2subscript𝑠2s_{2}italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT to t2subscript𝑡2t_{2}italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. The edges of level 00 are just given with the weight 1111, as they will be presented with weight 1111 anyway. Depending on the decisions of the algorithm, the other edges will either have a value of the upper or lower bound.

As the start- and end vertices of each component are not distinguishable to the algorithm by the announcements, we can say without loss of generality that the algorithm reaches the start vertex of each component before reaching the end vertex.

How an algorithm traverse the Instance.

We claim that no online algorithm can traverse from the start to the end vertex of a component of recursion depth i0𝑖0i\geq 0italic_i ≥ 0 visiting each vertex within with costs of less than i(αk+1)ki+ki+1𝑖𝛼𝑘1superscript𝑘𝑖superscript𝑘𝑖1i(\alpha k+1)k^{i}+k^{i+1}italic_i ( italic_α italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT, while an optimal offline solution is able to do the traversal with total costs of i(k+1)ki+ki+1𝑖𝑘1superscript𝑘𝑖superscript𝑘𝑖1i(k+1)k^{i}+k^{i+1}italic_i ( italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT.

For level i=0𝑖0i=0italic_i = 0, it is easy to see that both the online and offline algorithm can traverse through the path, resulting in total costs of k𝑘kitalic_k.

For each level i1𝑖1i\geq 1italic_i ≥ 1, the algorithm needs to traverse through k𝑘kitalic_k components of recursion depth i1𝑖1i-1italic_i - 1, and needs to take at least k+1𝑘1k+1italic_k + 1 edges from the i𝑖iitalic_i-th level. As the k+1𝑘1k+1italic_k + 1 connections between two consecutive components cost at least kisuperscript𝑘𝑖k^{i}italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT each, and traversing each of the k𝑘kitalic_k components costs at least (i1)(αk+1)ki1+ki𝑖1𝛼𝑘1superscript𝑘𝑖1superscript𝑘𝑖(i-1)(\alpha k+1)k^{i-1}+k^{i}( italic_i - 1 ) ( italic_α italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT by the induction hypothesis, the algorithm needs to pay at least k((i1)(αk+1)ki1+ki)+(k+1)ki=(k+1)ki+(i1)(αk+1)ki+ki+1𝑘𝑖1𝛼𝑘1superscript𝑘𝑖1superscript𝑘𝑖𝑘1superscript𝑘𝑖𝑘1superscript𝑘𝑖𝑖1𝛼𝑘1superscript𝑘𝑖superscript𝑘𝑖1k((i-1)(\alpha k+1)k^{i-1}+k^{i})+(k+1)k^{i}=(k+1)k^{i}+(i-1)(\alpha k+1)k^{i}% +k^{i+1}italic_k ( ( italic_i - 1 ) ( italic_α italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) + ( italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT = ( italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + ( italic_i - 1 ) ( italic_α italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT. However, this calculation only works if the algorithm traverses between the components on the cheap edges with costs of kisuperscript𝑘𝑖k^{i}italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT and traverses inside the components just the necessary amount.

To reach a new component, the algorithm can either leave the previous component at the end vertex with costs of αki𝛼superscript𝑘𝑖\alpha k^{i}italic_α italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT, or leave the previous component at its start vertex with costs of kisuperscript𝑘𝑖k^{i}italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT. In the former case, this traversal costs (α1)ki𝛼1superscript𝑘𝑖(\alpha-1)k^{i}( italic_α - 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT more than in the previous calculation.

In the latter case, the algorithm still needs to visit all the vertices in the previous component, including reaching its end vertex. This can be done by traversing in this component to the end vertex and back to the start vertex, or by visiting the component at least twice. If the algorithm traverses inside the component to the end and back, it needs to pay for the traversal back at least the costs of an offline algorithm, so (i1)(k+1)ki1+ki𝑖1𝑘1superscript𝑘𝑖1superscript𝑘𝑖(i-1)(k+1)k^{i-1}+k^{i}( italic_i - 1 ) ( italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT in addition to the previous calculation.

If the algorithm finally decides to visit the component twice, additional cost of a cheap edge of level i𝑖iitalic_i with costs of kisuperscript𝑘𝑖k^{i}italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT are unavoidable for an additional movement between components.

Together, the algorithm must pay additional costs of at least (α1)ki𝛼1superscript𝑘𝑖(\alpha-1)k^{i}( italic_α - 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT for each component, resulting in total costs of at least k(α1)ki+(k+1)ki+(i1)(αk+1)ki+ki+1=i(αk+1)ki+ki+1𝑘𝛼1superscript𝑘𝑖𝑘1superscript𝑘𝑖𝑖1𝛼𝑘1superscript𝑘𝑖superscript𝑘𝑖1𝑖𝛼𝑘1superscript𝑘𝑖superscript𝑘𝑖1k(\alpha-1)k^{i}+(k+1)k^{i}+(i-1)(\alpha k+1)k^{i}+k^{i+1}=i(\alpha k+1)k^{i}+% k^{i+1}italic_k ( italic_α - 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + ( italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + ( italic_i - 1 ) ( italic_α italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT = italic_i ( italic_α italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT.

An optimal solution however arrives at the end vertex of each component first, thus being able to leave each component of recursion depth i𝑖iitalic_i with the cheap kisuperscript𝑘𝑖k^{i}italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT edge from its start vertex to the end vertex of the next component. Overall, a traversal of a depth i𝑖iitalic_i component costs k+1𝑘1k+1italic_k + 1 times kisuperscript𝑘𝑖k^{i}italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT for the level i𝑖iitalic_i traversals, and k𝑘kitalic_k traversal of the depth i1𝑖1i-1italic_i - 1 components. This sums up to k((i1)(k+1)ki1+ki)+(k+1)ki=i(k+1)ki+ki+1𝑘𝑖1𝑘1superscript𝑘𝑖1superscript𝑘𝑖𝑘1superscript𝑘𝑖𝑖𝑘1superscript𝑘𝑖superscript𝑘𝑖1k((i-1)(k+1)k^{i-1}+k^{i})+(k+1)k^{i}=i(k+1)k^{i}+k^{i+1}italic_k ( ( italic_i - 1 ) ( italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i - 1 end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT ) + ( italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT = italic_i ( italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT.

Overall, for a given k𝑘kitalic_k, no online algorithm is able to achieve a better competitive ratio than i(αk+1)ki+ki+1i(k+1)ki+ki+1=i(αk+1)+ki(k+1)+k𝑖𝛼𝑘1superscript𝑘𝑖superscript𝑘𝑖1𝑖𝑘1superscript𝑘𝑖superscript𝑘𝑖1𝑖𝛼𝑘1𝑘𝑖𝑘1𝑘\frac{i(\alpha k+1)k^{i}+k^{i+1}}{i(k+1)k^{i}+k^{i+1}}=\frac{i(\alpha k+1)+k}{% i(k+1)+k}divide start_ARG italic_i ( italic_α italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT end_ARG start_ARG italic_i ( italic_k + 1 ) italic_k start_POSTSUPERSCRIPT italic_i end_POSTSUPERSCRIPT + italic_k start_POSTSUPERSCRIPT italic_i + 1 end_POSTSUPERSCRIPT end_ARG = divide start_ARG italic_i ( italic_α italic_k + 1 ) + italic_k end_ARG start_ARG italic_i ( italic_k + 1 ) + italic_k end_ARG to traverse through a component of recursion depth i𝑖iitalic_i. It shows that there cannot be a constant competitive ratio of less than α𝛼\alphaitalic_α, when i𝑖iitalic_i and k𝑘kitalic_k are chosen sufficiently large. ∎

Remark 1

Remark that if α2𝛼2\alpha\geq 2italic_α ≥ 2, we still get a lower bound of 2222 by applying the proof of Theorem 2.1.

This competitive ratio can be achieved by a trivial algorithm that completely ignores the revealed edge weights:

Theorem 2.2

An algorithm that precomputes the shortest possible walk covering all vertices from start to end vertex, based on the lower bounds of the edges and sticks to it, is at most α𝛼\alphaitalic_α times worse than an optimal tour when α𝛼\alphaitalic_α is the maximum factor between the announced bounds of an edge.

Proof

The precomputed walk can be at most α𝛼\alphaitalic_α times more expensive than the sum of the lower bounds of the edges. However, since no solution can be better than the precomputed best tour based on the lower bounds, the precomputed walk is at most α𝛼\alphaitalic_α times more expensive than any valid travelling salesman walk. ∎

Note that the running time of this algorithm will probably be exponential in the worst case, as it needs to solve an NP-hard problem.

3 Uniform announced Edge Weights and restricted Graph Classes

As we have seen in the previous section, we cannot hope for an algorithm that beats the (from the online perspective) trivial precomputing algorithm in general. Therefore, it is interesting to see if there are algorithms that actually perform better on restricted graph classes, using the revealed edge weights during the algorithm’s traversal throughout the graph. A common restriction we will use in this section is the assumption that the graph has uniform announced edge weights [1,α]1𝛼[1,\alpha][ 1 , italic_α ] for all edges. This allows us to achieve improved results e.g. for complete graphs, as without restrictions of the edge weights every graph structure can implicitly be constructed by using edges of arbitrary large announced weight.

3.1 Recalculating Algorithm

An intuitive candidate is the following algorithm, that recalculates the cheapest possible walk visiting all vertices, at each step taking the newly revealed edges into account.

Adaptive Exploration Algorithm
start vertex s𝑠sitalic_s, end vertex t𝑡titalic_t
Agent A𝐴Aitalic_A starts at vertex s𝑠sitalic_s
while Graph is not fully explored do
     Calculate a worst case shortest walk from A𝐴Aitalic_A to t𝑡titalic_t, visiting all unvisited vertices.
     Move A𝐴Aitalic_A to the next vertex on the walk.
end while

In every step, the algorithm can calculate an upper bound for finishing the walk by using the revealed edge weights together with the upper bound of each weight for the unrevealed edge weights. Calculating the walk is computationally expensive, as the algorithm needs to recalculate the optimal for each visited vertex.

This algorithm can be proven to be optimal for graph classes like complete graphs or complete balanced bipartite graphs.

Theorem 3.1

Given a graph where each path can be extended to a Hamiltonian path from the start to a distinct end vertex and uniform announced edge weights [1,α]1𝛼[1,\alpha][ 1 , italic_α ] for α<2𝛼2\alpha<2italic_α < 2, the adaptive exploration algorithm reaches a competitive ratio of at most α+12𝛼12\frac{\alpha+1}{2}divide start_ARG italic_α + 1 end_ARG start_ARG 2 end_ARG.

We start with an easy observation, connecting the adaptive exploration algorithm with the nearest neighbor method.

Lemma 1

In the setting of Theorem 3.1, the adaptive exploration algorithm takes the cheapest outgoing edge towards an unvisited vertex (which is not the end vertex), until it visits the end vertex in its final step.

Note that in general the nearest neighbor method [28] is not identical with the adaptive exploration algorithm, as the nearest neighbor method does not take the global graph structure into account. For example, as the adaptive exploration algorithm computes a walk based on the worst case announcements, it will achieve a competitive ratio of at most α𝛼\alphaitalic_α, while the nearest neighbor method might only achieve a logarithmic approximation.

Proof (of Lemma 1)

At any point there is a Hamiltonian extension to the current solution. Therefore, at any point the algorithm has an option with expected best case costs of the next edge plus the amount of remaining vertices minus one (of which the weight to reach them is unknown between 1111 and α𝛼\alphaitalic_α). This beats every option that involves visiting vertices twice, so no vertex will be visited twice by the adaptive exploration algorithm.

As all outgoing edges towards unvisited vertices (that are not the end vertex) are part of Hamiltonian extensions whose expected worst case costs to the algorithm only deviate by its first edge, the decision which vertex is visited next is determined by the cheapest outgoing edge. ∎

Given this observed behaviour of the adaptive exploration algorithm, we are able to prove Theorem 3.1:

Proof (of Theorem 3.1)

First, we fix an optimal walk for the analysis.

For the sake of notation, we enumerate the vertices from 1111 to n𝑛nitalic_n in the same order as they are visited by the adaptive exploration algorithm. To prove the claim, assume that after k𝑘kitalic_k vertices the algorithm has paid the edge costs of w1,,wk1subscript𝑤1subscript𝑤𝑘1w_{1},\dots,w_{k-1}italic_w start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_w start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT. The optimal solution must be at least twice the cheaper half of the used edges of the algorithm:

Since the adaptive exploration algorithm takes the cheapest possible outgoing edge towards an unvisited vertex, we can assume that every edge leaving a vertex x𝑥xitalic_x towards a vertex y𝑦yitalic_y with y>x𝑦𝑥y>xitalic_y > italic_x costs at least wxsubscript𝑤𝑥w_{x}italic_w start_POSTSUBSCRIPT italic_x end_POSTSUBSCRIPT. Edges to already visited vertices can be cheaper. As the optimal walk is a Hamiltonian one, only two edges incident to each vertex can be used in an optimal solution.

When comparing a fixed optimal solution to the walk of the adaptive exploration algorithm, there are two types of edges this optimal solution consists of: Edges from a vertex x𝑥xitalic_x to a vertex y𝑦yitalic_y with x<y𝑥𝑦x<yitalic_x < italic_y, and edges from x𝑥xitalic_x to y𝑦yitalic_y with x>y𝑥𝑦x>yitalic_x > italic_y.

For each edge with x<y𝑥𝑦x<yitalic_x < italic_y, by Lemma 1, we know that the algorithm has paid at most as much as the optimal solution when leaving x𝑥xitalic_x.

For each edge with x>y𝑥𝑦x>yitalic_x > italic_y, it follows by Lemma 1, that the algorithm has paid at most as much when leaving y𝑦yitalic_y as the optimal solution paid for leaving x𝑥xitalic_x.

Sadly, both cannot be combined as edges might have counted in both cases. But as the optimal solution needs n1𝑛1n-1italic_n - 1 edges in total, one of the two types of edges must occur at least n12𝑛12\frac{n-1}{2}divide start_ARG italic_n - 1 end_ARG start_ARG 2 end_ARG times. Therefore, half the edges of the algorithm’s solution cost as much as half the edges of the optimal solution. Let’s say those edge weights total to K𝐾Kitalic_K.

The costs of an optimal solution are now at least n12+K𝑛12𝐾\frac{n-1}{2}+Kdivide start_ARG italic_n - 1 end_ARG start_ARG 2 end_ARG + italic_K, while the algorithm’s solution is at most α(n1)2+K𝛼𝑛12𝐾\frac{\alpha(n-1)}{2}+Kdivide start_ARG italic_α ( italic_n - 1 ) end_ARG start_ARG 2 end_ARG + italic_K. The ratio between those expressions is largest possible when K𝐾Kitalic_K is small. However, K𝐾Kitalic_K must be at least n12𝑛12\frac{n-1}{2}divide start_ARG italic_n - 1 end_ARG start_ARG 2 end_ARG. It follows that the competitive ratio of the adaptive exploration algorithm is at most α+12𝛼12\frac{\alpha+1}{2}divide start_ARG italic_α + 1 end_ARG start_ARG 2 end_ARG. ∎

Even if the restriction for the graph classes is rather strict, there are some important graph classes where Theorem 3.1 applies.

Corollary 1

For every α<2𝛼2\alpha<2italic_α < 2 and every n𝑛nitalic_n, the adaptive exploration algorithm achieves a competitive ratio of α+12𝛼12\frac{\alpha+1}{2}divide start_ARG italic_α + 1 end_ARG start_ARG 2 end_ARG on the complete graphs Knsubscript𝐾𝑛K_{n}italic_K start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT with uniform announced edge weight and distinct start- and end vertices.

Corollary 2

For every α<2𝛼2\alpha<2italic_α < 2 and every n𝑛nitalic_n, the adaptive exploration algorithm achieves a competitive ratio of α+12𝛼12\frac{\alpha+1}{2}divide start_ARG italic_α + 1 end_ARG start_ARG 2 end_ARG on the complete bipartite graphs Kn,nsubscript𝐾𝑛𝑛K_{n,n}italic_K start_POSTSUBSCRIPT italic_n , italic_n end_POSTSUBSCRIPT with uniform announced edge weights, when start- and end vertices are on different sides, and on the complete bipartite graph Kn+1,nsubscript𝐾𝑛1𝑛K_{n+1,n}italic_K start_POSTSUBSCRIPT italic_n + 1 , italic_n end_POSTSUBSCRIPT with distinct start- and end vertices on the bigger side.

Both corollaries immediately follow from Theorem 3.1.

Indeed, the performance of the adaptive exploration algorithm is optimal on those graph classes.

Theorem 3.2

No algorithm can achieve a better competitive ratio than α+12𝛼12\frac{\alpha+1}{2}divide start_ARG italic_α + 1 end_ARG start_ARG 2 end_ARG even on complete graphs with uniform announced edge weights [1,α]1𝛼[1,\alpha][ 1 , italic_α ] and α2𝛼2\alpha\leq 2italic_α ≤ 2.

Proof

Let us consider any algorithm 𝒜𝒜\mathcal{A}caligraphic_A on the complete graph K2ksubscript𝐾2𝑘K_{2k}italic_K start_POSTSUBSCRIPT 2 italic_k end_POSTSUBSCRIPT. For the first k𝑘kitalic_k rounds, we reveal that every edge adjacent to the current vertex has a weight of 1111. We call the set of vertices visited in this phase A𝐴Aitalic_A. Then, every newly revealed edge has a weight α𝛼\alphaitalic_α (the edges going to vertices in A𝐴Aitalic_A are still of weight 1111). We set B=VA𝐵𝑉𝐴B=V\setminus Aitalic_B = italic_V ∖ italic_A the set of vertices visited in this second phase.

Here, we use weights 1111 and α𝛼\alphaitalic_α, so it never makes sense for an algorithm to go back to an already visited edge, because the triangle inequality is satisfied. Indeed, imagine that 𝒜𝒜\mathcal{A}caligraphic_A follows the path v1vksubscript𝑣1subscript𝑣𝑘v_{1}\cdots v_{k}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ⋯ italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT (for k3𝑘3k\geq 3italic_k ≥ 3) at some point but had already visited v2vk1subscript𝑣2subscript𝑣𝑘1v_{2}\cdots v_{k-1}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⋯ italic_v start_POSTSUBSCRIPT italic_k - 1 end_POSTSUBSCRIPT when it was at v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT. Then, the cost of going from v1subscript𝑣1v_{1}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT to vksubscript𝑣𝑘v_{k}italic_v start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT directly is at most α𝛼\alphaitalic_α, whereas that of the path is at least (k1)2α𝑘12𝛼(k-1)\geq 2\geq\alpha( italic_k - 1 ) ≥ 2 ≥ italic_α. Thus, we can assume that |A|=|B|=k𝐴𝐵𝑘|A|=|B|=k| italic_A | = | italic_B | = italic_k.

The cost of the path computed by the algorithm is (1+α)k1𝛼𝑘(1+\alpha)k( 1 + italic_α ) italic_k, whereas there exists an optimal path of cost 2k2𝑘2k2 italic_k: with A={a1,,ak},B={b1,,bk}formulae-sequence𝐴subscript𝑎1subscript𝑎𝑘𝐵subscript𝑏1subscript𝑏𝑘A=\{a_{1},\cdots,a_{k}\},B=\{b_{1},\cdots,b_{k}\}italic_A = { italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } , italic_B = { italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , ⋯ , italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT }, the path a1b1a2b2akbka1subscript𝑎1subscript𝑏1subscript𝑎2subscript𝑏2subscript𝑎𝑘subscript𝑏𝑘subscript𝑎1a_{1}b_{1}a_{2}b_{2}\cdots a_{k}b_{k}a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ⋯ italic_a start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT only takes edges adjacent to vertices in A𝐴Aitalic_A, and which are thus of weight 1111. Thus, the algorithm cannot do better than a competitive ratio of 1+α21𝛼2\frac{1+\alpha}{2}divide start_ARG 1 + italic_α end_ARG start_ARG 2 end_ARG. ∎

Remark 2

Remark that if α>2𝛼2\alpha>2italic_α > 2, we still get a lower bound of 3/2323/23 / 2 by applying the proof of Theorem 3.2.

A lower bound of α+12𝛼12\frac{\alpha+1}{2}divide start_ARG italic_α + 1 end_ARG start_ARG 2 end_ARG for complete bipartite graphs Kn,nsubscript𝐾𝑛𝑛K_{n,n}italic_K start_POSTSUBSCRIPT italic_n , italic_n end_POSTSUBSCRIPT, matching Corollary 2, can be constructed analogously. Note that the restriction for distinct start- and end vertices is only necessary for achieving a strict bound of 1+α21𝛼2\frac{1+\alpha}{2}divide start_ARG 1 + italic_α end_ARG start_ARG 2 end_ARG, in case of identical start- and end vertex the ratio can decrease by α1n𝛼1𝑛\frac{\alpha-1}{n}divide start_ARG italic_α - 1 end_ARG start_ARG italic_n end_ARG as the final edge leading towards the start/end vertex is revealed as of the beginning, so the algorithm cannot be tricked into taking the last edge as an expensive edge.

However, the adaptive exploration algorithm is not able to outperform the trivial precomputing algorithm even for uniform announced edge weights and α<2𝛼2\alpha<2italic_α < 2, even on grids.

Theorem 3.3

The Adaptive Exploration Algorithm does not achieve a better competitive ratio than α𝛼\alphaitalic_α for graphs with announced weight [1,α]1𝛼[1,\alpha][ 1 , italic_α ] for all edges and α<2𝛼2\alpha<2italic_α < 2 in general.

Proof

We can construct a grid as an example:

s𝑠sitalic_st𝑡titalic_tα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_α11111111111111α𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_α11111111111111α𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_α11111111111111α𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_α11111111111111α𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_αα𝛼\alphaitalic_α
Figure 2: Every edge is announced with [1,α]1𝛼[1,\alpha][ 1 , italic_α ] and revealed with the labeled weight. The adaptive exploration algorithm will traverse on the broad edges, and will not take the small ones (so every second layer). An alternative solution would take the continuous ones (and takes some edges twice in the bottom layer), and ignore the dashed ones. The adaptive exploration algorithm therefore walks on a Hamiltonian path, where each edge costs α𝛼\alphaitalic_α. The alternative solution only pays 1111 for the majority of the edges in the center, but needs more edges.

Assuming the algorithm starts at the marked starting vertex s𝑠sitalic_s in the grid as depicted in Figure 2. As there exists a Hamiltonian path between the starting vertex and the end vertex t𝑡titalic_t, at each point the algorithm prefers staying on the Hamiltionian path instead of visiting a vertex twice just to take a cheaper edge, as taking an edge with weight α𝛼\alphaitalic_α is cheaper than taking two edges even with weight 1111. Also traversing between two connected unvisited vertices will cost either the unrevealed edge of at most α𝛼\alphaitalic_α, or in the best case two revealed edges of weight 1111 as well. Therefore, at every step the algorithm will take the cheapest outgoing edge that is part of a Hamiltonian path.

As in every step of the algorithm (broad path in Figure 2) there is either just one unvisited neighbor that needs to be visited in order to stay on a Hamiltonian path, or one neighbor of which is adjacent to just one unvisited vertex - which also needs to be visited to stay on a Hamiltonian path. Therefore, the adaptive exploration algorithm will traverse the grid in the predetermined path, even if all of those edges are expensive with weight α𝛼\alphaitalic_α. Note that this also seems to be the strategy which promises the cheapest path through the graph.

On the other hand, an alternative offline algorithm may ignore the fact that a Hamiltonian path seems cheaper for the moment. It also needs to visit the upper and lower row of the grid, thus taking edges with costs of α𝛼\alphaitalic_α twice for each vertex in those rows - but it can decide to take the other edges for the middle part of the grid, thus paying only 1111 for each edge but at the cost of taking additional edges for another traversal from the right to the left (depicted by the continuous line).

Overall, for a quadratic grid of n𝑛nitalic_n vertices, the adaptive exploration algorithm pays (n1)α𝑛1𝛼(n-1)\alpha( italic_n - 1 ) italic_α, while the depicted alternative solution only pays 6nα+(n2)n16𝑛𝛼𝑛2𝑛16\sqrt{n}\alpha+(\sqrt{n}-2)\sqrt{n}*16 square-root start_ARG italic_n end_ARG italic_α + ( square-root start_ARG italic_n end_ARG - 2 ) square-root start_ARG italic_n end_ARG ∗ 1. Therefore, the competitive ratio between both solutions converges to α𝛼\alphaitalic_α, as n𝑛nitalic_n gets large. ∎

The fragility of the adaptive exploration algorithm on instances like a lattice is not surprising, since it allows the adversary to develop traps. Even though it follows a walk that - based on his current beliefs - seems like an algorithm’s best hope, it does not value revealing edge weights. If the adversary needs to reveal many edges early, it prevents him from hiding edge weights in order to decide later whether they become cheap or expensive.

3.2 Outlook on other graph classes

Many lower bound constructions for the classical graph exploration problem make use of dead ends or the option to either present edges or not depending on the algorithm’s actions. This is not possible in the graph exploration problem with edge weight estimates, thus many lower bound constructions of the graph exploration problem will not work here - even for a large factor of α𝛼\alphaitalic_α.

For example, Miyazaki et al. [26] proved a matching upper and lower bound of 1+32132\frac{1+\sqrt{3}}{2}divide start_ARG 1 + square-root start_ARG 3 end_ARG end_ARG start_ARG 2 end_ARG for the graph exploration problem on a cycle graph. For a sufficiently large α𝛼\alphaitalic_α, the lower bound construction will also work out in our model with edge weight estimates, as there is no structure to hide in this graph class.

But for most graph classes, like tadpole graphs (a cycle with one attached path), the question whether a vertex is on the cycle or on the path is hidden in the classical graph exploration model. Miyazaki et al. [26] and Brandt et al. [13] also showed that even on unweighted tadpole graphs there cannot be an algorithm for the graph exploration problem yielding a better competitive ratio than 2222. This construction cannot work out for the online graph exploration problem with edge weight estimates, as it is clear which edges belong to the path and thus need to be traversed exactly twice by every reasonable algorithm, thus every edge of the path potentially only decreases the competitive ratio. Therefore, the best possible lower bound for a given α𝛼\alphaitalic_α on tadpole graphs is exactly the same construction as for the same factor α𝛼\alphaitalic_α on a cycle, and will not exceed a competitive ratio of 1+32132\frac{1+\sqrt{3}}{2}divide start_ARG 1 + square-root start_ARG 3 end_ARG end_ARG start_ARG 2 end_ARG.

4 Final Remarks and Open Problems

In this article, we introduced the concept of estimates to the edge weights of the travelling salesman problem resp. graph exploration problem. We believe that the concept is worth looking into: It is not only a natural variant of an exploration problem which can easily be motivated by practical scenarios, but also promises interesting results from the theoretical side.

Since a general solution for the graph exploration problem with edge weight estimates would solve the heavily investigated and long-standing open problem of graph exploration as a special case when the range between the lower- and upper bound gets large, we assume that such a general solution is out of reach with current state-of-the-art approaches.

Apart from the general question, it is likely fruitful to attack special cases of the graph exploration problem with edge weight estimates. This will hopefully give a better understanding also of the graph exploration problem itself, and might help to come up with a solution in the future. An interesting special case would be the setting with uniform announced edge weights, where we conjecture that there is an 1+α21𝛼2\frac{1+\alpha}{2}divide start_ARG 1 + italic_α end_ARG start_ARG 2 end_ARG-competitive algorithm for every graph structure and α2𝛼2\alpha\leq 2italic_α ≤ 2.

{credits}

4.0.1 Acknowledgements

This study was partially supported by the STACS 2024 Extended Stay Support Scheme, funded by the LaBRI, by the NSTC-CNRS International Emerging Action (IEA) 2024-2025 “Quasi-Polynomial Time Approximation Algorithms for Some Network Design Optimization Problems” under grants 113–2927–I–141–501 and 114-2927-I-141-501, and by the ANR project TEMPOGRAL (ANR-22-CE48-0001).

References

  • [1] van den Akker, E., Buchin, K., Foerster, K.: Multi-agent online graph exploration on cycles and tadpole graphs. In: Structural Information and Communication Complexity - 31st International Colloquium, SIROCCO 2024. Springer (2024)
  • [2] Ausiello, G., Feuerstein, E., Leonardi, S., Stougie, L., Talamo, M.: Algorithms for the on-line travelling salesman. Algorithmica 29(4), 560–581 (2001)
  • [3] Azar, Y., Leonardi, S., Touitou, N.: Flow time scheduling with uncertain processing time. In: Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing. STOC 2021, Association for Computing Machinery (2021)
  • [4] Azar, Y., Leonardi, S., Touitou, N.: Distortion-oblivious algorithms for minimizing flow time. In: Proceedings of the 2022 ACM-SIAM Symposium on Discrete Algorithms, SODA 2022. SIAM (2022)
  • [5] Azar, Y., Peretz, E., Touitou, N.: Distortion-Oblivious Algorithms for Scheduling on Multiple Machines. In: 33rd International Symposium on Algorithms and Computation (ISAAC 2022). Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2022)
  • [6] Baligács, J., Disser, Y., Heinrich, I., Schweitzer, P.: Exploration of graphs with excluded minors. In: 31st Annual European Symposium on Algorithms, ESA 2023. Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2023)
  • [7] Bampis, E., Escoffier, B., Gouleakis, T., Hahn, N., Lakis, K., Shahkarami, G., Xefteris, M.: Learning-Augmented Online TSP on Rings, Trees, Flowers and (Almost) Everywhere Else. In: 31st Annual European Symposium on Algorithms (ESA 2023). vol. 274 (2023)
  • [8] Bellmore, M., Nemhauser, G.L.: The traveling salesman problem: A survey. Operations Research 16(3), 538–558 (1968). https://doi.org/10.1287/opre.16.3.538
  • [9] Birx, A., Disser, Y., Hopp, A.V., Karousatou, C.: An improved lower bound for competitive graph exploration. Theoretical Computer Science 868 (2021)
  • [10] Böckenhauer, H., Fuchs, J., Unger, W.: Exploring sparse graphs with advice. Information and Computation 289(Part A) (2022)
  • [11] Böckenhauer, H., Komm, D., Královič, R., Rossmanith, P.: The online knapsack problem: Advice and randomization. Theoretical Computer Science 527 (2014)
  • [12] Borodin, A., El-Yaniv, R.: Online computation and competitive analysis. Cambridge University Press (1998)
  • [13] Brandt, S., Foerster, K.T., Maurer, J., Wattenhofer, R.: Online graph exploration on a restricted graph class: Optimal solutions for tadpole graphs. Theoretical Computer Science 839 (2020)
  • [14] Christofides, N.: Worst-case analysis of a new heuristic for the traveling salesman problem. Carnegie Mellon University 3,  10 (03 2022)
  • [15] Dantzig, G.B., Fulkerson, D.R., Johnson, S.M.: Solution of a large-scale traveling-salesman problem. Oper. Res. 2(4), 393–410 (1954)
  • [16] Dobrev, S., Královic, R., Markou, E.: Online graph exploration with advice. In: Structural Information and Communication Complexity - 19th International Colloquium, SIROCCO 2012. Springer (2012)
  • [17] Eberle, F., Lindermayr, A., Megow, N., Nölke, L., Schlöter, J.: Robustification of Online Graph Exploration Methods. Proceedings of the AAAI Conference on Artificial Intelligence 36(9) (2022)
  • [18] Gehnen, M., Lotze, H., Rossmanith, P.: Online Simple Knapsack with Bounded Predictions. In: 41st International Symposium on Theoretical Aspects of Computer Science (STACS 2024). vol. 289. Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024)
  • [19] Kalyanasundaram, B., Pruhs, K.R.: Constructing competitive tours from local information. Theoretical Computer Science 130(1) (1994)
  • [20] Karlin, A.R., Klein, N., Gharan, S.O.: A (slightly) improved approximation algorithm for metric TSP. In: Khuller, S., Williams, V.V. (eds.) STOC ’21:. pp. 32–45 (2021)
  • [21] Karp, R.M.: Reducibility among Combinatorial Problems. Springer US (1972)
  • [22] Komm, D.: An Introduction to Online Computation – Determinism, Randomization, Advice. Texts in Theoretical Computer Science. An EATCS Series, Springer (2016)
  • [23] Liu, F., Lu, C., Gui, L., Zhang, Q., Tong, X., Yuan, M.: Heuristics for vehicle routing problem: A survey and recent advances (2023), https://arxiv.org/abs/2303.04147
  • [24] Manasse, M., McGeoch, L., Sleator, D.: Competitive algorithms for on-line problems. In: Proceedings of the Twentieth Annual ACM Symposium on Theory of Computing. p. 322–333. STOC ’88, Association for Computing Machinery, New York, NY, USA (1988)
  • [25] Megow, N., Mehlhorn, K., Schweitzer, P.: Online graph exploration: New results on old and new algorithms. Theoretical Computer Science 463 (2012)
  • [26] Miyazaki, S., Morimoto, N., Okabe, Y.: The online graph exploration problem on restricted graphs. IEICE Trans. Inf. Syst. 92-D (2009)
  • [27] Purohit, M., Svitkina, Z., Kumar, R.: Improving online algorithms via ML predictions. In: Advances in Neural Information Processing Systems. Curran Associates, Inc. (2018)
  • [28] Rosenkrantz, D.J., Stearns, R.E., Lewis, II, P.M.: An analysis of several heuristics for the traveling salesman problem. SIAM Journal on Computing 6(3) (1977)
  • [29] Schrijver, A.: On the history of combinatorial optimization (till 1960). In: Discrete Optimization, Handbooks in Operations Research and Management Science, vol. 12, pp. 1–68 (2005)
  • [30] Sleator, D.D., Tarjan, R.E.: Amortized efficiency of list update rules. In: DeMillo, R.A. (ed.) Proceedings of the 16th Annual ACM Symposium on Theory of Computing, April 30 - May 2, 1984, Washington, DC, USA. ACM (1984)