Cristina
Moyaert
a,
Philippe
Nimmegeers
abc,
Bilal
Mellouk
a,
Dimitri
Voordeckers
d,
Paul
De Meulenaere
ef and
Pieter
Billen
*a
aIntelligence in Processes, Advanced Catalysts and Solvents (iPRACS), Faculty of Applied Engineering, University of Antwerp, Groenenborgerlaan 171, 2020 Antwerp, Belgium. E-mail: pieter.billen@uantwerpen.be
bEnvironmental Economics (EnvEcon), Department of Engineering Management, Faculty of Business and Economics, University of Antwerp, Prinsstraat 13, 2000 Antwerp, Belgium
cFlanders Make@UAntwerp, 2000 Antwerp, Belgium
dResearch Group for Urban Development, Faculty of Design Sciences, University of Antwerp, Mutsaardstraat 31, 2000 Antwerp, Belgium
eCoSys-Lab, Faculty of Applied Engineering, University of Antwerp, Groenenborgerlaan 171, 2020 Antwerp, Belgium
fAnsymo/Cosys Core Lab, Flanders Make, 2020 Antwerp, Belgium
First published on 15th January 2024
Human activity depends on resources that are often consumed without regard for their future availability. Consequently, resources in the form of raw materials and finished products are widely dispersed across society, creating energetic challenges for resource management, since the processes of procuring materials, purifying and processing them, distributing goods, and collecting waste all require significant amounts of energy. The costs and energy requirements for these activities depend on factors such as the mode of transportation and the distance travelled. Efficient transportation strategies can help reduce the negative environmental impact of human activities and ensure the sustainable use of resources for future generations. Quantifying the impact of this transport requires specific and expert logistics management knowledge. The current approach relies on information that is often not readily available, making it impractical and costly. Fast and quantitative methods to support decision making are especially needed when evaluating different potential circular economy (CE) strategies and business models that aim to reduce environmental impact by keeping materials at a high functionality level by closing material cycles (e.g., through reuse, reparation, refurbishment, remanufacturing, repurposing, recycling or material recovery). As a consequence, in this article, geospatial entropy definitions are studied as novel metrics to quantify the geospatial distribution of resources. The overall goal of this article is to review existing geospatial entropy definitions and evaluate their potential to be applied for assessing resource management strategies in view of a circular economy. In doing so, insight into the decision making behind the location of a value-added activity through a collection and processing of resources is gained, as well as how entropy can be used to support this. To achieve this, we analyse several definitions used in the field of urban sprawling, illustrate how they are calculated using conceptual examples, and translate these to relevant research questions for resource management. This analysis results in several promising definitions, which, in our view, are able to quantify the geospatial distribution of resources accurately. The resulting entropy value can then serve as a proxy for collection efforts. As a result, a viewpoint is presented on how these geospatial entropy definitions can support resource management decisions, such as the appraisal of resource/waste collection schemes and the location of processing and recycling facilities.
Environmental significanceIn the framework of sustainability, resource recovery is paramount. However, there is no easy and accessible method to optimize resource recovery. As a result, research questions about relevant collection schemes and site location decision making are currently performed based on complex (multi-objective) optimization problems, and are time-consuming and energy intensive. As a result, we investigate geospatial entropy definitions as innovative measures for quantifying the geospatial dispersion of resources, allowing for general, unambiguous and easy metrics. We have demonstrated how such geospatial entropy-based metrics can support resource management decisions, such as the evaluation of waste/resource collection schemes and the location of processing/recycling facilities, by elaborating a case study. |
Currently, society is becoming increasingly dependent on the use of these resources (i.e. metals, plastics, etc.) to the extent that policymakers and executives are rapidly gaining interest in their sustainability.8–11 As a result, sustainability goals are being set, with a circular economy (CE) being a key factor in the achievement of these goals. For instance, in 2022, the European Union (EU) introduced specific objectives to increase the EU's self-sufficiency on key raw materials needed for the green and digital transitions.12 In addition, the United Nations (UN) have set specific sustainable development goals, which include CE goals such as: ensuring sustainable consumption and production patterns, and ensuring availability and sustainable management of water and sanitation.13 However, achieving a CE comes with many challenges, including poor recycling technologies, a lack of proper resource management infrastructure, and a lack of government legislations and specific implementation guidelines. Lastly, geopolitical dynamics can also have a significant impact on achieving a CE, as geopolitics heavily influence the allocation of resources, environmental policies, the price of resources and their economic progress.
In addition to these challenges, the spread of resources and goods also poses a problem for resource management, as procuring materials, transporting them to facilities for purification and processing, distributing goods, and collecting waste all demand energy for transportation. Since economies of scale are crucial to the profitability of waste collection and waste management businesses,14–17 and the costs and energy requirements of transportation rely largely on the mode of transportation and on the distance travelled,18,19 a method which can easily quantify geospatial distributions is paramount for answering resource management related research questions.
Historically, the 4-step-model, or trip-based model, is the most commonly used aggregate technique for quantifying transportation impacts. More recently, more detailed models such as activity-based models or agent-based models (ABM) and incremental pivot-point models have been adopted.20 In their current states, the transportation impact quantification models require a lot of information, such as detailed logistics data and expert knowledge and insights, all of which incur additional costs. Lamentably, most companies do not have sufficient data and/or insight into the entirety of relevant logistic chains, resulting in an unfeasible impact quantification. In addition, even if the logistics management would be sufficiently advanced, the remaining information required for this assessment would not be readily available. As a result, the current methodology is both impractical, costly, and in certain situations even impossible. Moreover, it is important to calculate the performance of materials rather than the performance of a region or a specific logistics system. Consequently, a need for easily calculated and interpretable methods to quantify transportation impacts arises. We hypothesize that methods which quantify geospatial distributions based on readily available data can support this.6,21–23
We envision that the quantification of distributions holds the key to evaluating transportation impacts, owing to the close relationship between the distribution of materials and the energy required for their retrieval, as established by Gutowski.24 Specifically, the concentration of materials correlates inversely with the energy necessary for their collection: the more concentrated materials are, the less energy required to collect them, while a high degree of dilution leads to a corresponding increase in energy requirements. Notably, the most arduous scenario is the uniform spread, which represents the pinnacle of dilution, and demands the greatest effort for retrieval.
An established approach to quantifying distributions can be found in the field of information theory: statistical entropy (more commonly referred to as Shannon entropy, as it originates from his mathematical theory of communication). Statistical entropy quantifies the average uncertainty on a distributed variable. The higher the statistical entropy, the higher the average uncertainty and the more spread the variable is (with as maximum a uniform distribution). The lower the statistical entropy, the lower the average uncertainty on the variable and the less spread the variable is (with as minimum the ideal case in which the probability that a variable takes a specific value is 100%).25,26
The notion of statistical entropy has been applied widely: from information sciences and computer engineering to describe image randomness27 and to improve the evaluation of cross efficiencies,28 to the field of biology where it can be used to address the parameters of muscle biopotentials29 or to study the expression patterns of genes.30 Moreover, considerable advances have recently been made in the field of circular economy (CE). Several papers have been published directly linking the two concepts. This connection is achieved by interpreting a circular economy as a sustainability approach which aspires to keep entropy levels low, i.e. at a functional state, or at least an approach that allows a high entropic product or material to decrease to a low entropy state with minimum efforts. As a general rule, the maximum degree of functionality should be maintained for materials and products in a CE, whereas processes such as mixing, dilution and contamination should be avoided, since they are regarded as an entropy increasing process.21,31–35
Even though significant advances have been made in the field of CE, the allocation of resources is a new area of interest. While the concept of statistical entropy based on geospatial distributions has been studied extensively in the field of urban sprawling,32,36–39 the distribution used is mainly population based. To the authors' best knowledge, no other studies have been performed on interlinking this concept with resource management.
The goal of this paper is to provide insight into the current entropy definitions used to measure geographical distributions, as well as their possible application to the areas of CE and resource management. This entails (i) providing a clear understanding of the various entropy definitions employed and exploring their nuances, differences and similarities and (ii) assessing the suitability of different entropy definitions in capturing and quantifying geographical distributions, as well as identifying scenarios where certain definitions may be more effective or appropriate, and (iii) assessing the applicability of these definitions for resource management strategies in view of a circular economy.
The main research question of the paper corresponds with the following: how can we make decisions about where to locate a value-added activity through a collection and processing of resources based on a simple calculation that does not require much information? And how can we use entropy for that purpose? In addition to addressing the primary research question, this paper also aims to achieve a secondary goal, namely to bridge the gap between theoretical concepts and practical applications by offering a case study as a practical example. Not only does this offer concrete examples on how entropy definitions can be applied in real-world situations, but it also provides interpretations of results to enhance practical understanding, as well as providing practical recommendations or guidelines for decision-makers in CE and resource management based on the insights derived from the case study.
H = −∑pilog2(pi) | (1) |
Entropy can in essence be seen as a statistical property of distributions just as a standard deviation and an average. A high entropy corresponds with a high uncertainty and a low entropy with a low uncertainty. As illustrated in the examples in Fig. 1, entropy should not be confused with other dispersion metrics like variance, which are relative to a mean value. The multimodal distribution (Fig. 1c) has a higher variance but a lower entropy than the normal distribution in Fig. 1b.
Entropy has found its use in many different domains including power systems,41 physics,26 finance,42 biology,43 physiology,44 and cardiovascular medicine.45,46 More specifically, recent research into the Shannon entropy has further developed the concept into an assessment methodology suited for several research questions in the domain of circular economy (CE).21,31,47,48 The link between entropy and CE was first developed by Rechberger and Brunner47 in order to examine the distribution of certain compounds in waste-to-energy facilities and later modified for streams of mostly inorganic/metallic components. Parchomenko et al.31 extended the approach, which allows for statistical entropies at different levels, such as component (consisting of multiple substances, e.g., a multilayer) and product (consisting of multiple components, e.g., a drink carton), to evaluate resource effectiveness of CE strategies.
Furthermore, the concept has also been used to quantify the separation complexity of mixed plastic waste streams, as Skelton et al.49 addressed the issue of applying the metric to material mixtures with radically varied origins or compositions. Through their research, a strategy based on statistical entropy and a multi-level material classification system that is extensively applicable to many resource types over the whole life cycle has been developed. Lastly, newly found interest into assessing product recyclability from a design point-of-view is developing. This entails using entropy during the design-phase to assess the recyclability of the product, allowing for recyclability by design choices to be made.21,35 Even though the above contributions have already been made to the domain of CE, the geospatial distribution or resources and products has not been considered.
As previously mentioned, in the search for statistical entropy methods that are not contingent upon specific background systems, Statistical Entropy Analysis (SEA) was devised.47 In recent times, extensions to this approach have been introduced, giving rise to Multilevel SEA (MSEA).31 Some of the already proposed extensions include product level and component level, where the presented methodology has already proven capable of capturing unwanted entropy increases that do not correspond with a clear energy gain in recycling and, thus, likely negatively influence waste management systems.33,35 We propose an extension of this methodology to encompass a geospatial level, as the geospatial dispersion of products can impact their recyclability via numerous factors such as accessibility to recycling infrastructure, market demand for recycled materials, environmental regulations, and cultural attitudes towards recycling. Therefore, this type of distribution, as well as their link to CE will be the central thesis of this paper.
Distribution type | Probability (pi) | Mathematical formulations | Interpretation | References |
---|---|---|---|---|
Area (m2) | (3) | (10) | How distributed is an area in different sub-areas? How is an area divided among sub-areas? | Chapman (1970),39 Batty (1976)38 |
Population number | (4) | (11) | How distributed is a population in different areas? Suppose we pick one person from a larger area, what is the probability that person comes from a certain sub-area? | Chapman (1970),39 Batty (1976),38 Liu et al. (2018)36 |
Population density | (5) | (12) | How geometrical are spatial systems in terms of population density? | Batty (1976)38 |
How equal is the distribution of densities over several sub-areas? | ||||
Population density levels | (6) | (13) | How common is this level of occurrence, compared to the total occurrences? | Semple (1970)51 |
If we pick a random square, what are the chances it has a certain density level? | ||||
“Flows” from one area i to an other area j (2) | P Flow ij = Fract. Flowij (7) | (14) | Which flow is preferential over other flows? How uniform is the flow pattern between different cities? | Chapman (1970)39 |
(8) | (15) | |||
Percentage of land development pixels within buffer zones around | (9) | (16) | How is the distribution of land development densities? | Yeh and Li (2001)37 |
If we are in a random patch of developed land, what are the chances we are in a specific zone? |
This type of distribution has been studied by a number of different researchers, most important among them being Chapman39 and Batty.38 The main difference between their area distribution techniques lies in the demarcation of the sub-areas: Batty works with pre-set geographical boundaries (e.g. municipalities) while Chapman works with Thiessen polygons of municipal centres. Thiessen polygons are generated from a set of points such that each polygon defines an area of influence around its sample point. Any location inside the polygon is closer to that point than any of the other sample points. Essentially, it defines an area around a point (in Chapman's case a municipal centre), where every location is nearer to this point than to all the others.52
An alternative method of setting up an area distribution, was proposed by Liu et al. in 2018.36 They stated that the use of Shannon's entropy together with the spatial correlation rule of Tobler's Law, could be used to determine the distribution of populations in an area. Tobler's first law of geography states that “everything is related to everything else, but near things are more related than distant things”.53 Essentially, the law is founded on the idea of distance friction, or how physical separation prevents contact between locations. The difficulty or cost of an interaction increases with the distance between two locations. For instance, walking to the corner store for the groceries is more likely than driving across town for the same groceries.54
In order to establish a population distribution, only data regarding the population of the sub-areas is required, which is relatively simple to acquire. Once the distribution is obtained, this can be converted into probabilities, which form the basis of the entropy calculations. The illustration of this concept, together with an explanatory case study is elaborated in the ESI.†
Liu et al. (2018)36 defined the local spatial entropy method, which essentially consists of pairing the Shannon entropy with the Floating Catchment Area (FCA) method. FCA divides an area into a grid. In doing so, surrounding areas within a threshold distance can be searched. Next, the probability of a point i is then calculated based on the population of the surrounding 8 points in the grid, rendering the equation:
pi = population of point i/total population of 9 points | (17) |
The approach is as follows: an overarching area of different cities is scaled down to a field of points, and is then divided into Q equal squares. Each point corresponds with a city or town centre, meaning where the central part, or main business and commercial area of a city or town is located. The size of the squares is crucial: each square should contain at least one point, which means that each square should be larger than the average distance between the points. Next, each square's points are counted to get n levels of density Di, where n < Q. The amount of occurrences for each density level is reported and given the abbreviation mi. Following the above statements, it is discernible that , and the required probabilities pi for the entropy definition can be calculated as: pi = mi/∑mi. The illustration of this concept, together with an explanatory case study is elaborated in the ESI.†
An important feature of this definition is the use of squares. By dividing the total area of the studied case into squares of identical sizes, the resulting entropy is still normalized to area, but the size of the sub-areas as a parameter is redundant. Additionally, since the definition works with levels, it is not the value of the level that is important, but the distribution of the levels. Meaning, an area with levels LMMM will have the same entropy as an area with levels LHHH, since level L has one occurrence and level M/H has three. This is different from the above definitions, where the value of the ‘level’ has an effect on the entropy value of the distribution, except for the case where the distribution is uniform.
The entropy formula for this definition consists of 2 parts: (1) a general entropy part and (2) an entropy aggregation part. The general entropy part quantifies the incoming and outgoing flow of a city in relation to the incoming and outgoing flow of all surrounding cities. In general, it is a measure for the total mobility of a city within an area. The entropy aggregation part can be used to calculate the entropy of an overarching area, like a province. The calculation exists of multiplying the obtained entropy with the total flow of the overarching area, essentially creating a weighted sum. This approach allows us to directly compare different overarching areas.
In order to illustrate this concept, several cases were developed: the cases comprise of 4 cities, and in each instance, the geometric design and the separation between the cities are altered. Starting with 5 alternative patterns and 5 population number options, there is a total of 25 possible choices, which are shown in Fig. 2. The entropy values resulting from these cases are also illustrated in Fig. 2. The values of flowij and the fractional values of the flow Pij are summarized in Table S1 in the ESI.†
Fig. 2 Illustrative overview of the 25 possible combinations for flow from one area i to an other area j. |
The procedure is as follows: buffer zones are constructed around the studied town centres and roads. The width of the buffer zone is arbitrarily chosen depending on the chosen case study (for the particular work read by the authors, the buffer zone was 250 m). With help of a GIS system, a density value is obtained by using a function which determines the percentage of land developed pixels for each buffer zone. The GIS function calculates the percentage of land development by using the colours of the pixels. In doing so, a distribution pattern for land development densities of an area is obtained.
The resulting distribution is interesting for the measurement of urban sprawling, as well as for studying the development and growth of urban regions. Once land development densities are obtained, they can be converted into probabilities, which are obtained by dividing the developed area of a specific buffer, by the total developed area in all the studied buffers. The illustration of this concept, together with an explanatory case study is elaborated in the ESI.†
In the first option, the probability is defined based on the division of a larger area (e.g. a province) into sub-areas (e.g. municipalities). The probability of landing in a specific sub-area when throwing a dart at the map of the larger area represents the distribution of resources or activities across the municipalities in the province.
When all municipalities have the same surface area, it implies a uniform distribution of resources or activities. The entropy of the reaches a maximum value here. This suggests that resources are evenly distributed across the region, and there is no particular concentration or preference. If there is only one municipality in the province, it implies that all the resources or activities are concentrated in a single location. The entropy reaches a minimum value. This indicates a lack of diversity and a highly centralized distribution.
By quantifying the entropy, you can measure the level of dispersion or concentration of resources or activities across different areas. Higher entropy indicates a more evenly distributed allocation, while lower entropy suggests a more concentrated distribution. Analysing the entropy of area distributions can provide valuable insights into where it makes sense to start a new activity, expand an existing activity, or mine resources.
In the second option, the probability is defined based on the specific activities or raw materials available in an area. For example, determining the probability of growing a certain produce in a region represents the distribution of agricultural activities.
When the area is uniformly distributed in terms of different activities or raw materials, it suggests a balanced allocation of resources, and the entropy reaches a maximum value here. This implies that various activities or raw materials have an equal presence across the region. If there is only one dominant activity or raw material, it indicates a highly specialized or concentrated distribution. As a result, the entropy reaches a minimum value. For instance, if all the area is allocated to growing a certain produce, it implies a lack of diversification in agricultural activities.
By calculating the entropy, we can assess the diversity or concentration of activities or resources across different areas. Higher entropy signifies a more diverse allocation, while lower entropy indicates a more focused or specialized distribution.
In both options, the concept of entropy provides a quantitative measure to understand the distribution patterns of resources, activities, or raw materials across different areas. It can be valuable for identifying regions with potential for new activities, expansion of existing activities, or resource mining, depending on the specific context and goals.
The probability of a particular population number is determined by the scattering of the population across an area. The probability of randomly choosing a person from a specific sub-area over a larger area, represents the distribution of population.
A uniform population distribution is observed when all regions within an area have the same population size. In this case, the entropy reaches its maximum value. This suggests that the population is evenly spread across the entire area, without any specific concentration or preference for particular regions. On the other hand, if the entire population is concentrated in a single location, the entropy reaches its minimum value, namely 0. This indicates a lack of diversity and a highly centralized distribution of population.
By quantifying the entropy, we can assess the level of dispersion or concentration of population across different areas. A higher entropy value indicates a more evenly distributed population, while a lower entropy value suggests a more concentrated distribution. Measuring entropy provides valuable insights into the spatial patterns of population distribution. It helps inform decision-making processes related to resource allocation, urban planning, and policy development, allowing for targeted interventions that address disparities, promote equitable growth, and optimize resource utilization.
The entropy pattern for population density closely aligns with the distribution of area and population numbers. High entropy is observed when the population density pattern is uniform, meaning that the density is evenly distributed across the region. This uniformity can be achieved either by the population or the area size. For instance, a small area with a small population can exhibit the same density as a large area with a large population.
Conversely, low entropy is observed when the population density pattern is highly irregular. This can occur when there are extremely concentrated or diluted densities attributed to small populations in large areas or large populations in small areas, respectively. In such cases, the density distribution is uneven and exhibits a significant deviation from uniformity.
The concept of entropy in population density analysis helps us understand the level of variation and dispersion within a region. Higher entropy indicates a more uniform distribution of population density, while lower entropy suggests a more pronounced disparity in density patterns. Analysing population density entropies provides insights into the spatial distribution of population and its concentration within specific areas. This understanding can inform decision-making processes related to urban planning, resource allocation, and the development of policies aimed at addressing inequalities and optimizing resource utilization in different regions.
The entropy pattern for population density levels closely mirrors that of population density. The only distinction lies in the introduction of levels to represent different ranges of population density. High entropy in population density levels is observed when the distribution of population density across the regions is relatively uniform, regardless of the specific density values. This indicates a balanced and even distribution of population density across the area, without significant concentration or disparity in density levels. Conversely, low entropy in population density levels occurs when the distribution of population density exhibits a significant skewness or disparity among different regions. This can manifest as highly concentrated or diluted population density levels, where certain areas have notably higher or lower densities compared to others.
The introduction of density levels provides a structured and systematic framework for analysing the relationship between population density and resource distribution. This classification enables us to identify distinct density ranges and examine their specific impacts on resource distribution in a more granular manner. By studying density thresholds or ranges, we can gain insights into how different levels of population concentration influence resource availability, access, and long-term sustainability.
By considering the specific density levels, decision-makers and planners can develop targeted strategies and policies to address the unique challenges and opportunities associated with each density category. This approach aids in optimizing resource allocation, promoting sustainable development, and ensuring equitable access to resources across regions with varying population densities.
The entropy definition is based on the fractional flow of the cities, which portrays the share of each city separately in the total mobility of the area. This mobility can be interpreted as mobility of people, but also as the mobility of goods, which is where the opportunity for resource management arises from. This mobility of goods can be translated to the localization of recycling plants and/or centralized collection hubs.
The entropy definition, translated to products and materials, quantifies the entropy of expected material flows. In other words, it is capable of expressing how concentrated or dispersed material flows are. This feature of the definition can aid in assessing future logistic problems, as described above, but can also aid in the assessment of new products. The definition is capable of assessing the energy impact connected to the spread of new products, which is also dependent on the market penetration. This assessment is namely achieved by a very generic prediction of the logistic efforts required to recollect them, which is exactly what this definition is capable of quantifying.
As already mentioned, the calculated flow between two cities says something about the interaction between these cities, with interaction possibly being general trade, population interaction, shared resources, etc. A high flow value means there is a lot of potential to set up an interaction between the cities, whilst a low value means there is little potential and it is not worth setting up an interaction. Based on this, it is then possible to compare cities to one another in terms of interaction potential (or interaction possibilities/opportunities).
In addition to being a representation of interaction potential, the flow values of cities also indicate where it would make the most sense to localise collection centres. Since this definition also takes distance into account, it can be used on its own as a valuable parameter for resource management, but it can also be used as a proxy for the required energy to bridge this distance. In general, if things are close to each other and there is a large potential, it would be interesting to merge the flows/potential of the analysed cities. If the cities are very apart in distance, and have a low interaction, then it would be more efficient to keep them separated. In terms of waste management, this definition can be used to decide where it makes sense to start clustering or whether it is more useful to treat waste together or separately.
In essence, this definition takes several crucial elements into account: on the one hand, the flow of a resource, where we wish to gather huge amounts at the same collecting sites. On the other hand, the flow's entropy. The ideal location for a collection centre would be the place where both the flow and the entropy is the highest. The flow should be high because you want to transport as many resources as possible to the same spot, either in large amounts or across short distances. In addition, a high entropy also indicates that the flow from all surrounding sites to the analysed area is (near) equal, implying that no preferential flows exist inside the zone. In most cases there will be a trade-off between high entropy and high flow, and the most optimal choice will essentially rely on distance versus quantity.
This definition is of particular interest for the collection of natural resources, since GIS data is available for most of them. In addition, GIS data is also available for e.g. the location of different companies and different manufacturing industries, which would prove very interesting for a localisation research question on for example a processing plant.
In the field of urban sprawling, this entropy value is used to examine whether land development is dispersed or compact. In general, the rule of thumb for land development in urban sprawling studies is the following: the density of land development decreases as the distance from roads or the town centre increases. Since the amount of land development can be directly linked to a compact or dispersed urban settlement, it can also be used as a proxy for determining the required energy values necessary to recollect the products and resources. Evidently, a compact settlement requires less collection energy than a dispersed settlement. In addition, it is a valuable tool to gauge the rationale to recollect resources or treat waste in either central or spatially distributed hubs, especially when researching natural resources such as lignocellulose biomass.
Similarly to the previous definitions, a high entropy is achieved when the land development across buffer zones evenly distributed, with a maximum value achieved when the distribution is uniform. A lower entropy is obtained when the land development across buffer zones is irregularly distributed, meaning some buffer zones are very developed, whilst others are not. A minimum possible entropy of 0 is obtained when the land development is concentrated in only 1 buffer zone.
The results of this approach are presented by Fig. 3: in total, the province of Antwerp has been divided into 145 squares. The colour of the square represents the population number: the deeper the shade of red, the higher the population.
Fig. 3 Overview of the grid overlay for the province of Antwerp. The deeper the shade of red, the higher the population. |
Entropies can be derived from the aforementioned data using a straightforward methodology: the procedure involves dividing the population count of each square by the total population of the province of Antwerp, thereby establishing population probabilities. These probabilities provide insight into the likelihood of randomly selecting an inhabitant from a specific square. Subsequently, the calculated probabilities can be transformed into entropies utilizing eqn (11). To ensure fairness in the analysis, squares situated on the province's border, which are frequently incomplete, are excluded. Once the entropies have been calculated, the squares with the highest and lowest entropy values were highlighted on the map. The results of these calculations are set out in Fig. 4.
Fig. 4 Overview of the highest and lowest population entropies for a grid of the province of Antwerp. |
The entropies reflect the level of dispersion or concentration of the population across different sub-regions within that area. It provides insights into the variations in population density. A high entropy indicates a greater likelihood of finding an inhabitant of the province within that square. In other words, if you were to randomly choose an inhabitant of Antwerp, there would be a higher probability that the selected individual resides in a square with a dense population and thus exhibits a high entropy. In contrast, a low entropy means there is a small chance of finding an inhaitant of the province within that square, since it is not densely populated.
The entropies of the squares can also be summarized over the region, where it would give an insight into the degree of heterogeneity or homogeneity across the whole province area. Once aggregated, a higher statistical entropy suggests a more evenly dispersed population across the sub-regions within the area. This indicates a relatively balanced distribution of population density, with no significant concentration or preference for specific sub-areas. Conversely, a lower statistical entropy in population distribution implies a higher degree of concentration or disparity in population density among the sub-regions within the area.
When considering the localization of a recycling centre, it is advantageous to place it in an area surrounded by high entropy squares. This is because squares with high entropy indicate a dense population, which correlates with a higher quantity of mattresses in need of recycling. By locating the recycling centre near such areas, it ensures that the centre is in close proximity to the highest concentration of waste, reducing transportation costs and minimizing the distance that waste needs to travel for recycling purposes.
Despite its simplicity, this methodology lacks crucial attributes necessary for the localization of a recycling centre, specifically the consideration of distance and interaction with the neighboring environment. Due to this inherent limitation, the existing methodology proves to be overly simplistic in its approach, failing to align with the desired outcome of this case study. Consequently, it becomes imperative to modify and enhance the methodology to facilitate a more comprehensive and in-depth analysis.
In order to implement the FCA method, the preceding approach was once again employed for the province of Antwerp. However, a prerequisite for the successful application of FCA was to delimit the largest possible rectangle within the province. This step was necessary as the FCA method requires a defined rectangular or square shape for its successful application. Once the rectangle was outlined, a FCA analysis was performed for every square inside the rectangle. This analysis generated FCA population probabilities specific to each square. Utilizing eqn (11), these probabilities were further transformed into entropies. The entropies obtained were then visually represented in Fig. 5, with the lowest and highest entropy values being prominently depicted.
Fig. 5 Overview of the highest and lowest population entropies for a grid of the province of Antwerp using the FCA method. |
The calculation of entropies based on a floating catchment area implies that their meaning is specific to that catchment area. Therefore, the value of the entropies provides information about how the population of a particular square relates to its immediate surrounding squares. A high entropy signifies that the population distribution is localized within that specific square. This means that the probability of finding a person from the catchment area is highest in this square, indicating a higher concentration of population in that area. On the other hand, a low entropy suggests the opposite scenario within the catchment area, where the population in the square is comparatively diluted compared to the rest of the squares. In this case, the chances of encountering a person are higher in the other squares of the catchment area.
When considering the placement of a recycling centre, it is logical to choose a square with a high entropy that is surrounded by other squares with high entropy. By doing so, the centre is strategically located in an area with a dense population that generates a significant amount of mattress waste. This facilitates the efficient collection and reprocessing of materials since the centre is situated in close proximity to the source of waste. By reducing travel distances and associated costs, concentrating the recycling efforts in a high entropy area offers logistical advantages and enhances the overall effectiveness of the recycling process.
This adjustment to the approach enables a more precise estimation of the population probabilities within the context of the specific catchment areas. However, despite the introduction of the floating catchment area, the methodology still fails to account for a fundamental aspect of localization problems: distance. Furthermore, while the floating catchment area permits a more granular estimation at a local level, it overlooks interactions and considerations on a larger scale. As a result, the current methodology remains insufficient in addressing the comprehensive requirements necessary for an effective analysis in this context.
The updated methodology, featuring definition 3.5, is displayed for the geographical province of Antwerp, Belgium, which has 69 smaller municipalities and a total area of 2.9 km2. The sectioning of an area into different squares is no longer required, since the method of quantifying flows between areas takes the distance between geographical town or city centres into consideration. The geographical centre of an area is defined as the centre of gravity of the surface, or that point on which the surface of the area would balance if it were a plane of uniform thickness.
The approach is as follows: first, population statistics for the entire country of Belgium as well as the 69 smaller cities in the province of Antwerp are gathered from the federal government database.55 The total number of mattresses introduced into the Belgian market each year is also obtained, which corresponds to 930.000 matrasses each year. Since a distinction must be made between single and double mattresses, the number of mattresses is converted into ‘laying surfaces’, using data obtained from Valumat, which is the producer responsibility organisation for mattresses in Flanders. Each year, Valumat is required to report annual data on the purchase and disposal of mattresses to the regulatory authorities.
Laying surfaces are defined as the area of a mattress suited to accommodate one person, meaning a double mattress has two laying surfaces, and a single matrass has just one.
Next, to translate the obtained population data per city/town into mattresses data, the total population of Belgium is divided by the total amount of ‘laying surfaces’ sold each year, resulting in a scaling index of 5.061 (inhabitants. year/laying surfaces). This scaling index can then be used to translate the population of each town or city into a ‘laying surface’ value, visualised in Fig. 6a. To summarize, the distribution of mattresses is derived from population distribution data, and the population number is tailored to ‘laying surfaces' using Valumat data.
Once the amount of ‘laying surfaces’ is obtained, the flow between all the cities Flowij can be calculated using eqn (2) from Table 1. Instead of population numbers, ‘laying surfaces’ numbers are used to make the translation towards mattresses. Lastly, the fractional flow Fract. Flowij is calculated by dividing each flow with the total flow of the province (i.e. the sum of all the flows). The result of this translation from population to ‘laying surfaces’ per city/small town in the province of Antwerp is illustrated in Fig. 3b (as fractional flows) and Fig. 6 left in the ESI† (as normal flows). The flow probabilities PFlowij and Pi are then calculated (according to eqn (7) and (8) in Table 1), and the resulting entropy can be obtained by implementing the values in eqn (14). The overall result of these calculations, namely the entropy flow values per city/town is plotted in Fig. 3c, and Fig. 6 right in the ESI.† A detailed excel-file with the values of the Flowij and the fractional values of the flow Pij is provided with the ESI.†
In terms of collection placement, the following guidelines are suggested: on the one hand, areas with high flows are preferred, since this entails, that people, resources or even products, will move naturally towards that area. This makes these areas interesting locations for central collection centres, since this is already a naturally occurring mobility hub. An essential aspect to consider is that for this case study we assume equal thermodynamic efficiency among each transportation type, which is evidently not the case in reality. On the other hand, maximum entropy is also desired, so that the flow from different locations is equal in size. A maximum entropy means that the flows from other municipalities into the studied municipality are equal in size, meaning no preferential flow is present. We prefer this over a preferential flow (which would have a low entropy), because this would mean a municipality will have a higher potential interaction with a preferred other municipality, and therefore will have lower flows to the surrounding municipalities. For the case of a central processing location, a uniform flow pattern is preferred, since this entails the highest flows from all surrounding areas, and not just one. In theory, this point of maximum flow and entropy is the ideal point to place collection or sorting plants. However, in reality, the values will stray from this ideal point, and we will opt for the area closest to this (meaning highest flow and entropy).
To illustrate this with data from the case study, a graph (see Fig. 6d) is created with the entropy on one axis and the total flow of each region on the other. The area closest to the ideal point (the upper right hand corner), is chosen as the optimal location (in this case the city of Antwerp). The result of this analysis is shown in Fig. 6d.
To finalize this section, it is crucial to reiterate the primary objective of the study, which is to investigate and determine the optimal approach for determining the most suitable location for a recycling centre or waste collection facility. Through a process of experimentation and evaluation, it became evident that definition 3.5 best aligns with the study's objectives. This particular definition was chosen as it takes into account both neighbouring interactions and direct distances, two critical properties that were absent in the previously considered definitions. By adopting definition 3.5, the study aims to provide a comprehensive analysis that encompasses all relevant factors for effective decision-making regarding the placement of recycling centres or waste collection facilities. However, it is crucial to acknowledge that our assertion does not imply that the entropy of flows is universally regarded as the superior metric for localizing collection centres. Rather, it reflects our subjective opinion that, based on the study's specific objectives and considerations, the entropy of flows aligns most effectively with our research goals. It is important to recognize that, while the entropy of flows provides valuable insights into the dynamics and interactions within a given area, different metrics may be more appropriate or relevant depending on the specific context, and alternative viewpoints and approaches should be considered in the overall decision-making process. To provide a comprehensive overview of the various definitions and their capacity to address the research question, a summarized comparison has been compiled and presented in Table 2.
Distribution type | Relevant research questions for resource management | Required data | Sufficiently useful to support decisions around the localisation of a value-added activity around resources? |
---|---|---|---|
Area (m2) | How is area spread throughout our society? | Size and location of sub-areas | No: area solely refers to areas and areas as a proxy for quantities. It is only beneficial if we have limited data and if area is proportionate to quantities. Nevertheless, this definition only applies to the distribution of amounts and not to distances or flows |
Population number | How is population spread throughout our society? | Population and location of sub-areas | No: more realistic assessment when looking at resource quantities than area, but again, this definition only applies to the distribution of amounts and not to distances or flows |
Population density | How is population density spread throughout our society? | Size and population of sub-areas | No: distribution is hard to interpret, and does not provide clear enough information to serve as a proxy |
Population density levels | How is are population density levels spread throughout our society? | Location and either population or sub-area amounts | No: not very useful since values rely heavily on how the levels are defined. In doing so, you can inflate or diminish entropies depending on how you define your levels. From this point of view, it doesn't convey much, and it's not directly useful |
Relative location and “flows” | How are flows between areas spread throughout our society? | Population, location and distance of sub-areas/cities | Yes: this definition takes several crucial elements into account, namely the flow of a resource, and the resource flow's entropy. This definition allows for the trade-off relationship between high entropy and high flow to be studied |
Percentage of land development | How is land development spread throughout our society? | Complete map of the area and GIS system | Yes: GIS data is accessible for a wide range of resources, and for the location of various organizations, industrial sectors, and so on. This availability allows for the research questions such as the localisation of processing plants. Thus, both in terms of availability of natural resources and localisation research, it is a useful definition |
The selection of a warehouse site from a set of alternatives is a multi-objective optimization and multi-criteria decision-making problem that includes both quantitative and qualitative decision variables. Most decisions about warehouse localisation are taken depending on 5 criteria: movement flexibility, average distance to suppliers, average distance to shops, stock holding capacity and unit price.60,61 Alternatively, the flow-based entropies and the flow calculations could serve as criteria (in the ideal case both maximum) for locating the optimal site location (e.g., warehouse or collection centre), hence reducing the number of objective functions from 5 to 2. As such the dimensionality and complexity of the multi-objective optimization problem and the multi-criteria decision making problem would be reduced.
By considering economies of scale when locating recycling centres, decision-makers can estimate the optimal amount of centres, as well as ensure the centre's financial viability, cost-effectiveness, and profitability, while also maximizing the volume of materials processed and contributing to the circular economy.
Several simplified approaches, which cater to different levels of complexity and data resolution, are illustrated in Fig. 7. The most straightforward method involves assigning quantities to municipalities based on the highest concentration (a). Another simplified method involves an analysis with reduced data resolution, focusing on the distribution of items relative to a central point, typically a municipality centre. In addition entropy is only calculated based on the quantity of mattresses, removing the need for distances (b). A more nuanced approach introduces a maximum collection distance of 200 km, delineating catchment areas around each municipality centre. Within each province, the catchment areas are assessed to determine the number of mattresses present. This allows for the identification of maximum entropy and flow values within each catchment area, revealing the areas where mattress concentration and distribution are most pronounced. Since a catchment area is introduced, the resolution of data is lowered and less data and analysis is required (c). An alternative approach would be to reduce the geographical area to a square grid, thus reducing the complexity of distance data, since all distances are easily calculated mathematically (d). Another, even simpler approach considers only the 8 neighbouring municipalities of this grid and the centre municipality, narrowing the approach down to only 9 municipalities. This approach inherently acknowledges the disadvantage of municipalities without neighbours (e). A last approach could be to tackle the analysis with a multiresolution approach. One could first analyse the area on a higher aggregation level and reduce the resolution of the approach as you go to lower aggregation levels. This way, the required amount of data and data processing effort are reduced (f).
For the specific case of localising recycling centres, our research points to flow-based statistical entropy as the most promising method. The case study focusing on Antwerp demonstrated that the ideal location for such centres is where both entropy and flow are maximised. However, the impracticality of localising a recycling centre in Antwerp underscores the importance of considering the geographical variability of a region in interpreting results and making informed decisions.
We recognise geospatial entropy as a valuable proxy for estimating the energy requirements associated with material recollection and transport. Beyond this application, our method can answer additional research questions, such as the determination of the most effective resource collection methods and the optimal localisation of waste processing plants. The flexibility of the employed methodology makes it an elegant tool for addressing a spectrum of research questions within the broader field of resource management.
X i /i | statistical event, for this specific paper a city or town |
H | Statistical entropy (or Shannon entropy) |
N | Amount of statistical events |
p i | Probability of event i |
Areai | Area of event i |
Popi | Population of event i |
m i | Number of occurrences for each level of density i |
p pop i | Population number for event i |
d ij | Distance between events i and j (in km) |
Fract. Flowij | Flow between events i and j, normalised to sum = 1 for all flows |
DENi | Density of event i |
Q | Number of squares |
D i | Level of density |
CE | Circular economy |
EU | European Union |
UN | United Nations |
TEA | Techno-economic analysis |
LCA | Life cycle assessment |
SEA | Statistical entropy analysis |
MSEA | Multilevel statistical entropy analysis |
FCA | Floating catchment area |
GIS | Geograhphical information system |
PWSS | Public warehouses selection support |
QFD | Quality function deployment |
AHP | Analytic hierarchy process |
ABM | Agent based model |
EoS | Economies of scale |
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3va00166k |
This journal is © The Royal Society of Chemistry 2024 |