The Comparison of Neural Networks’ Structures for Forecasting

Document Type : Technical Note

Authors

1 Al-Qualsadi Research and Development Team, National Higher School for Computer Science and System analysis (ENSIAS), Mohammed V University, Rabat, Morocco

2 Laboratory LSE2I, National School of Applied Sciences (ENSAO), Mohammed first University, Oujda, Morocco

Abstract

This paper considers the application of neural networks to demand forecasting in a simple supply chain composed of a single retailer and his supplier with a game theoretic approach. This work analyses the problem from the supplier’s point of view and the employed dataset in our experimentation is provided from a recognized supermarket in Morocco. Various attempts were made in order to optimize the total network error and the findings indicate that different neural net structures can be used to forecast demand such as Adaline, Multi-Layer Perceptron (MLP), or Radial Basis Function (RBF) Network. However, the most adequate one with optimal error is the MLP architecture.

Keywords

Main Subjects


1. Introduction

Since the relationships between supply chain (SC) members have an important influence on the whole supply chain performance, “the research focus in supply chain management, recently, has shifted from inter-functional to inter-organizational integration and coordination” (Jain & Dubey, 2005). Many researchers, such as (Cachon, 2003) or (Gomez-Padilla, 2005), studied SC coordination to manage interdependencies among its members  for approaching a decentralized system in order to work together and act in way that guaranties the alignment of the plans and objectives defined mutually as a centralized system. Different coordination mechanisms are presented in the literature (Arshinder, Kanda, & Deshmukh, 2008). This paper focuses on coordination among a two-echelon supply chain of a single retailer and his supplier by sharing demand information, and more precisely, on how to predict demand level using different neural network models in case no information is provided.

This paper is structured as follows: The first section is dedicated to the literature review and the definitions of different key concepts in our research field. Then, in the second section, the problem statement and the proposed solution are presented. Finally, the third part describes and analyses the numerical experimentations.

1.1.  Definition of basic concepts

1.1.1. Information sharing in supply chains: Game theory approach

“Game theory is a bag of analytical tools designed to help us understand the phenomena that we observe when decision-makers interact” (Osborne & Rubinstein, 1994). This theory was first introduced in the 40’s (Morgenstern & Neumann, 1944), it is concerned with the analysis of situations of conflict as well as of cooperation where two or more decision makers, called players affect each other’s payoffs. In order to apply game theory, two assumptions (Colman, 1998) must be made: (a) rationality of players in the sense of always seeking for the maximum of their own expected utilities, and (b) common knowledge where each player take into account the information he has on the other player’s strategies (Schotter, 1996).

Games can be categorized into two sets: non-cooperative games and cooperative games. In the first category, communication or collaboration among players are not allowed in any way. However, in the second category, players are allowed to adopt negotiations and agreements in solving out the problem.

1.1.2. Information sharing as a cooperation tool

In this case study, sharing information as a cooperation mechanism means that retailer and supplier act in a cooperative manner and exchange demand information and action plans in order to align their forecasts for capacity and long-term planning. However, by revealing his private information, retailer could lose the benefit of negotiating future prices. Therefore, sharing information depends on whether it increases the supply chain’s performance (Ha, Tong, & Zhang, 2010).

Information sharing in supply chains has many obstacles (Lee & Whang, 2000) such as the confidentiality, rapidity, trustworthiness, and most importantly accuracy of the provided information. Thus, and for the sake of all supply chain partners, trust and cooperation are two of the most important ingredients.

We are studying a basic SC of a single retailer (R) having an imperfect and private information about demand and his supplier (S) and focusing on the supplier’s point of view. In a game theoretic context, this can be interpreted as a game of two players (retailer, supplier) having two strategies (share, not share) for (R) and (trust, distrust) for (S). Our study focuses on the supplier’s point of view. More details about the game’s structure as well as the payoff matrix are given in the following section. The implementation of the neural networks of forecasting for finding the equilibrium of the game is not included in this work. In this paper, we are more interested in modeling our game and finding the best neural networks structure for demand forecasting when no information is shared by the retailer.

 2.  Artificial Neural networks for forecasting

2.1. Related studies

Since the artificial neural networks (ANNs) are able to learn, several research projects are found to anticipate some costly calculations. For example, in sales, it is possible to predict the upcoming requests that provide a more accurate vision of production or stock level (Slimani, ElFarissi, & Achchab, 2015): (Borade and Bansod,2011) made a comparison of neural network forecasts on the basis of costs and profits in supply chain using three multi-criteria decision-making tools for evaluation. The results indicated that this method helps companies save money and improve their profitability by reducing supply chain costs.

(Doganis, Alexandridis, & Panagiotis, 2006) Present a complete framework that can be used for developing nonlinear time series sales forecasting models and apply a combinatory technique of two artificial intelligence methods, namely the RBF (Radial Basis Function) neural net architecture and a specifically designed genetic algorithm (GA) for forecasting sales data of fresh milk. Other researchers applied ANNs with fuzzy systems in order to predict demand in supply chains, the results show that adding fuzzy logic to neural networks learning ability gives more accurate predictions. (e.g. (Chang & Wang, 2006), (Gumus, Guneri, & Keles, 2009), (Wang, Chen, Wang, & Lin, 2006)).

Since ANN algorithms have the  potential of accommodating the non-linear data to capture the subtle functional relationships among empirical data, (Kumar, Herbert, & Rao, 2014) use ANN for demand forecasting so that demand and supply are in balance by fulfilling customer’s demand and reducing excess inventories. They also present a comparative analysis of different training methods of neural network using the results obtained from the demand-forecasting model.

(Efendigil, Önüt, & Kahraman, 2009) Present a comparative analysis between ANN’s models and adaptive network-based fuzzy inference system (ANFIS) to forecast demand of future period with incomplete information. Using real-world data for 96 months from an industrial company active in durable consumer goods Turkey of three retailers. Through their experimentations, the authors demonstrate that ANFIS method gives closer forecasted values than the ANNs technique.

ANN is utilized in this work to predict future demand based on previous information. This case scenario is adopted by the supplier when no information is shared or when the information shared by the retailer is not trustful.

Different structures of neural networks are used in the literature, such as Adaline, MLP and RBF. But the most commonly used structure of neural nets is the MLP model. This architecture shows the best results for demand forecasting.

2.2. Artificial Neural network

2.2.1. What is ANN?

A neural network is composed of at least two components, i.e. neurons and the connections between them known as links. The functioning of biological human neurons (McClelland, Rumelhart, & Hinton, 1986) (Luger & Stubblefield, 1993) that receives a signal and then emits an output signal after his activation inspires neural networks (see Figure 1). In the biological neuron, dendrites transport the electrical signals sent from other neurons to the cell body and the output electrical signals are directed through the axon to other neurons. Analogous to this biological functioning, artificial neuron is activated after receiving signals (inputs) from other neurons. These signals are summed and delivered through an activation function, then the result value (output) is sent to the other neuron.

Figure 1. Graphic comparison between biological neuron and artificial neuron

With the capacity of learning and responding to issues such as the problems of analysis, diagnosis or prediction, the human neural network contains approximately  neurons and trillions of connections between them. In computer science terms, this is translated into ANN (Rennard, 2006) which contains layers of interconnected neurons and has the ability of learning, calculating, or forecasting. In general, ANNs are capable of responding to problems relating to what it has been learned before, as they are able to program themselves. This is what we call the supervised learning (El Farissi, Azizi, & Moussaoui, 2012).

ANN consists of several interconnected layers where each layer contains more neurons. There are three types of layers: The input layer (or predictors) that receives an external input data, the output layer used to return the result(s), and the hidden layer containing neurons helping to obtain a better result. At the beginning of the network’s construction, the number of neurons in the input and output layers are fixed, but there is no rule for determining the number of hidden layers or their neuron’s number. The network structure depends on different parameters such as the number of hidden layers, the existence/or non-existence of bias neuron, and the chosen learning rate. Therefore, the train and test method is adopted to choose the best topology that returns better results.

Unlike the traditional forecasting methods, neural networks allow complex nonlinear relationship between inputs and outputs. (Yu, Choi, & Hui, 2011) Demonstrate that ANNs are more efficient and effective than the other traditional forecasting techniques.

A neural network is composed by at least two components, neurons and the connections between them known as links. This concept is inspired from the human neuron that receives a signal then emits an output signal after his activation. With the ability of learning, calculating and forecasting, ANN is capable of solving problems based on the learning skills, also called supervised learning (ElFarissi, 2013). 

ANN is utilized in this work to predict future demand based on the previous information. This case scenario is adopted by the supplier when no information is shared or when the information shared by the retailer is not trustful.

Different structures of neural networks are used in the literature, such as Adaline, MLP, and RBF).But the most commonly used structure of neural nets is the MLP model. In our previous work, this architecture shows the best results for demand forecasting.

3. Problem statement and proposed solution

3.1.  Uncertainty and forecasting

Accuracy, ease of use, and flexibility are the ideal blend for demand planning. However, the ultimate goal of any successful SC businesses is to find the balance between satisfying consumer’s demand while minimizing their stock and avoiding excess inventory, due to the discerning and the unpredictable behavior of customers, it is a challenge to predict and project consumer’s demand in the future. That is why forecasting is one of the commonly used methods to decrease demand uncertainty in supply chains.

3.2.  Neural nets for demand forecasting

Quantifying information with accuracy is a hard, but not an impossible task to do. As a proposed solution, this work uses the artificial intelligence of neural networks to reduce demand uncertainty between a retailer and a supplier. In the realm of game theory with a cooperative approach, this can be projected as a game of two players, i.e. a single retailer and a single supplier

3.3.  Introducing the game with neural nets

3.3.1.   Game’s description

As mentioned previously, we are studying a simple form game of two players, aiming to provide a decision making tool for the supplier. We assume that each player has two strategies as explained in table 1 below:

Table 1.  Player’s strategies 

3.3.2.   Players’ expected profits

  • Retailer’s expected profit

Retailer’s profit = Sales revenue – Purchasing cost – Shortage cost – Production cost – Information sharing cost

Where:      

Purchasing cost 

Shortage cost

Production cost 

Information sharing cost 

Where

  • Supplier’s expected profit

Supplier’s profit = Sales revenue – Production cost – Shortage cost – Cost related to unsold product

Production cost 

Shortage cost 

Cost related to unsold products 

3.3.3.         Notations

The following notations are utilized in our formulation model:

: Retailer’s profit (difference between incomes and spending)

: Supplier’s profit

D: Market demand of the product during the current period

T: Length of each period

Q: Ordered quantity to the supplier per period T

r: Unit price with which the retailer buys the product from the supplier

p: Unit selling price of the product on the market by the retailer; (where r < p)

: Unit production cost of the retailer

: Unit breakdown cost of the retailer

: Unit production cost of the supplier

 Unit breakdown cost of the supplier

N: Supplier’s replenishment level, it is the quantity in stock at the end of each period before demand arrives

 Cost of unsold unit paid by the supplier

 Cost of unsold unit paid by the retailer

3.3.4.         Payoff matrix

All possible combinations of the players’ actions and their possible outcomes are presented in a matrix called the decision table or the payoff matrix. Analyzing this matrix in order to determine the optimal strategies for all players is the goal of game theory. The payoff matrix of our game is defined as follows:

In a typical form, it is a two-by-two matrix (Slimani-a & Achchab, 2014) (Slimani-b & Achchab, 2013) with each square divided in half, with one half for each one of the two players involved. Having:

Table 2. Player’s strategies 

4.  Numerical examples

4.1.  Methodology

Without accurate business information and, more precisely, demand information, all supply chain resources are useless. Since planning production, inventory, distribution policy, and even marketing would not be an easy task for managers. Therefore, the accuracy of demand forecasts in a supply chain management is an important key to competitiveness.

In this paper, various ANN models were presented and utilized to predict future demand of customer’s product based on previous information. The training and validating data were provided from a recognized supermarket in Morocco. In our experimentation, we have tried different neural networks structures like Adaline, NoProp, Perceptron, MLP and RBF aiming to compare their results and come up with the best model with the optimal error. To predict the demand of the fourth week of a month, the networks are powered by real values of the first three Mondays (d, d+7 and d+14), for example, then the value of the fourth Monday (d+21) is forecasted. Forecasts of the other days of the week are obtained by following the same method.

4.2.  Results

The network is trained using real data from a supermarket in Morocco. The training dataset contains 77 recordings and the test dataset contains 60 recordings. In our experimentation, networks are powered by three inputs and return one output. The following table represents a preview of these data:

Table 3.  Preview of preprocessed inputs and outputs

Initially, we present the Mean Squared Error (MSE) of different ANN architectures obtained during the training and the testing phases.

 Adaline

-          Training

Figure 2.  MSE training graph of Adaline

-          Test

Total MSE = 0.060704

Table 4.  Preview of testing results: Adaline

 N.B:

The number of results obtained with an absolute difference ≥ 300 is 36

 NoProp

-          Training

Figure 3.  MSE training graph of NoProp

-          Test

Total MSE = 0.271268

Table 5.  Preview of testing results: NoProp 

N.B:

The number of results obtained with an absolute difference ≥ 300 is 56

  • Radial Basis Function (RBF)

-          Training 

Figure 4.  MSE training graph of RBF

-          Test

Total MSE = 0.209663

Table 6.  Preview of testing results: RBF 

N.B:

The number of results obtained with an absolute difference ≥ 300 is 48

  • Perceptron

-          Training 

Figure 5.  MSE training graph of Perceptron

-          Test

Total MSE = 0.191204

Table 7.  Preview of testing results: Perceptron

N.B:

The number of results obtained with an absolute difference ≥ 300 is 51

  • Multi Layer Perceptron

-          Training 

Figure 6.  MSE training graph of MLP

-          Test

Total MSE = 0.098359 

Table 8.  Preview of testing results: MLP 

N.B:

The number of results obtained with an absolute difference ≥ 300 is 26

5.  Analysis

We can easily conclude from the obtained results that even if the structure Adaline has the least MSE in the testing phase, the best neural net structure is the MLP. This is due to the difference between the obtained output and the desired one. In fact, with Adaline, the number of results with difference ≥ 300 is 36. However, with the MLP structure we have got only 26 recordings.

6.   Conclusion

In this paper, various neural network architectures are implemented to test their ability to forecast demand of a supermarket in Morocco. We concluded that the MLP structure is the most adequate one in this context. For future studies, we propose to add a preparative phase before the forecasting phase using another technique such as neural networks with classification capability or fuzzy logic to guarantee the consistency and the homogeneity of the input data. Another perspective of this study consists of implementing the obtained results in the payoff matrix in order to find the equilibrium of the game.

Arshinder, K., Kanda, A. and Deshmukh, S. (2008). A review on suply chain coordination: coordination mechanisms managing uncertainty and research directions. International handbooks on information systems, Vol. 115(2), 316-335.
Arshinder, K., Kanda, A. and Deshmukh, S. (2011). A review on suply chain coordination: coordination mechanisms managing uncertainty and research directions. Berlin, Heidelberg: Springer.
Cachon, G. P. (2003). Supply Chain Coordination with Contracts. In Handbooks in Operations Research and Management Science, Supply chain management, pp. 1-126.
Chang, P.-C. and Wang, Y.-W. (2006). Fuzzy Delphi and back-propagation model for sales forecasting in PCB industry. Expert Systems with Applications, Vol. 30, pp. 715-726.
Colman, A. M. (1998). Rationality assumptions of game theory and the backward induction paradox. In Rational Models of Cognition. University of Leicester: Oxford University Press.
Doganis, P., Alexandridis, A. and Panagiotis, P. (2006). Time series sales forecasting for short shelf-life food products based on artificial neural networks and evolutionary computing. Journal of Food Engineering, Vol. 75, pp. 196-204.
Efendigil, T., Önüt, S. and Kahraman, C. (2009). A decision support system for demand forecasting with artificial neural networks and neuro-fuzzy models: A comparative analysis. Expert Systems with Applications, Vol. 36, pp. 6697–6707.
El Farissi, I., Azizi, M. and Moussaoui, M. (2012). Detection of smart card attacks using neural networks. International Conference on Multimedia Computing and Systems (ICMCS), pp. 949-954. (doi:10.1109/ICMCS.2012.6320286)
Gumus, A. T., Guneri, A. F. and Keles, S. (2009). Supply chain network design using an integrated neuro-fuzzy and MILP approach: A comparative design study. Expert Systems with Applications, Vol. 36, pp. 12570–12577.
Ha, A., Tong, S. and Zhang, H. (2010). Sharing Imperfect Demand Information in Competing Supply Chains with Production Diseconomies. Management science, Vol. 57(3), pp. 566-581.
Jain, K. and Dubey, A. (2005). Supply chain coordination: A governance perspective. Supply Chain Forum, An International Journal, Vol. 6 (2), pp. 50-57.
Kumar, P., Herbert, M. and Rao, S. (2014). Demand forecasting Using Artificial Neural Network Based on Different Learning Methods: Comparative Analysis. International journal for research in applied science and engineering technology. Vol. 2(4), pp. 364-374.
Lee, H. and Whang, S. (2000). Information Sharing in Supply Chains. International Journal of Manufacturing Technology and Management, Vol. 1(1), pp. 1-16.
Luger, G. and Stubblefield, W. (1993). Artificial Intelligence: Structures and Strategies for Complex Problem Solving. Redwood City, California: Benjamin/Cumming Publishing.
McClelland, J., Rumelhart, D. and Hinton, G. (1986). The appeal of parallel distributed processing. Parallel Distributed Processing: Explorations in the Microstructure of Cognition - Foundations, Cambridge: MIT Press.
Osborne, M. J. and Rubinstein, A. (1994). A Course in Game Theory. London, England: MIT Press.
Rennard, J.-P. (2006). Réseaux neuronaux: Une introduction accompagnée d'un modèle Java. (Vuibert, Ed.) Paris, France.
Schotter, A. (1996). Microéconomie: une approche contemporaine Vuibert. Vuibert.
Slimani, I., El Farissi, I. and Achchab, S. (2016). Coordination by Sharing Demand Forecasts in a Supply Chain Using Game Theoretic Approach. 4th IEEE International Colloquium on Information Science and Technology (CIST), pp. 122-127. doi:10.1109/CIST.2016.7805028
Slimani, I., ElFarissi, I. and Achchab, S. (2015). Application of game theory and neural network to study the behavioral probabilities in supply chain. Journal of Theoretical and Applied Information Technology, Vol. 82(3), pp. 411-416.
Slimani-a, I. and Achchab, S. (2014). Game theory to control logistic costs in a two-echelon supply chain. (IEEE, Ed.) International Conference on Logistics and Operations Management (GOL), 168 - 170. doi:10.1109/GOL.2014.6887435
Slimani-b, I. and Achchab, S. (2013). Sharing demand forecasts in a basic supply chain using game theory. (IEEE, Ed.) International Conference on Modeling, Simulation and Applied Optimization (ICMSAO),pp. 1-4. doi:10.1109/ICMSAO.2013.6552667.
Wang, C. E., J C Chen2, H.-M. W., Wang, K.-J. and Lin, Y. S. (2006). Supply Chain Inventory Strategies Using Fuzzy Neural Network. Joint Conference on Information Sciences (JCIS),pp. 1-4. doi:10.2991/jcis.2006.285.
Wang, D.-p., Shang, Q.-y. and Li, X.-y. (2017). Study on the coordination strategy of supply chain considering the uncertainty demand of product. Chinese Control and Decision Conference (CCDC), pp. 5718-5723. doi:10.1109/CCDC.2017.7978186
Yu, Y., Choi, T.-M. and Hui, C.-L. (2011). An intelligent fast sales forecasing model for fashion products. Experts systems with application, Vol. 38, pp. 7373-7379. (doi:10.1016/j.eswa.2010.12.089)