<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE article  PUBLIC "-//NLM//DTD Journal Publishing DTD v3.0 20080202//EN" "http://dtd.nlm.nih.gov/publishing/3.0/journalpublishing3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" dtd-version="3.0" xml:lang="en" article-type="research article"><front><journal-meta><journal-id journal-id-type="publisher-id">OALibJ</journal-id><journal-title-group><journal-title>Open Access Library Journal</journal-title></journal-title-group><issn pub-type="epub">2333-9705</issn><publisher><publisher-name>Scientific Research Publishing</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.4236/oalib.1104854</article-id><article-id pub-id-type="publisher-id">OALibJ-87244</article-id><article-categories><subj-group subj-group-type="heading"><subject>Articles</subject></subj-group><subj-group subj-group-type="Discipline-v2"><subject>Biomedical&amp;Life Sciences</subject><subject> Business&amp;Economics</subject><subject> Chemistry&amp;Materials Science</subject><subject> Computer Science&amp;Communications</subject><subject> Earth&amp;Environmental Sciences</subject><subject> Engineering</subject><subject> Medicine&amp;Healthcare</subject><subject> Physics&amp;Mathematics</subject><subject> Social Sciences&amp;Humanities</subject></subj-group></article-categories><title-group><article-title>
 
 
  Hybrid Delay Optimization and Workload Assignment in Mobile Edge Cloud Networks
 
</article-title></title-group><contrib-group><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Abdul</surname><given-names>Rasheed Mahesar</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Abdullah</surname><given-names>Lakhan</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Dileep</surname><given-names>Kumar Sajnani</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib><contrib contrib-type="author" xlink:type="simple"><name name-style="western"><surname>Irfan</surname><given-names>Ali Jamali</given-names></name><xref ref-type="aff" rid="aff1"><sup>1</sup></xref></contrib></contrib-group><aff id="aff1"><addr-line>Southeast University, Nanjing, China</addr-line></aff><pub-date pub-type="epub"><day>06</day><month>09</month><year>2018</year></pub-date><volume>05</volume><issue>09</issue><fpage>1</fpage><lpage>12</lpage><history><date date-type="received"><day>20,</day>	<month>August</month>	<year>2018</year></date><date date-type="rev-recd"><day>8,</day>	<month>September</month>	<year>2018</year>	</date><date date-type="accepted"><day>11,</day>	<month>September</month>	<year>2018</year></date></history><permissions><copyright-statement>&#169; Copyright  2014 by authors and Scientific Research Publishing Inc. </copyright-statement><copyright-year>2014</copyright-year><license><license-p>This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/</license-p></license></permissions><abstract><p>
 
 
  Nowadays, the usage of mobile devices is progressively increased. Until, delay sensitive applications (Augmented Reality, Online Banking and 3D Game) are required lower delay while executed in the mobile device. Mobile Cloud Computing provides a rich resource environment to the constrained-resource mobility to run above mentioned applications, but due to long distance between mobile user application and cloud server introduces hybrid delay (
  i.e.
  , network delay and process delay). To cope with the hybrid delay in mobile cloud computing for delay sensitive applications, we have proposed novel hybrid delay task assignment (HDWA) algorithm. The preliminary objective of the HDWA is to run the application on the cloud server in an efficient way that minimizes the response time of the application. Simulation re
  sults show that proposed HDWA has better performance as compared to baseline approaches.
 
</p></abstract><kwd-group><kwd>Offloading</kwd><kwd> Machine Learning</kwd><kwd> Process Delay</kwd><kwd> Transmission Delay</kwd><kwd> Delay Constraint</kwd><kwd> Augmented Reality</kwd><kwd> Video Analytics</kwd></kwd-group></article-meta></front><body><sec id="s1"><title>1. Introduction</title><p>An increasing proportion of mobile cloud application is developing day by day. Still, cloud computing provides the richer environment to run the computation workload of application for execution. Furthermore, more than thousands of running applications are by managing by rich resource cloud servers. While, each individual application is unparalleled and has different resource requirements. Nevertheless, due to long WAN far away cloud services from UE devices, it incurred hybrid latency (i.e., transmission and process delay). The central requirements of latency bound application must meet lower response time. The new paradigm mobile edge computing (MEC) has been proposed by European Telecommunications Standards Institute [<xref ref-type="bibr" rid="scirp.87244-ref1">1</xref>] . Moreover, the purpose behind MEC is to enable the remote cloud service proximity closer to the mobile network and improve the application performance with lower response time [<xref ref-type="bibr" rid="scirp.87244-ref2">2</xref>] . The MEC architecture includes fog computing, cloudlet server and edge network bring cloud resources such as computing and memory capabilities at radio access network layer with lower network delay.</p><p>In this paper, we are studying the response time minimization problem of latency bound application, and task scheduling over hybrid computing server (cloudlet and remote cloud) infrastructure simultaneously [<xref ref-type="bibr" rid="scirp.87244-ref3">3</xref>] . Whereas, the response time is the amalgam of hybrid latency (i.e., transmission and process delay). Furthermore, we divided the response time problem into three sub problems; first, application task scheduling; it is challenging to partition the application tasks dynamically and allocate the resources such as procussing and memory for local execution or bandwidth for offloading computational tasks to the local cloudlet server for execution; second, optimal network route selection; every smart applications including vehicle traffic signal, security cameras and video analytics generated stream of data at network edge; it incurred the heavy burden on network for processing the data exceeding its limit. It is challenging to optimize the route selection for computational task offloading to the local cloudlet server with minimum transmission delay; third, resource allocation for offloaded task; while cloudlet is a collection of tiny data centers, if requested workload exceeds its limits it incurred by hybrid latency (queue and process delay). It is challenging to allocate the resource optimally at a cloudlet server with minimum hybrid latency.</p><p>・ Proposed the dynamic application task scheduling framework, the ultimate goal of proposed framework which application task components either task execution locally or remaining part offloading to the cloudlet server, in order to minimize average execution time in this paper we have following contribution in this paper.</p><p>・ Proposed the machine learning (reinforcement learning) based algorithm to choose the efficient, optimize network route with lower transmission delay for computational task offloading.</p><p>・ Proposed the HDWAH heuristic which always allocates the optimal cloudlet to offloaded task with minimize response time including hybrid latency.</p><p>The rest of the paper is organized as follows. Section 2 elaborates related work and Section 3 explains the problem description and formalizes the problem under study. A heuristic is proposed for the considered problem in Section 4 which describes the propose algorithm and sequences. Section 5 evaluates the simulation part. Section 6 is about conclusions.</p></sec><sec id="s2"><title>2. Related Work</title><p>Moreover, offloading cost is the important mechanism nowadays for mobile cloud application. Existing literature research has been proposed their frameworks to improve the offloaded cost at the mobile run time environment.</p><p>In [<xref ref-type="bibr" rid="scirp.87244-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref5">5</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref6">6</xref>] Mobile Assistance User Interface (MAUI) proposed the framework in which the objective was to minimize the device energy during computation offloading. MAUI framework is consisted two run time environments (such as mobile run time and cloud run time) to support mobile cloud application. MAUI has considered only mobile end run time performance it did not consider cloud end offloading cost. CloneCloud run time environment framework for computational offloading is proposed in [<xref ref-type="bibr" rid="scirp.87244-ref7">7</xref>] . The objective is to minimize the device energy consumption. It is highly dynamic technique for computational offloading in which mobile application services wrapped into a virtual machine. Application run time manager captured the full image of the application and offload to the cloud server for execution. However, mobile application and cloud server both have a virtual environment for the offloading and execution [<xref ref-type="bibr" rid="scirp.87244-ref8">8</xref>] . CloneCloud run time focused on mobile device execution time performance and did not consider the cloud server end. Focused on mobile device execution time performance and did not consider the cloud server end. Whereas, [<xref ref-type="bibr" rid="scirp.87244-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref10">10</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref11">11</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref12">12</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref13">13</xref>] and [<xref ref-type="bibr" rid="scirp.87244-ref14">14</xref>] proposed their based partial offloading whether compute intensive tasks are offloaded to the remote cloud based on network condition.</p></sec><sec id="s3"><title>3. System Model</title><p>In this composition, the scenario is restricted by two cloudlets as shown in <xref ref-type="fig" rid="fig1">Figure 1</xref>. Whereas, different kinds of application are deployed with specific application services (i.e., augmented reality, virtual reality, and real time video analytics). u ∈ U Set of mobile users, c k ∈ C K is cloudlet servers, ξ k is the cache capacity of cloudlet server. All specified applications are wrapped into associated virtual machines (VMs). Local cloudlet are located beside mobile user base station, it is proximity closer to mobile users (UEs). The unit delay between UE and base stations is negligible. Two phases of unit delay in this account are calculated such as ∈ U C K and ∈ C K R C . Furthermore, the response time minimization problem is the combination transmission delay and process delay is describing into different parts.</p><sec id="s3_1"><title>3.1. Task Characterization</title><p>A set a cloudlet server C K = { c k 1 , c k 2 , ⋯ , c k n } whereas, c k ∈ C K and individual cloudlet has similar service rate and capacity C<sub>P</sub>. A cloudlet is the collection of homogeneous virtual machines which are deployed at cloudlet data centers. Resources (i.e., memory, CPU processing, storage and bandwidth) are similar in all set of cloudlets. If the requested user works excessive cloudlet capacity, then some tasks forwarded to the remote cloud for further execution.</p></sec><sec id="s3_2"><title>3.2. Latency Analysis</title><p>The response time of application is calculated as input from user mobility and gets back their result; it is the round trip time. The hybrid latency (transmission delay and process delay) has whipped influence on task offloading. The term latency defines many kinds of delay such transmission delay, process delay (queue and execution delay) and propagation delay between inter-servers [<xref ref-type="bibr" rid="scirp.87244-ref15">15</xref>] .</p></sec><sec id="s3_3"><title>3.4. Transmission Delay</title><p>Transmission delay has thumping in influence on mobile data offloading, network congestions occurred due to many factors such as noise, interference and network traffic [<xref ref-type="bibr" rid="scirp.87244-ref16">16</xref>] . Most of the time preceding elements is not in our control. The dynamic of network traffic is destructive for data offloading, and take in the offloading benefits less efficient due to high transmission delay. In this process integer mixed programming language is used to identify the decision variable, for instance, The decision variable one shows the mobile user offloaded the computation tasks at local cloudlet with lower transmission delay otherwise zero. X = { x u c k : c k ∈ C K , u ∈ U } with x u c k = { 0 , 1 } . Furthermore, X is the decision variable and identify either binary variable x u c k is accessed associated cloudlet ck for task execution or not and expressed as follows</p><p>X u c k = { 1 ,   if   X u c k = 1 0 ,   otherwise (1)</p><p>To end with the average transmission delay is followed:</p><p>∑ u ∈ U ∑ c k ∈ C K ∈ U C K + ∈ C K R C (2)</p></sec><sec id="s3_4"><title>3.5. Process and Queuing Delay</title><p>The system offloading takes randomly generated input with different arrival rates for I/O System, after being processed I/O System, the result will dispatch as departure [<xref ref-type="bibr" rid="scirp.87244-ref17">17</xref>] . The vector Y = { x u c k : c k ∈ C K , u ∈ U } process delay and queue has signification important in mobile cloud application. The user application generated the workload randomly and followed by Poisson Process as shown in <xref ref-type="fig" rid="fig2">Figure 2</xref>. The arrival rate is represented by λ i and before scheduling the appropriate resources to the requested, we are prioritizing the all workloads according to the given sequences (i.e., earliest deadline first, short workload first, least slack time). Furthermore, workload scheduler schedule them on the optimal cloud server k , the cloud service rate is represented by the exponential distribution μ k . All the cloud servers are geographically distributed.</p><p>y u c k = ∑ 0 1 (3)</p><p>By using queuing model [<xref ref-type="bibr" rid="scirp.87244-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref18">18</xref>] cloudlet server has two important elements; Request and arrival rate of single user make single request, and the rate at which requests are generated at λ u arrival rates. Second, service rate and server, this is part of a cloudlet server system which has service rate u c k . To evaluate the task queue wait time and process time, a system adhere by the Little’s law [<xref ref-type="bibr" rid="scirp.87244-ref5">5</xref>] as follows:</p><p>C c k u = y u c k c m u λ u u c k − ∑ u = 1 U y u c k c m u λ u (4)</p><p>This equation y u c k c m u λ u computation task offload to the cloudlet server whereas, 1 u c k − ∑ u = 1 U y u c k c m u λ u is the process and queue time in the system. Furthermore, average service rate of cloudlet must meet the service the result will dispatch as departure; the vector latency bounds such as u c k − ∑ u = 1 U y u c k c m u λ u &gt; 0 . If requested workload exceeds the capacity of cloudlet C p few of the tasks migrate to remote cloud for further execution as shown below:</p><p>R C u c k = ( 1 − ∑ c k ∈ C K ) c m u λ u R C u − ( 1 − ∑ c k ∈ C K ) λ u , (5)</p><p>( 1 − ∑ c k ∈ C K ) c m u λ u workload allocation for service at remote cloud, and it must be ( 1 − ∑ c k ∈ C K ) c m u λ u &gt; 0 . However, θ u is the processing capacity of local device to execute some small tasks on the local device? UEs workload offloaded to the proximity closer cloudlet server, the unit transmission delay follows:</p><p>T U C K = ∑ c k ∈ C K ∈ U C K ( y u c k c m u + d w u ) , (6)</p><p>Local cloudlet located on the edge network and remote cloud situated at long WAN distributed network, therefore, the unit propagation delay describes as follows:</p><p>T C K R C = ∑ c k ∈ C K ∈ C K R C ( 1 − x u c k ) d w u , (7)</p><p>The final road map of application response time is the combination of transmission delay and process delay</p><p>And depict as go behind:</p><p>T = ∑ u ∈ U ( C c k u + R C u c k + T C C K + T C K R C ) . (8)</p><p>λ ( x ) = λ 0 1 x ≤ T (9)</p></sec><sec id="s3_5"><title>3.6. Problem Formulation</title><p>The response time minimization problem has decision variables and set constraints, system follow feasible solution in linear integer programming as follows:</p><p>Minimize   T = ∑ u ∈ U ( C c k u + R C u c k + T C C K + T C K R C ) (10)</p><p>s.t. ∑ u = 1 U d w u x u c k c m u ≤ C p , ∀ c k ∈ C K , ∀ u ∈ U (11)</p><p>u c k − ∑ u = 1 U y u c k λ u &gt; 0 , ∀ c k ∈ C K , ∀ u ∈ U (12)</p><p>x u c k = { 0 , 1 } , ∀ c k ∈ C K , ∀ u ∈ U (13)</p><p>∑ u ∈ U x u c k = 1 , ∀ c k ∈ C K (14)</p><p>Equations (10) (11) illustrate that the arrival rate UEs with computational workload must be less than the service rate and capacity of cloudlet as system become more and more stable. Whereas, Equations (5) (12) (13) exemplify single workload if only allocated to single cloudlet as minimizing the chances of overhead at the system is the minimization problem; it is well known as NP-hard problem.</p></sec></sec><sec id="s4"><title>4. Hybrid Delay Workload Assignment (HDWA)</title><p>To satisfy the constraints (13), (14) and (15), we have designed the efficient HDWA to solve the problem. The primary objective of HDWA, iteratively chooses the optimal network path and cloudlet, and allocates the user computational task according to the given deadline. In algorithm 1, first we initialize end user workload assignment Z to cloudlet in step-2-3, in step-5 iteratively select the optimal cloudlet, which has not been influenced by disk bound, and allocate user with that optimal one cloudlet, in order to reduce the average response time of computational tasks; in step-7-8 allocate the resources offloaded tasks at the server which has minimum lateness and completed within deadline; step-9 terminate the loop, if all requested user has been allocated with their respective cloudlet.</p><p>The time complexity of this algorithm o ( I | log n | k ) whereas I expressed the user workload, K the number of iterations in the system, till allocated all user workloads in the system.</p><disp-formula id="scirp.87244-formula1"><graphic  xlink:href="//html.scirp.org/file/87244x42.png"  xlink:type="simple"/></disp-formula><p>Algorithm 1 is iterative in nature. Whereas, in step 1 we initialize the application workload zero; in steps 2-3, in step-5 iteratively select. The optimal cloudlet, which has not been influenced by disk bound, allocate user with that optimal one cloudlet, in order to reduce the average response time of computational tasks. In steps 7-8 we allocate the resources offloaded tasks at the server which has minimum lateness and completed within deadline, step 9 terminate the loop, if all requested user has been allocated with their respective cloudlet. The time complexity of this algorithm o ( I | log n | k ) whereas I expressed the user workload, K the number of iterations in the system, till allocated all user workloads in the system. The application response time T must be less than application deadline b.</p></sec><sec id="s5"><title>5. Experiment Analysis</title><p>In the results analysis, we are comparing the performance and evaluation of proposed Latency Aware Task Assignment (HDWA) with the conventional approach [<xref ref-type="bibr" rid="scirp.87244-ref9">9</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref10">10</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref11">11</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref18">18</xref>] and [<xref ref-type="bibr" rid="scirp.87244-ref19">19</xref>] Baseline approach [<xref ref-type="bibr" rid="scirp.87244-ref1">1</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref2">2</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref4">4</xref>] [<xref ref-type="bibr" rid="scirp.87244-ref10">10</xref>] . Based in <xref ref-type="table" rid="table1">Table 1</xref> numerical simulations evaluated the performance of the proposed algorithm with conventional and base-line approach of giving parameters. We have used core i7 dual core processor, 24-GB Ram, 2000 GB for communication and computation in our experiment.</p><sec id="s5_1"><title>5.1. Simulation Environment</title><p>We have designed a simulation framework based on SimPY python API [<xref ref-type="bibr" rid="scirp.87244-ref14">14</xref>] as shown in <xref ref-type="fig" rid="fig3">Figure 3</xref>, and moving everywhere dataset user application [<xref ref-type="bibr" rid="scirp.87244-ref14">14</xref>] as shown in <xref ref-type="fig" rid="fig6">Figure 6</xref>. Online cloud server purchased for application installation, and local cloudlet install at local city on two places. The distance between two cloudlet is 2 Km, We have examined the Pokemon (A.R) and Holo (A.R) mobile applications are on our system. These applications installed by many university students and played during peak and normal hours. Our monitoring system checks the system performance in every 1 hour. In the framework we have managed two kinds of queues (network and process) simultaneously. Overall system monitors the transmission delay, process delay and server capacity handles the different arrival of users.</p><table-wrap id="table1" ><label><xref ref-type="table" rid="table1">Table 1</xref></label><caption><title> Experiment parameters</title></caption><table><tbody><thead><tr><th align="center" valign="middle" >Notation</th><th align="center" valign="middle" >Explanation</th></tr></thead><tr><td align="center" valign="middle" >λ i</td><td align="center" valign="middle" >[1; 20] = s</td></tr><tr><td align="center" valign="middle" >Mobile users (MUs)</td><td align="center" valign="middle" >[100; 1000] Numbers</td></tr><tr><td align="center" valign="middle" >ψ U i</td><td align="center" valign="middle" >[0:2; 1:5] 10<sup>2</sup></td></tr><tr><td align="center" valign="middle" >C i</td><td align="center" valign="middle" >[1024; 5120] KB</td></tr><tr><td align="center" valign="middle" >d i</td><td align="center" valign="middle" >[1024; 10; 240] KB</td></tr><tr><td align="center" valign="middle" >∈ U K</td><td align="center" valign="middle" >2<sub>m</sub>s = MB Augmented Reality</td></tr><tr><td align="center" valign="middle" >K C</td><td align="center" valign="middle" >4<sub>m</sub>s = MB Propagation Latency</td></tr><tr><td align="center" valign="middle" >μ k</td><td align="center" valign="middle" >[2; 3:5] 10<sup>4</sup></td></tr><tr><td align="center" valign="middle" >Bandwidth (B) ξ k</td><td align="center" valign="middle" >[50; 100] = mbps [<xref ref-type="bibr" rid="scirp.87244-ref2000">2000</xref>] GB</td></tr></tbody></table></table-wrap></sec><sec id="s5_2"><title>5.2. Proposed Algorithm versus Existing Approaches</title><p>Comparison of proposed algorithm HDWA and existing approaches are followed.</p><sec id="s5_2_1"><title>5.2.1. Conventional Approach</title><p>The conventional approach has a basic offloading scheme either executes of computation task locally on the device or offloaded to remote cloud for additional processing. This approach does not handle the end to end latency due to cloud services are multiple hops away from mobile device [<xref ref-type="bibr" rid="scirp.87244-ref1">1</xref>] .</p></sec><sec id="s5_2_2"><title>5.2.2. Baseline Approaches</title><p>Baseline Approaches brings the computing resources at the mobile network edge, application scheduler randomly decides either computation task execute on the mobile device or offload to the server. This approach attempts to reduce the application response time at some extent, but requested computation workload increase its capacity, it leads lateness in systems such as queue and process delay.</p></sec><sec id="s5_2_3"><title>5.2.3. Proposed Approach (HDWA)</title><p>The proposed approach randomly takes the advantage of the local task computation at the device and local cloudlet server. If the requested computation tasks exceed the limit of cloudlet dynamically it migrates some computation tasks on the remote server for further execution.</p></sec></sec><sec id="s5_3"><title>5.3. Performance Evaluation of Different Algorithms</title><p>In order to achieve the objective of proposed strategy works better as compared to existing approaches. <xref ref-type="fig" rid="fig7">Figure 7</xref> demonstrates the performance and comparison of process delay at local device and the server. We have monitored mobile users’ range 50 to 500 with computational tasks. Simulation parameters are declared in <xref ref-type="table" rid="table1">Table 1</xref> and evaluated the performance of the proposed algorithm; it shows that HDWA 50% has a lower delay as compared to conventional and baseline approach. <xref ref-type="fig" rid="fig4">Figure 4</xref> and <xref ref-type="fig" rid="fig5">Figure 5</xref> show that proposed algorithm HDWA minimizes process delay as compared to baseline approaches. <xref ref-type="fig" rid="fig6">Figure 6</xref> shows that our proposed scheme is more optimized rather than existing baseline schemes. <xref ref-type="fig" rid="fig7">Figure 7</xref> demonstrates the performance and comparison of transmission delay during peak hours and normal hours. We have tested user mobile users randomly arrival rates with λ i with computational tasks, as we mentioned the bandwidth (B) parameter in <xref ref-type="table" rid="table1">Table 1</xref> and evaluated the performance of the proposed algorithm, it shows that HDWA 60% lower transmission delay as compared to Conventional and Baseline approach during peak hours.</p><p><xref ref-type="fig" rid="fig7">Figure 7</xref> shows that HDWA works better during peak and normal hours, whereas, task deadline completion demonstrate that HDWA has a better response time related to test applications; we have examined two applications with different latency threshold (0.3 and 0.6). The average response time of application is the sum of heterogeneous latency (transmission and process delay) and local and server execution time.</p></sec></sec><sec id="s6"><title>6. Conclusion</title><p>In this study, we have examined the problem response time minimization problem with heterogeneous latency (transmission and process delay). We have focus to support the latency sensitive application with lower response time. Our proposed methods and framework has better performance and progress as compared to existing methods and frameworks. In this paper, we have studied the minimization problem, divided it into three sub problems and efficiently solved all sub problems with minimum response time. This problem is NP-complete problem. By integer linear programming we proof it, and our proposed solution always satisfies the feasible solution. In future includes the detail study of above latency aware computation offloading problem with more detail constraints such as (load-balancing latency, communication energy, transmission energy) via geographically distributed cloudlet network.</p></sec><sec id="s7"><title>Acknowledgements</title><p>The team members Abdul Rasheed Mahesar, Abdullah Lakhan, DIleep Kumar Sajnani and Irfan Ali Jamali work hard to complete this paper, and proposed a novel idea as compared to an existing offloading scheme. Hope In future we will achieve many milestones in our lives.</p></sec><sec id="s8"><title>Conflicts of Interest</title><p>The authors declare no conflicts of interest regarding the publication of this paper.</p></sec><sec id="s9"><title>Cite this paper</title><p>Mahesar, A.R., Lakhan, A., Sajnani, D.K. and Jamali, I.A. (2018) Hybrid Delay Optimization and Workload Assignment in Mobile Edge Cloud Networks. Open Access Library Journal, 5: e4854. https://doi.org/10.4236/oalib.1104854</p></sec></body><back><ref-list><title>References</title><ref id="scirp.87244-ref1"><label>1</label><mixed-citation publication-type="other" xlink:type="simple">Kumar, K. and Lu, Y.-H. (2010) Cloud Computing for Mobile Users: Can Offloading Computation Save Energy? Computer, 43, 51-56. https://doi.org/10.1109/MC.2010.98</mixed-citation></ref><ref id="scirp.87244-ref2"><label>2</label><mixed-citation publication-type="other" xlink:type="simple">Chen, X. (2015) Decentralized Computation Offloading Game for Mobile Cloud Computing. IEEE Transactions on Parallel and Distributed Systems, 26, 974-983. https://doi.org/10.1109/TPDS.2014.2316834</mixed-citation></ref><ref id="scirp.87244-ref3"><label>3</label><mixed-citation publication-type="other" xlink:type="simple">Chen, M., Yin, Z., Li, Y., Mao, S.W. and and Leung, V.C.M. (2015) EMC: Emotion-Aware Mobile Cloud Computing in 5G. IEEE Network, 29, 32-38. https://doi.org/10.1109/MNET.2015.7064900</mixed-citation></ref><ref id="scirp.87244-ref4"><label>4</label><mixed-citation publication-type="other" xlink:type="simple">Waseem, M., Lakhan, A. and Jamali, I.A. (2016) Data Security of Mobile Cloud Computing on Cloud Server. Open Access Library Journal, 3, 1-11.</mixed-citation></ref><ref id="scirp.87244-ref5"><label>5</label><mixed-citation publication-type="other" xlink:type="simple">Deng, S.G., Huang, L.T., Taheri, J. and Zomaya, A.Y. (2015) Computation Offloading for Service Workflow in Mobile Cloud Computing. IEEE Transactions on Parallel and Distributed Systems, 26, 3317-3329. https://doi.org/10.1109/TPDS.2014.2381640</mixed-citation></ref><ref id="scirp.87244-ref6"><label>6</label><mixed-citation publication-type="other" xlink:type="simple">Zhang, W.W., Wen, Y.G. and Wu, D.O. (2015) Collaborative Task Execution in Mobile Cloud Computing under a Stochastic Wireless Channel. IEEE Transactions on Wireless Communications, 14, 81-93. https://doi.org/10.1109/TWC.2014.2331051</mixed-citation></ref><ref id="scirp.87244-ref7"><label>7</label><mixed-citation publication-type="other" xlink:type="simple">Kosta, S., Andrius, A., Hui, P., Mortier, R. and Zhang, X.W. (2012) ThinkAir: Dynamic Resource Allocation and Parallel Execution in the Cloud for Mobile Code Offloading. 2012 Proceedings IEEE INFOCOM, Orlando, 25-30 March 2012, 945-953. https://doi.org/10.1109/INFCOM.2012.6195845</mixed-citation></ref><ref id="scirp.87244-ref8"><label>8</label><mixed-citation publication-type="other" xlink:type="simple">Chun, B.-G., Ihm, S., Maniatis, P., Naik, M. and Patti, A. (2011) CloneCloud: Elastic Execution between Mobile Device and Cloud. Proceedings of the Sixth Conference on Computer Systems, Salzburg, 10-13 April 2011, 301-314. https://doi.org/10.1145/1966445.1966473</mixed-citation></ref><ref id="scirp.87244-ref9"><label>9</label><mixed-citation publication-type="other" xlink:type="simple">Soomro, A.A., Lakhan, A.R. and Khan, A. (2011) The Secure Data Storage in Mobile Cloud Computing. Journal of Information and Communication Technolog, 6, 69-76.</mixed-citation></ref><ref id="scirp.87244-ref10"><label>10</label><mixed-citation publication-type="other" xlink:type="simple">Rahimi, M.R., Jian, R., Liu, C.H., Vasilakos, A.V. and Venkatasubramanian, N. (2014) Mobile Cloud Computing: A Survey, State of Art and Future Directions. Mobile Networks and Applications, 19, 133-143. https://doi.org/10.1007/s11036-013-0477-4</mixed-citation></ref><ref id="scirp.87244-ref11"><label>11</label><mixed-citation publication-type="other" xlink:type="simple">Shiraz, M. and Gani, A. (2014) A Lightweight Active Service Migration Framework for Computational Offloading in Mobile Cloud Computing. The Journal of Supercomputing, 68, 978-995. https://doi.org/10.1007/s11227-013-1076-7</mixed-citation></ref><ref id="scirp.87244-ref12"><label>12</label><mixed-citation publication-type="other" xlink:type="simple">Chen, X., Lei, J., Li, W.Z. and Fu, X.M. (2016) Efficient Multi-User Computation Offloading for Mobile-Edge Cloud Compu-ting. IEEE/ACM Transactions on Networking, 24, 2795-2808. https://doi.org/10.1109/TNET.2015.2487344</mixed-citation></ref><ref id="scirp.87244-ref13"><label>13</label><mixed-citation publication-type="other" xlink:type="simple">Nkosi, M.T. and Mekuria, F. (2010) Cloud Computing for Enhanced Mobile Health Applications. 2010 IEEE Second In-ternational Conference on Cloud Computing Technology and Science, Indianapolis, 30 November-3 December 2010, 629-633.</mixed-citation></ref><ref id="scirp.87244-ref14"><label>14</label><mixed-citation publication-type="journal" xlink:type="simple"><name name-style="western"><surname>Yao</surname><given-names> Y.-C. </given-names></name>,<etal>et al</etal>. (<year>1987</year>)<article-title>Approximating the Dis-tribution of the Maximum Likelihood Estimate of the Change-Point in a Sequence of Inde-pendent Random Variables</article-title><source> The Annals of Statistics</source><volume> 15</volume>,<fpage> 1321</fpage>-<lpage>1328</lpage>.<pub-id pub-id-type="doi"></pub-id></mixed-citation></ref><ref id="scirp.87244-ref15"><label>15</label><mixed-citation publication-type="other" xlink:type="simple">Barbarossa, S., Sardellitti, S. and Lorenzo, P.D. (2014) Communicating While Computing: Distributed Mobile Cloud Computing over 5G Heterogeneous Networks. IEEE Signal Processing Magazine, 31, 45-55. https://doi.org/10.1109/MSP.2014.2334709</mixed-citation></ref><ref id="scirp.87244-ref16"><label>16</label><mixed-citation publication-type="other" xlink:type="simple">Sardellitti, S., Scutari, G. and Barbarossa, S. (2015) Joint Optimization of Radio and Computational Re-sources for Multicell Mobile-Edge Computing. IEEE Transactions on Signal and Infor-mation Processing over Networks, 1, 89-103.</mixed-citation></ref><ref id="scirp.87244-ref17"><label>17</label><mixed-citation publication-type="other" xlink:type="simple">Huang, D.J., Xing, T.Y. and Wu, H.J. (2013) Mobile Cloud Computing Service Models: A User-Centric Approach. IEEE Network, 27, 6-11. https://doi.org/10.1109/MNET.2013.6616109</mixed-citation></ref><ref id="scirp.87244-ref18"><label>18</label><mixed-citation publication-type="other" xlink:type="simple">Kovachev, D., Cao, Y.W. and Klamma, R. (2011) Mobile Cloud Computing: A Comparison of Application Models. arXiv:1107.4940v1.</mixed-citation></ref><ref id="scirp.87244-ref19"><label>19</label><mixed-citation publication-type="other" xlink:type="simple">Flores, H. and Srirama, S. (2013) Adaptive Code Offloading for Mobile Cloud Applications: Exploiting Fuzzy Sets and Evidence-Based Learning. Proceeding of the Fourth ACM Workshop on Mobile Cloud Computing and Services, Taipei, 25-28 June 2013, 9-16. https://doi.org/10.1145/2497306.2482984</mixed-citation></ref></ref-list></back></article>