- Research
- Open access
- Published:
Content and buffer status aware packet scheduling and resource management framework for video streaming over LTE system
EURASIP Journal on Image and Video Processing volume 2017, Article number: 73 (2017)
Abstract
With the development of the video encoding and wireless communication technologies, DASH (Dynamic Adaptive Streaming over HTTP) services have an increasing and great share of all the mobile services. However, we find some problems which still need to be addressed for DASH service optimization: (1) the limitation of the video segment representations cannot keep pace with the change of wireless channel states; (2) the characteristics of the video transmission have been not fully utilized in current DASH services; and (3) long interruption time will yield poor QoE (Quality of Experience). To solve these problems, we build a new transmission architecture by modifying the traditional TCP transmission flow. We first add a TCP (Transmission Control Protocol) proxy responsible for discarding the expiry packets automatically before putting them into the TCP sending window and then build a “ACK (acknowledgement) reconstruction module” to reconstruct the received ACK for concealing the packets discarded by the packet scheduler. Based on the new framework, we explore the interdependence among all the packets from the encoder to indicate the importance of each packet and update the interdependence relationship of every scheduling period based on the feedback ACK information. At MAC (Media Access Control) layer, a buffer status estimation module is employed to estimate the client buffer and playback information, which can be used to calculate the packet urgency. Then, a cross-layer design, which consists of an application layer of media server and client, TCP layer, MAC layer, and physical layer, is formulated and the packet scheduling and resource allocation can be jointly optimized. At client, through the analysis of the status of client buffer and MAC queue, an adaptive segment request scheme is developed to determine how and when to send the segment request. Simulation results show that the proposed algorithms can efficiently improve the received video quality as well as the playback continuity compared with other existing algorithms.
1 Introduction
1.1 Motivation
With the development of the video compression technology and the wireless network, video streaming in wireless system has taken a growing part of mobile service and attracted more and more attention. Representing the next generation in mobile technology, LTE system based on OFDMA (orthogonal frequency division multiple access) and MIMO (multiple-input multiple-output) technologies can promise faster speed and better coverage. Therefore, it is crucial to investigate how to provide satisfactory multi-client video streaming services to mobile clients by using this promising technology.
Researches have been carried out to enable high-quality video services like HTTP streaming over TCP (Transmission Control Protocol), where each video sequence is pre-encoded into many segments which cannot start to play until all the packets of it have been received [1, 2]. However, besides the high received video quality, continuous playback experience is also an important issue to be addressed from the perspective of the end clients. In addition, due to the error concealment strategy employed in the decoding, some packet loss can be tolerated for video applications. Waiting for the arrivals of all the packets inevitably leads to some unnecessary interruptions for the playback. There are some prior works [3–5] in which the playback time of each packet was fixed. Those algorithms can provide continuous playback but may result in large fluctuation of the packet loss rate because of the time-varying channel states. To get a better trade-off between the video quality and the playback continuity, we introduce a maximum allowed waiting time for the playback of each frame. In our multi-client video streaming system, the packets whose arrival times expire the maximum allowed playback time are discarded. Besides time-varying wireless channel states and the limited availability of resources, unique characteristics of video packets such as the packet priority and stringent playback deadline are also considered in our proposed packet scheduling and RB assignment scheme.
To improve the system performance of video streaming, it is equally important to keep the client buffer and MAC queue at a desired level. Conventional strategies usually request segment with fixed time interval [1] or only consider the status of client buffer [6]. All of those algorithms may result in a low level of client buffer but too many packets accumulated in MAC queue due to the bad channel states. In a scenario with limited MAC queue buffer size, the packet loss rate would increase due to the overflow of the MAC queue. As a result, we propose a MAC (Medium Access Control) queue and client buffer aware segment request strategy to determine the appropriate time to send the segment request.
Since only the above two schemes cannot provide the best system performance, to make the packet scheduling more flexible, many prior works have been carried out to select the most appropriate video presentation or even code the video source to match the time-varying channel states [7–10] or control the client buffer length [1]. However, none of them consider the status of MAC queue. Therefore, we propose a rate adaption strategy based on the MAC queue and the client buffer.
1.2 Related work
The MLWDF scheduling rule in [11] aimed at maximizing the system throughput by considering the packet delay rather than the queue length. Considering both current channel conditions and the packet delay makes MLWDF more appropriate for real-time applications. The authors in [12–17] paid their attention to DASH service optimization by determining the segment adaptation and resource allocation scheme. The authors in [13] presented a system framework description of DASH service and mentioned many key factors that we should consider in DASH service optimization. The authors in [14] proposed a QoE-driven cross-layer optimization for wireless dynamic adaptive streaming of scalable videos over HTTP. They first designed a mapping scheme from scalable video coding layers to DASH layers. Then, they developed a cross-layer design to achieve the resource allocation by considering both the video content and the client buffer status. They also proposed a rate adaptation scheme based on the client buffer status. In general, these works are worthy to be referred and investigated. However, the limitation of the video segment representations cannot keep pace with the change of wireless channel states. Moreover, the characteristics of the video transmission have been not fully utilized in current DASH services and long interruption time will yield poor QoE. Some recent research focused on packet importance and the delay requirements. Our work [18] focused on the video content and estimated the packet importance based on the extracted information from the compressed video. Similarly, [19] employed the video compression characteristics to indicate the packet importance. The wireless channel condition was also considered by using the queue length, input rate, and output rate. However, the packet urgency has not been considered in the above work. In [3–5, 20, 21], the delay requirement depended on the time interval during which the packets can be waiting to be scheduled in MAC queue. They can have a good effect on the video streaming over UDP because the clients experience continuous playback all the time. For instance, the authors in [22] assumed that there is only one frame waiting in MAC queue and the packets of this frame which cannot be transmitted in a certain time interval will be dropped. For the video streaming over TCP, the client will wait until all the packets are received. In this case, different clients will suffer different playback experiences and different playback buffer status. The previous playback experience will affect the following playback condition. Thus, it is inappropriate to use the waiting time in the MAC queue to determine the delay requirement. To accurately examine the packet urgency, the authors in [23–25] analyzed the distribution of the packet display time and proposed a playback control algorithm for real-time video services. In [24], the frame type and its corresponding playback time are considered to improve the received video quality and playback experience. However, their proposed models cannot be adapted in wireless systems because of the varying channel state. Furthermore, since many error concealment strategies can be employed at the decoder and the loss of different video packets will have different effects on the received video quality, it is not necessary to transmit all the video packets for the video service especially in the system with limited wireless resource.
For the segment request, a client usually requests video segments with predetermined time interval (around 2 s for IIS and around 3 s for Netflix in [1]) without considering any information about client in the conventional mechanisms. This may result in some clients having too many frames stored at the client buffers while the others suffer playback buffer underflow. The authors in [6] developed the segment request strategy based on the playback information. But they ignored the information of the MAC queue and the time-varying channel states. With the time when to send the segment request known, how to select the requested segment to adapt to the time-varying channel states becomes a crucial problem to be solved. H.264 SVC (Scalable Video Coding) technology is widely employed in many researches [26–29] to provide different video qualities by encoding each video sequence into many different layers. The authors in [26, 29] exploited the cross-layer design framework for a dynamic scalable video adaptation in varying network capacity. In order to increase the number of the clients that the system can support, the authors in [27] focused on the multi-client video transmission over OFDMA-based communication system by combining the adaptive subcarrier allocation and bit loading with the transmission of the H.264 SVC encoded video sequences. Since scalable video usually consists of multiple scalable layers with different importance which brings high implementation complexity, most of the researches still focus on the pre-encoded video applications. In [7, 8], some studies have been carried out to determine the optimal coding rate that the server should provide. The authors in [7] first allowed the server to drop some unimportant frames by considering the channel states. Then, according to the dropping strategy, the optimal coding rate can be determined by using a simple rate-distortion model. In [8], to aim for maximum QoE (Quality of Experience) for clients, maximum system video throughput, and QoE fairness among clients, a cross-layer optimized coding rate adaptation scheme was proposed. However, both of them focus on the received video quality but neglect another important issue of video streaming: playback continuity. To improve the playback experience, [1] studied a no-reference QoE (quality of experience) monitoring module for adaptive HTTP streaming. The authors showed the playback interruptions can be reduced by changing the QP (quantitative parameters) of the encoder to provide a presentation with a close coding rate to the estimation transmission rate. The authors in [9, 10, 30] formulated the video streaming process with multiple links. Gouache et al. [9] provided a method to estimate the transmission rate of the next scheduling period which can be used to determine the data distribution among the servers. In [10, 30], it is assumed that each client receive the video data from different servers or the other clients in the system. The bit rate can be obtained based on the source node selection strategy. In those algorithms, the bit rate was determined with the assumption that there is no packet in MAC queue waiting to be scheduled. In fact, when we send the next segment request, there may be some packets left in the MAC queue. And due to the different channel states of the end clients, their numbers of the remaining packets maybe different from each other. Since the remaining packets would affect the transmission of the next segment, the status of the clients’ MAC queue should be considered in the bit rate determination.
1.3 Approach
In summary, we find some problems which still need to be addressed for DASH services over TCP as follows: (1) the limitation of the video segment representations cannot keep pace with the change of wireless channel states; (2) the characteristics of the video transmission have been not fully utilized in current DASH services; (3) long interruption time will yield poor QoE. Inspired by the above prior works, we propose a buffer status and content aware packet management framework in LTE system. Our work is novel in the following aspects.
-
A new transmission framework for DASH services: We modify the transmission flow of the whole DASH service. With the purpose to transmit the packets which are really important to the received video quality as soon as possible and discard the ones that have little effect on the improvement of the QoE level of all the clients. We added a TCP proxy to control the main functions of the TCP layer. It is responsible for “Packet rearrange” which can discard the expiry packets automatically according to the feedback information from the packet urgency estimation at the MAC layer and the dropped packets will not be put into the “Packet sending window.” At the MAC layer, we build a “ACK reconstruction module” to reconstruct the received ACK for concealing the discarded packets. With the aim of maximizing the sum utility of the scheduled packets, some packets with little utility will be discarded instead of wasting lots of time for their arrival at the client. At the client, we introduce a maximum allowed waiting time for each segment. The client does not need to wait until all the video packets arrive at the client. Then, based on the new transmission architecture, we can improve the QoE of DASH service by jointly optimizing the application layer of media server and client, TCP layer, MAC layer, and PHY layer. The strategy of packet scheduling, resource allocation, and segment request can be obtained in our new framework.
-
A new packet importance estimation and update method based on the feedback ACK: We explore the interdependence among all the packets to indicate the importance of each packet. Specially, we update the interdependence relationship between the packets waiting to be scheduled and the packets already transmitted or lost based on the feedback ACK information. This update mechanism makes the packet importance more close to the real contribution that the packet can create. Moreover, we take into consideration the playback information to predict the packet urgency.
-
A rate adaptation scheme based on both MAC queue and the client buffer status: We consider the MAC queue status in our rate adaptation scheme. We use the playback information of client to estimate the continuous playback time that the packets in MAC queue can support. Then, by considering the MAC queue, client buffer, and the estimated transmission rate, we can acquire the urgency of each client and determine when and how to send the new segment request.
The rest of the paper is organized as follows. Section 2.1 describes the system model, and Section 2.2 presents how to calculate the packet utility. Section 2.3 show the problem formulation and solution to our proposed packet scheduling and resource management scheme. Section 2.4 presents a rate adaption scheme to derive when and which presentation should be requested and selected. Section 3 provides the discussion and the experimental results of the proposed algorithms. Finally, we draw the conclusion in Section 4. We provide a list of abbreviations used in this paper as shown in Table 1
2 Methods
2.1 System Model
Figure 1 shows the framework of this paper. In this system, K video clients can request different video sequences they want. Each video sequence is pre-encoded into a series of video segments with different coding rates (different presentations) by employing the H.264/AVC encoder in advance. The segments are stored at the media server attached to eNodeB via a lossless wired network, and the clients can send the segment requests to the media server. The packet priority can be determined based on the information extracted from the encoder and the feedback ACK information.
We build a TCP proxy to control the main functions of TCP layer. It is responsible for “Packet rearrange,” “ACK receiving window,” and “Packet sending window.” The module “Packet rearrange” can discard the expiry packets according to the feedback information from packet urgency estimation at the MAC layer automatically, and the dropped packets will not be put into the “Packet sending window.” In other words, these packets discarded by TCP have no chance to be scheduled and transmitted. The packets not expiry will be put into “Packet sending window” according to the receiving ACK information from the “ACK receiving window.”
At the MAC layer, the playback information can be estimated based on the feedback information from the clients. Considering the estimated playback information and the packet priority, we can calculate the importance of each packet. Based on the packet importance and the feedback CQIs (channel quality indicator), different RBs are assigned to different clients to maximize the system performance. We build a “ACK reconstruction module” to reconstruct the received ACK for concealing the discarded packets at the MAC layer. It depends on the received ACK information from the “ACK sending window” at the client and the packet scheduling strategy at the MAC layer. When we consider the packet utility during the resource allocation to maximize the sum utility of the scheduled packets, some packets with little utility will be discarded instead of wasting lots of time for their arrival at the client. The ACK of these discarded packets will be constructed and integrated with the received ACKs by the “ACK reconstruction module” and then sent to the “ACK receiving window” at the TCP layer. And thus, the dropped packets at the MAC layer are concealed during the TCP flow. In addition, at the MAC layer of eNodeB, we focus on the estimation of the playback time that the packets in MAC queue can support by using the estimated buffer status and the predicted transmission rate.
At the client, we build “ACK sending window” and “Packet receiving window” to simulate TCP flow. We give a maximum allowed waiting time for each segment. The packets violate its deadline will be discarded by the MAC layer and TCP layer. The exact buffer information helps us to easily get how long the remaining buffered frames in client buffer can take. Based on the analysis of the client buffer and MAC queue, an adaptive segment request strategy is developed to send each segment request at an appropriate time. Knowing when to send the segment request, a rate adaption scheme is proposed based on the estimated transmission rate and the status of the client buffer and MAC queue to provide the presentation which should be selected.
2.2 Packet utility estimation
Packet utility in our work depends on two factors: the packet priority determined by the encoding information and the packet urgency determined by the playback information. In this paper, the packet priority is calculated in terms of the potential degradation caused in the video quality due to loss of the packet. Since the same method is applied to all the video sequences, we omit the client index in the following analysis. Let λ denote the number of B frames inserted into two P frames. Let \(D_{f,p}^{\text {loss}}\) denote the distortion caused by the loss of the packet p of frame f (f is the encoding index of the frame), which can be expressed by
It consists of two parts:
(1) \(D_{f,p}^{\text {rec}}\) is the relative distortion caused by the reconstructed packet. Here, the distortion \(D_{f,p}^{\text {rec}}\) can be obtained by using the method presented in [31].
(2) \(D_{f,p}^{\text {ref}}\) is the relative distortion caused by the packets dependent on the current one. As shown in Fig. 2, if we set the encoding parameter Number Reference Frames=1, I or P frame will be directly referred according to the following principles:
-
The following two B frames (f+1,⋯,f+λ) refer the frame f for backward prediction.
-
The following P frame f+λ+1 refers the frame f for forward prediction.
-
The following two B frames (f+λ+2,⋯,f+2λ+1) refer the frame f for forward prediction.
Let γ f,p be the sum times that each of the pixels of packet p of frame f is referenced. To calculate γ f,p , the information of each packet such as MB (macroblock) partition modes and MVs should be extracted from the encoder. H.264/AVC encoder supports the intra-coded and inter-coded prediction modes and also provides abundant MB partition modes. Intra-coded prediction has 16×16 and 4×4 partition modes, and the inter-coded prediction mode possesses seven MB partition modes, namely 16×16, 16×8, 8×16, 8×8, 8×4, 4×8, and 4×4 as shown in Fig. 3. Let \(\psi _{\eta,\theta }^{\widetilde {f},f,p}\) denote the set of the pixels of packet p of frame f referenced by the SB θ of the MB η of the frame \(\widetilde {f}\). The calculation of \(\psi _{\eta,\theta }^{\widetilde {f},f,p}\) is described in Table 2.
Note that I frame is directly referenced by its following P frame. If the P frame is not the last P frame in this GoP, it is also referenced by its following P frame. Except for the first P frame, other P frames refer I frame undirectly. Consequently, the loss of I or P frame will cause the quality degradation of all the following frames in the same GoP. For P frame, the smaller frame encoding index is the more important. In this paper, the referenced times of each packet is calculated by considering both direct and undirect reference modes. Let β specify the total number of I and P frames in one GoP. Any I or P frame is given another index called reference frame index to indicate its index among all the I and P frames in a GoP. Let π f denote the reference frame index of frame f, 0≤π f ≤β−1. Let ζ(∙) denote the number of the elements of the input set. For the packet p of frame f:
-
If π f =β−1, frame f is the last P frame in its GoP and it is only referenced by its following B frames in direct mode. γ f,p can be computed by
$$ \gamma_{f,p}=\sum\limits_{\widetilde{f}=f+1}^{f+2\lambda+1}{ \sum\limits_{\eta,\theta}{\zeta\left(\psi_{\eta,\theta}^{\widetilde{f},f,p}\right)}} $$(2) -
If π f <β−1, frame f may be referenced by its following B frames and P frames in direct or undirect mode. γ f,p can be computed by
$$ \begin{array}{l} \gamma_{f,p} = \underbrace{\sum\limits_{\widetilde{f} = f + 1}^{f + 2\lambda + 1} { \sum\limits_{\eta,\theta} {\zeta\left(\psi_{\eta,\theta }^{\widetilde{f},f,p}\right)} }}_{\mathrm{direct\;reference\;mode}} \\ + \underbrace{\sum\limits_{\widetilde{f} = f + \lambda + 2}^{f + \lambda (\beta - \pi_{f}) + \beta - \pi_{f} - 1} { \sum\limits_{\eta,\theta} {\zeta\left(\widehat{\psi}_{\eta,\theta }^{\widetilde{f},f,p} \right)} }}_{\mathrm{undirect\;reference\;mode}}\\ \end{array} $$(3)
where \(\widehat {\psi }_{\eta,\theta }^{\widetilde {f},f,p}\) is the region of pixels of packet p of frame f referenced undirectly by SB θ of MB η frame f. To calculate \(\widehat {\psi }_{\eta,\theta }^{\widetilde {f},f,p}\), we need to calculate the reference region of frame \(\widehat {f}\) directly referenced by frame \(\widetilde {f}\), \(\phi _{\eta,\theta }^{\widetilde {f},\widehat {f}}\) first. Then, according to all the reference relationships between the undirect reference frames such as \(\Psi ({\widehat {f},\widehat {f}-\lambda -1})\), ⋯, Ψ(f−λ−1,f), \(\widehat {\psi }_{\eta,\theta }^{\widetilde {f},f,p}\) can be obtained finally, where Ψ(f 1,f 2) is the set of the reference relationships for all the SBs between frame f 1 and f 2, which can be easily acquired by using the method presented in Table 2. Based on the above analysis, \(D_{f,p}^{\text {ref}}\) can be computed by
where \(A_{\hat {f},\hat {p}}\) is a binary variable which indicates whether the packet \(\hat {p}\) of frame \(\hat {f}\) is received or lost. \(A_{\hat {f},\hat {p}}=1\) is equal to the successful delivery and \(A_{\hat {f},\hat {p}}=0\) otherwise. The value of \(A_{\hat {f},\hat {p}}\) depends on the feedback ACK information. \(D_{f,p}^{\text {con}}\) is the distortion caused by the loss of its reference packet \(\hat {p}\) of frame \(\hat {f}\). Finally, \(D_{f,p}^{\text {loss}}\) can be obtained according to (1). In the following, \(D_{f,p}^{\text {loss}}\) is regarded as the priority of packet p of frame f and at the RLC (radio link control) layer of LTE system, each video packet is fragmented into many small packets. It is assumed that the packets attributed to the same packet have the same priority. In MAC queue, we can acquire the packet priority from the packet header denoted by pri k,m .
We use the feedback information such as ACKs of the received packets from the clients to predict the playback information and then examine the packet urgency. Let \(\widetilde {F}_{c}(k)\) denote the estimated index of the frame being played of client k. Since the frames of one segment should be decoded simultaneously, the packet urgency of all the video packets of the frames of the same segment should be the same. Thus, the packet urgency μ k,m can be expressed by
where P k,m is the index of the first frame of the segment waiting in the MAC queue. For the packets of the same segment, the values of P k,m are the same. By employing the method proposed in [3], the scheduling probability of packet m for client k, ρ k,m , can be defined as \(\rho _{k,m} = e^{-\mu _{k,m}}\phantom {\dot {i}\!}\).
Then, considering the packet priority and the packet urgency, the utility of packet m of client k, U k,m , can be calculated by
2.3 Problem formulation and solution
2.3.1 Problem formulation
We formulate the packet scheduling and resource management into a new mathematical model with the objective to maximize the weighted sum of utilities of all the scheduled packets of all the clients:
where
-
Ω: the set of the total available RBs.
-
N: the number of the available RBs in this system.
-
M k : the number of the packets in client k’s MAC queue.
-
w k : client k’s weight, a positive constant calculated by \(w_{k}=\sum \limits _{k = 1}^{K }(\widetilde {N}_{b}(k))/\widetilde {N}_{b}(k)\), such that \(\sum \limits _{k = 1}^{K }(w_{k})=1\). \(\widetilde {N}b(k)\) is the predicted number of frames waiting to be played at client buffer. The aim is to balance the playback continuity experience among the clients. \(\widetilde {N}b(k)\) can be predicted by using the feedback ACK information with RLC mode AM. It can also be fedback from the client by employing the playback buffer status analysis module at client.
-
τ k,m : τ m,k =1 means the packet m of client k is selected to be scheduled and τ m,k =0 otherwise.
-
a k,n : a k,n =1 indicates the RB n is assigned to client k and a k,n =0 otherwise.
-
N k : the set of RBs assigned to client k.
-
q(N k ): the maximum MCS (Modulation and Coding Scheme) that client k can support based on N k .
-
b k,j : b k,j =1 means client k uses MCS j and b k,j =0 otherwise.
-
r j : the transmission capacity of each RB by using MCS j.
-
R k,m : the size of the packet m of client k.
Constraint (c1) shows that the transmission capacity of client k should be enough to transmit its scheduled packets. We assume that all the scheduled packets can be transmitted to the client successfully by advanced channel coding. Constraint (c2) indicates that each packet can be scheduled or not at each scheduling period. Constraint (c3) means that each client can only choose one MCS. The last constraint implies that one RB can only be assigned to one client.
2.3.2 Packet scheduling and resource management scheme
From the mathematical expression in Eq. (7), the relationship between the receive time and other parameters cannot be explicitly evaluated without restrictive assumptions on the arrival process and the time-varying channel states. Thus, we can not obtain a mathematically optimal solution of the above problem in its most generic form. Due to this, we develop a heuristic suboptimal solution, which jointly performs RB assignment and packet scheduling by employing our proposed utility function.
To solve the problem (7), a cost function C k,n is developed to present the increment of the sum utilities of the clients brought by assigning RB n to client k. C k,n can be expressed by
where N k is the set of the RBs already assigned to client k. The function f(∙) can be expressed as
Achieving problem (7), we should assign RB n to the client with the maximum C k,n . To get C k,n , we need to solve f(τ k,m ,a k,n ,b k,j ,N k ) first. Next, we will provide how to calculate f(τ k,m ,a k,n ,b k,j ,N k ).
For a certain client with fixed RB set, the utility is an increasing function of the transmission capacity. The maximum capacity these RBs can support can be calculated by solving the following problem:
Then, we can get the optimal \(\left (a_{k,n}^{*},b_{k,j}^{*}\right),\) and then the transmission capacity \(\sum \limits _{n \in N_{k}} a_{k,n}^{*}{\sum \limits _{j = 1}^{q(N_{k})} {b_{k,j}^{*} r_{j}} }\) can be obtained according to [32]. Knowing the transmission capacity, the optimal packet scheduling strategy \(\tau _{k,m}^{*}\) can be obtained by solving the following problem:
Optimal \(\tau _{k,m}^{*}\) can be easily acquired by our method proposed in [33]. Then, we can have
The detailed process of the packet scheduling and resource management scheme is described in Table 3.
2.4 Adaptive segment request scheme
In this section, an adaptive segment request strategy is proposed to determine when to send the segment request and which presentation should be selected.
2.4.1 Segment request time determination
The segment request time determination is carried out at the client side, so we can get the accurate number of the unplayed but successfully decoded frames stored in client k’s playback buffer N b (k) and how long continuous playback these frames can support T b (k)=N b (k)/F pr , where F pr is the frame rate. Since some packets in the MAC queue can be transmitted to the client side during T b (k) and therefore can bring extra continuous playback time, immediately sending the new segment request can not bring benefit to the client but may occupy the wireless resource which is really needed by other clients. Letting the packets in MAC queue are the ones that are really needed by the clients is the most important.
Let T mac(k) denote how long the continuous playback time interval that the packets in MAC queue can support and R P k denote the set of the packets in MAC queue of client k after the estimation. The estimation of T mac(k) is shown in Table 4.
If R P k =∅, it means that all the packets in the MAC queue can arrive at the client on time and the client can enjoy T mac(k)+T b (k) continuous playback back at least. If R P k ≠∅, it means that the client k will suffer interruption after the time interval T mac(k)+T b (k). Let T l (k) denote the playback time that the remaining packets in MAC queue will take, which can be calculated by T l (k)=N l (k)/F pr , where N l (k) is the number of the remaining frames in the MAC queue. The transmission time they will take can be estimated by \(T_{r}(k)=\text {SM}(k)/\widehat {r}_{k}\), where SM(k) indicates the sum size of the left packets in the MAC queue. A guard time interval T g (k) is introduced to protect the client against suffering playback interruption. The appropriate time interval needed to send the next segment request, T i (k), can be computed by
2.4.2 Rate adaption at client
With the time when to send the segment request known, we need to determine which presentation should be selected. To keep the continuous playback, the next requested segment should be transmitted to the client during the time we estimated above. Based on the estimated transmission rate, we can determine the optimal coding rate we should request. With the optimal coding rate known, we can select the segment with most closest coding rate. Using the metadata contained in MPD (Media Presentation Description), the unique URL of the requested segment can be constructed. Then, the request will be sent to the media server by HTTP GET command. The media server will pick the segment with the unique URL and send it to the DASH client.
Let S mac(k) denote the sum size of the packets in client k’s MAC queue, \(S_{\text {mac}}(k)=\sum \limits _{m=1}^{M_{k}}{R_{k,m}}\). The transmission time taken by those packets, \(\widehat {T}_{\text {mac}}(k),\) can be estimated by
There are two possible transmission cases according to the value of T l (k):
-
If T l (k)=0, all the packets in MAC queue can be transmitted to the client during the time interval T b (k)+T mac(k). Therefore, to keep the continuous playback of client k, the next segment should be transmitted to client during the interval \(T_{b}(k)+T_{\text {mac}}(k)-\widehat {T}_{\text {mac}}(k)\). Let T seg(k) be the duration of a segment for client k. Then, the coding rate that the client should recommend can be computed by
$$ {\text{pr}}_{k}=\widehat{r}_{k} \frac{T_{b}(k)+T_{\text{mac}}(k)-\widehat{T}_{\text{mac}}(k)}{T_{\text{seg}}(k)} $$(15) -
If T l (k)>0, client k would suffer interruption after the time interval T b (k)+T mac(k). Since the left packets also take some time \(\widehat {T}_{l}(k)\) to be transmitted and played, the next segment should arrive at client during the time interval \(\widehat {T}_{l}(k)\). Then, the recommended coding rate can be acquired by
$$ {\text{pr}}_{k}=\widehat{r}_{k} \frac{T_{l}(k)}{T_{\text{seg}}(k)} $$(16)
In general, the recommended coding rate can be determined by
Obviously, the recommended coding rate may be not equal to the real coding rate of the segments stored at the media server. Thus, the client should determine which segment should be selected based on the recommended rate. In this paper, we will pick the representation with the most close coding rate to pr k from all the pre-encoded representations. The encoding information of the representation can be found in MPD.
Assume that there be ε k representations of the video sequence requested by client k and let PR k indicate the set of those representations, \(\phantom {\dot {i}\!}\text {PR}_{k}=\{\text {PR}_{k,1}, \cdots, \text {PR}_{k,\varepsilon _{k}}\}\) with PR k,i <PR k,i+1. If the recommended rate pr k is lower or higher than the coding rates of all the encoded representations, the DASH client will pick the representation with the lowest or the highest coding rate of PR k . The principle of the final decision on the selected representation sp k can be summarized as follows:
From Eq. (18), we can determine which segment should be requested. As long as the segment going to be requested is obtained, the unique URL of the segment can be constructed by using the metadata contained in MPD. Then, the request will be sent to the media server by HTTP GET command and the media server will pick the segment with the unique URL and send it to the DASH client.
3 Results and discussion
We provide our experimental results and discussion in this section. All the experiments are carried out in LTE system with the Network Simulation 3 (NS3) simulator [34]. eNodeB can obtain the CQIs through the feedback of all UEs. Each scheduling period called TTI (Transmission Time Interval) consists of two consecutive RBs and the time slot duration of each RB is 0.5 ms. Therefore, CQIs will be periodically updated and transmitted to eNodeB in the feedback information every 1 ms. It is assumed that the power be equally allocated among the RBs. Since we need the ACK information to predict the playback buffer status of the DASH client, the RLC mode is set to be AM (acknowledged mode). All the video sequence have been encoded into a series of the video segments with five different coding rates (presentations). Each segment consists of 60 frames with the time duration of 2 s. The detailed system configuration is shown in Table 5. The encoding information about the average coding rate, PSNR, and the corresponding presentation is shown in Table 6. For each segment, there is no need to receive all the video packets especially the packets with little effect on the received video quality. In our simulation, each video packet should be transmitted to the client before its deadline which depends on the maximum allowed waiting time of each video packet. We define T a k as the maximum allowed waiting time for the packet of client k. It means that from the time the packet should be started to be played, the client only wait T a k for the arrival of the video packet at most. In our simulation, T a k =5 s,1≤k≤K. Using this parameter, we can determine which packet has the chance to be transmitted and which should be discarded automatically. The clients can dynamically switch their request presentations at the times that they send their segment requests.
“PROPOSED” is the algorithm which consists of both the proposed packet scheduling and resource management scheme. “AR” (adaptive request) only indicates the adaptive segment request scheme. When AR is not used at the client, the clients will send the segment request with a fixed time interval and the rate adaption scheme is to request the segment whose rate is the most closest to the estimated transmission rate. The algorithm proposed in this paper will be compared with the following four algorithms:
-
QTRMA in [15]: It investigated QoE optimization approaches for adapting the adaptive HTTP video delivery. They considered the client buffer status and proposed a novel playback buffer-dependent approach that determines for each client the streaming rate for future video segments according to its buffer time and the achievable QoE under current radio conditions.
-
ADCLA in [18]: It developed a packet importance model based on the information extracted from the compressed video and proposed a content-aware scheduling scheme for resource allocation. The waiting time in MAC queue is used to calculate the packet urgency.
-
MAXCI in [35]: It is a purely channel-dependent scheduling scheme, in which the clients with good channel states always occupy the channel most of time even if they are well ahead of their video decoding deadlines.
-
PF in [36]: It allocates the limited network resources by considering both the experienced channel states and the past client throughput. Its goal is to maximize the system throughput and to guarantee fairness among end clients.
In Fig. 4, we provide the average PSNR of each frame of all the clients for different algorithms respectively. From this figure, we can see that our proposed algorithm can outperform other existing algorithms for all the clients. This indicates that considering the packet importance and the packet urgency can make the packet scheduling and resource allocation strategy more reasonable. Due to taking into consideration the packet importance, ADCLA+AR has higher PSNR than MAXCI+AR and PF+AR. We can also observe that QTRMA has more stable performance than other algorithms. This is because for QTRMA, there is no packet loss, the client will wait until all the video packets of each segment arrive at the client. No packet loss means not so serious quality fluctuation during one segment. But transmitting all the video packets may lead to requesting the segment with low bit rate which will also degrade the value of the average PSNR. In fact, the packet with little importance should be discarded not scheduled.
Figures 5 and 6 shows the corresponding playback time of each frame of each client and the average playback time of each frame of all the clients. In the simulation, each client should experience a rebuffering time interval until all the packets of two segments are received. From the two figures, we can see that some continuous playback can be guaranteed for all the clients with different algorithms. From Fig. 6, we can deserve that the client can finish the whole video in a shortest time among all the clients by employing our proposed method. It means that the clients suffer the fewest rebuffering events and shortest interruption time which brings most satisfactory playback experience to the clients. From Fig. 5, we can discover that the performance of our proposed method is not always the best among the algorithms for each frame. This is because the objective of our proposed algorithm is to maximize the sum of the utilities of all the clients not aiming at optimizing an individual client. We can also see that the algorithm QTRMA yields longer interruption time than the PROPOSED and ADCLA+AR. This is because QTRMA has to start to play until all the video packets of each segment arrive at the client while the former algorithms will wait for only T a k . It is unreasonable to wait for the arrival of the packets with little importance especially in a wireless network with a bad channel state.
To examine the performance of the packet scheduling strategy of our proposed scheme, we present the average packet loss rate of the clients for each frame in Fig. 7. From the figure, we can see that the packet loss rate can be kept to zero for some time because a rebuffering process exists for all the videos at the beginning of the playback. In addition, the packet loss rate can be restricted in a lowest range by using our proposed algorithm. The number of the frames with no packet loss is larger than other compared algorithms too. Lower packet loss is equal to the higher PSNR especially when we consider the packet utility in the packet scheduling and resource management. The packet with higher utility will be scheduled with higher priority. Reasonable packet scheduling strategy can improve the received video quality by allocating the limited resource to the clients who really need. Together with the results in Figs. 4 and 6, we can conclude that our proposed algorithm can acquire higher video quality as well as more continuous playback experience.
The simulation result of the average PSNR of all the clients for each frame for different packet scheduling and resource management algorithms integrating with our proposed segment request scheme AR is shown in Fig. 8, and the interruption time experienced by each client by employing our proposed segment request scheme AR is shown in Fig. 9. Note that “PROPOSED-AR” means the resource allocation scheme used is the algorithm proposed in this paper but the segment will be requested with a fixed time interval. Since QTRMA has its unique segment request scheme, we did not employ AR for it. From Fig. 8, we can see that the video quality based on AR is higher than that without AR for the most of the frames. This is because the segment adaption scheme can make the segment switch more flexible and adaptive to the varying channel states. Without rate adaptation, the rate of the requested segment may exceed the transmission rate that the wireless channel can support. By using AR, the limited resource can be allocated to the packets which are suitable to be transmitted. As shown in Fig. 9, AR can bring the playback experience improvement apparently for all the packet scheduling and resource management schemes. Different schemes earn different achievements. In summary, QoE of the clients can be improved by employing AR.
4 Conclusions
A content and buffer status aware packet scheduling and resource management framework is proposed for DASH video streaming over LTE system. We first modify the transmission flow to make the whole framework tolerant of the loss of unimportant packets. Then, depending on the encoding information extracted from the encoder, we calculate the priorities of the video packets. Based on the feedback information from the clients we can estimate the packet urgency. The priorities of the video packets, the feedback channel states and the packet urgencies are considered to optimize the packet scheduling and RB assignment. Instead of requesting the video segments with fixed time interval, we propose an adaptive segment request time determination strategy which can effectively control the status of the client buffer and MAC queue so as to reduce the packet loss rate caused by the overflow of the client buffer and MAC queue. To get a better trade-off between the received video quality and playback continuity, we further propose a novel rate adaption algorithm which depends on the status of the client buffer, the MAC queue and the predicted transmission rate. Numerical simulation results show that our proposed client buffer and content aware packet scheduling framework can provide superior performance in terms of the PSNR of the received video and playback continuity.
References
O Oyman, S Singh, Quality of experience for HTTP adaptive streaming services. IEEE Commun. Mag. 50(4), 20–27 (2012).
LD Cicco, S Mascolo, An adaptive video streaming control system: modeling, validation, and performance evaluation. IEEE/ACM Trans. Netw. 22(2), 526–539 (2013).
Q Ai, P Wang, F Liu, Y Wang, F Yang, J Xu, QoS-guaranteed cross-layer resource allocation algorithm for multiclass services in downlink LTE system. Int. Conf. Wireless Commun. Signal Process, 1–4 (2010).
J Huang, W Lin, H Ko, in IEEE Region 10 Conf. A resource allocation algorithm for maximizing packet transmissions in downlink lte cellular systems, (2011), pp. 445–449.
H Ramli, K Sandrasegaran, R Basukala, R Patachaianand, X Minjie, L Chung, Resource allocation technique for video streaming applications in the LTE system. Wirel. Opt. Commun. Conf.1–5 (2010).
T Kupka, P Halvorsen, C Griwodz, An evaluation of live adaptive HTTP segment streaming request strategies. IEEE Conf. Local Comput. Netw, 604–612 (2011).
M Kalman, B Girod, P Beek, Optimized transcoding rate selection and packet scheduling for transmitting multiple video streams over a shared channel. IEEE Int. Conf. Image Process. 1:, 165–168 (2005).
J Xing, W Fan, Z Lu, A cross-layer scheduling scheme for video streaming based on fuzzy decision-making. IEEE Int. Conf. Commun. Technol, 84–87 (2011).
S Gouache, G Bichot, A Bsila, C Howson, Distributed and adaptive HTTP streaming. IEEE Int. Conf. Multimedia Expo, 1–6 (2011).
M Xing, S Xiang, L Cai, Rate adaptation strategy for video streaming over multiple wireless access networks. IEEE Global Commun. Conf, 5745–5750 (2012).
M Andrews, K Kumaran, K Ramanan, A Stolyar, P Whiting, R Vijayakumar, Providing quality of service over a shared wireless link. IEEE Commun. Mag. 39(2), 150–154 (2001).
T Cong, H Le, H Nguyen, A Pham, R Man, An evaluation of bitrate adaptation methods for HTTP live streaming. IEEE J. Sel. Areas Commun. 32(4), 693–705 (2014).
J Jiang, V Sekar, H Zhang, Improving fairness, efficiency, and stability in HTTP-based adaptive video streaming with festive. IEEE/ACM Trans. Netw. 22(1), 326–340 (2012).
M Zhao, X Gong, J Liang, W Wang, X Que, S Cheng, QoE-driven cross-layer optimization for wireless dynamic adaptive streaming of scalable videos over HTTP. IEEE Trans. Circ. Syst. Video Technol. 25(3), 451–465 (2015).
A Essaili, D Schroeder, E Steinbach, D Staehle, M Shehada, QoE-based traffic and resource management for adaptive HTTP video delivery in LTE. IEEE Trans. Circ. Syst. Video Techn. 25(6), 988–1001 (2015).
V Ramamurthi, O Oyman, Video-QoE aware radio resource allocation for HTTP adaptive streaming. IEEE Int. Conf. Commun, 1076–1081 (2014).
N Bouten, S Latre, J Famaey, In-network quality optimization for adaptive video streaming services. IEEE Trans. Multi. 16(8), 2281–2293 (2014).
F Li, G Liu, L He, Application-driven cross-layer approaches to video transmission over downlink OFDMA networks. IEEE Globecom Workshops, 1–6 (2009).
M Zhao, X Gong, X Que, et al, Context-aware adaptive active queue management mechanism for improving video transmission over IEEE 802.11E WLAN. J. China Univ. Posts Telecommun. 19(11), 65–72 (2012).
G Liebl, M Kalman, B Girod, Deadline-aware scheduling for wireless video streaming. IEEE Int. Conf. Multimedia Expo, 1–4 (2005).
R Seungwan, R Byunghan, S Hyunhwa, S Mooyong, Urgency and efficiency based packet scheduling algorithm for OFDMA wireless system. IEEE Int. Conf. Communications. 4:, 2779–2785 (2005).
A Dua, C Chan, N Bambos, J Apostolopoulos, Channel, deadline, and distortion (C D 2) aware scheduling for video streams over wireless. IEEE Trans. Wireless Commun. 9(3), 1001–1011 (2010).
K Fujimoto, S Ata, M Murata, Playout control for streaming applications by statistical delay analysis. IEEE Int. Conf. Commun. 8:, 2337–2342 (2001).
S Wee, T tian, J Apostolopoulos, M Etoh, Optimized video streaming for networks with varying delay. IEEE Int. Conf. Multimedia Expo. 2:, 89–92 (2002).
C Boutremans, J Boudec, Adaptive joint playout buffer and FEC adjustment for internet telephony. IEEE Int. Conf. Comput. Commun. 1:, 652–662 (2003).
E Piri, M Uitto, J Vehkapera, T Sutinen, Dynamic cross-layer adaptation of scalable video in wireless networking. IEEE Global Commun. Conf, 1–5 (2010).
S Islam, M Hossain, A wireless video streaming system based on OFDMA with multi-layer H.264 coding and adaptive radio resource allocation. Int. Conf. Intell. Inf. Process, 1–6 (2011).
K Tappayuthpijarn, T Stockhammer, E Steinbach, HTTP-based scalable video streaming over mobile networks. IEEE Int. Conf. Image Process, 2193–2196 (2011).
R Radhakrishnan, A Nayak, Cross layer design for efficient video streaming over LTE using scalable video coding. IEEE Int. Conf. Commun, 6509–6513 (2012).
W Pu, Z Zou, C Chen, Video adaptation proxy for wireless dynamic adaptive streaming over HTTP. Packet Video Workshop, 65–70 (2012).
F Li, G Liu, L He, A cross-layer scheduling algorithm for H.264 video transmission over wireless networks. Int. Workshop Cross Layer Design, 1–6 (2009).
TS 36.213, Evolved Universal Terrestrial Radio Access (E-UTRA): physical layer procedures, https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=2427. Accessed 23 March 2017.
L He, G Liu, Quality-driven cross-layer design for H.264/AVC video transmission over OFDMA system. IEEE Trans. Wirel. Commun. 13(12), 6768–6782 (2014).
G Piro, L Grieco, G Boggia, F Capozzi, P Camarda, Simulating LTE cellular systems: an open-source framework. IEEE Trans. Veh. Technol. 60(2), 498–513 (2011).
S Borst, User-level performance of channel-aware scheduling algorithms in wireless data networks. IEEE Int. Conf. Comput. Commun. 1:, 321–331 (2003).
A Jalali, R Padovani, R Pankaj, Data throughput of CDMA-HDR a high efficiency-high data rate personal communication wireless system. IEEE Vehicular Technol. Conf. 3:, 1854–1858 (2000).
Funding
This work is supported by the National Natural Science Foundation of China, No.61701389.
Availability of data and materials
Not applicable.
Author information
Authors and Affiliations
Contributions
LH and FL conceived and designed the research. LH performed the experiments and analyzed the data. LH and FL wrote and edited the manuscript. Both authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
He, L., Li, F. Content and buffer status aware packet scheduling and resource management framework for video streaming over LTE system. J Image Video Proc. 2017, 73 (2017). https://doi.org/10.1186/s13640-017-0222-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13640-017-0222-1