Theory

The inverted cart–pendulum is an example of an under-actuated, non-minimum-phase and highly unstable system. The first step in analysing a control system is to derive a mathematical model that describes how the complete system behaves.

The Plant (Pendulum-Cart)

Pendulum setup description
The pendulum setup consists of a cart moving along a 1 m long track. The cart carries a shaft to which two pendulums are attached so that they can rotate freely. Moving the cart back and forth causes the pendulums to swing. The cart is driven by a belt that a DC motor, mounted at the end of the rail, pulls in either direction. The voltage applied to the motor is the control signal, and it sets the force with which the cart is pulled. Two variables are read from the setup using optical encoders: the pendulum angle and the cart position on the rail. The controller's task is to adjust the DC motor voltage according to these two variables so that the desired control objective is met (stabilisation in the upright position, or crane control). Developing control algorithms effectively requires a sound understanding of the physical principles governing the process, together with identification experiments. The next section explains how the pendulum is modelled.

Fig. 1. Digital Pendulum mechanical unit

Pendulum Model
Every control project starts with plant modelling. The phenomenological model of the pendulum, presented in Fig. 2, is nonlinear: at least one of the states (x and its derivative, or θ and its derivative) is an argument of a nonlinear function. Here x is the position of the cart (m) and θ is the angle of the pendulum with respect to the vertical (rad). To represent such a model as a transfer function (a description of linear plant dynamics used in control engineering), it has to be linearised.

Fig. 2. Pendulum phenomenological model

Fig. 3. Cart-pendulum system

The cart–pendulum system, shown in Fig. 3, has two degrees of freedom:
i) Linear motion of the cart along the X-axis.
ii) Rotation of the pendulum in the X–Y plane.
The equations of motion are
$$\ddot{\theta}=\frac{mL}{\sigma}\{[F-b\dot{x}]\cos\theta - mL\dot{\theta}^2 \cos\theta \sin\theta + ( m+M )g \sin\theta\} \tag 1$$
$$\ddot{x} = \frac{1}{\sigma}\{(J+mL^2)[ F - b\dot{x} - mL\dot{\theta}^2 \sin\theta ] + m^2L^2g \sin\theta \cos\theta \} \tag 2$$
where
$$\sigma = mL^2( M + m \sin^2\theta ) + J ( M + m )$$
Linearising equations (1) and (2) for small angles θ about the upright (vertical) equilibrium point gives the following state-space model.
State Space Representation \begin{gather} \begin{bmatrix} \dot{x}\\ \ddot{x}\\ \dot{\theta}\\ \ddot{\theta}\\ \end{bmatrix} = \begin{bmatrix} 0 & 1& 0 & 0 \\ 0 & \frac{-(J+mL^2)b}{\sigma'} & \frac{m^2 L^2 g}{\sigma'} & 0 \\ 0 & 0& 0 & 1 \\ 0 & \frac{- mLb}{\sigma'} & \frac{mgL(m+M)}{\sigma'} & 0 \\ \end{bmatrix} \begin{bmatrix} x \\ \dot{x} \\ \theta \\ \dot{\theta} \\ \end{bmatrix} + \begin{bmatrix} 0 \\ \frac{(J+mL^2)}{\sigma'} \\ 0 \\ \frac{mL}{\sigma'} \\ \end{bmatrix} F \tag 3 \end{gather}
\begin{gather} y = \begin{bmatrix} 1 & 0 & 0 & 0\\ 0 & 0 & 1 & 0 \end{bmatrix} \begin{bmatrix} x \\ \dot{x} \\ \theta \\ \dot{\theta} \\ \end{bmatrix} \tag 4 \end{gather} where $$x,\ \dot{x},\ \theta,\ \dot{\theta}$$ are the states, y is the output vector, and $$\sigma' = J(M+m) + MmL^2$$

Plant Parameters

Parameter	Value
g - Acceleration due to gravity	9.81 m/s²
L - Length of the pendulum	0.4 m
M - Mass of the cart	2.4 kg
m - Mass of the pendulum	0.23 kg
J - Moment of inertia of the pendulum	0.099 kg·m²
b - Coefficient of cart friction	0.055 N·s/m
d - Pendulum damping coefficient	0.005 N·m·s/rad
U - Control signal	−2.5 V < U < +2.5 V
F - Force applied to the cart	−24 N < F < +24 N
x - Position of the cart from the reference	−0.3 m < x < +0.3 m
θ - Angle of the pendulum with respect to the vertical	|θ| ≤ 0.1 rad
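
As a rough illustration of why the upright position is called highly unstable, the sketch below integrates equations (1) and (2) with F = 0 from a small initial tilt, using the parameter values in the table. It assumes forms of the equations that linearise exactly to (3): σ is taken as mL²(M + m sin²θ) + J(M + m), which equals σ' at θ = 0, and the gravity coupling term in (2) as m²L²g sinθ cosθ. The function names, the integration scheme (semi-implicit Euler) and the step size are illustrative choices, not part of the experiment.

```python
import math

# Plant parameters from the table (SI units)
g, L, M, m, J, b = 9.81, 0.4, 2.4, 0.23, 0.099, 0.055


def accelerations(x_dot, theta, theta_dot, force):
    """Return (x_ddot, theta_ddot) from the nonlinear equations (1) and (2)."""
    s, c = math.sin(theta), math.cos(theta)
    # Assumed form of sigma: reduces to sigma' = J(M+m) + M m L^2 at theta = 0
    sigma = m * L**2 * (M + m * s**2) + J * (M + m)
    drive = force - b * x_dot
    theta_ddot = (m * L / sigma) * (
        drive * c - m * L * theta_dot**2 * c * s + (m + M) * g * s
    )
    x_ddot = (1.0 / sigma) * (
        (J + m * L**2) * (drive - m * L * theta_dot**2 * s)
        + m**2 * L**2 * g * s * c
    )
    return x_ddot, theta_ddot


def simulate(theta0, t_end=1.0, dt=1e-3):
    """Unforced response (F = 0) by semi-implicit Euler; returns the final angle."""
    x_dot, theta, theta_dot = 0.0, theta0, 0.0
    for _ in range(int(round(t_end / dt))):
        x_dd, th_dd = accelerations(x_dot, theta, theta_dot, 0.0)
        x_dot += x_dd * dt          # update velocities first ...
        theta_dot += th_dd * dt
        theta += theta_dot * dt     # ... then the angle with the new velocity
    return theta


theta_final = simulate(0.01)
print(f"theta after 1 s from a 0.01 rad tilt: {theta_final:.4f} rad")
```

With no control force the tilt grows several-fold within one second, which is why feedback from both x and θ is needed.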

Next, substituting the values of M, L, J, m and g into equations (3) and (4), and neglecting the friction coefficient b (which is very small compared with the other parameters), the following transfer functions are obtained:
$$\frac{X(s)}{F(s)} = \frac{( J + mL^2 )s^2 - mgL}{s^2((J( m + M ) + MmL^2)s^2 - mgL ( M + m ))}$$ $$= \frac{0.3894s^2 - 2.5883}{s^2(s^2 - 6.807)} \approx \frac{0.3894}{s^2} \tag 5$$
$$\frac{\theta(s)}{F(s)} = \frac{mLs^2}{s^2((J( m + M ) + MmL^2)s^2 - mgL ( M + m ))}$$ $$= \frac{0.2638s^2}{s^2(s^2 - 6.807)}$$ $$\approx \frac{0.2638}{s^2 - 6.807} \tag 6$$
In (5) the numerator constant is mgL/σ' = 2.5883, so the zeros at s² ≈ 6.646 lie close to the poles at s² = 6.807 and are treated as cancelling them; in (6) the factor s² cancels exactly. These cancellations do not cause any internal stability problem, because the cancelled modes remain available for feedback through the other transfer function: both outputs (x and θ) are fed back for stabilisation.

The DC motor, which converts the control voltage U into the force F, is represented by a gain block with gain = 15. Hence the transfer functions X(s)/U(s) and θ(s)/U(s) become
$$\frac{X(s)}{U(s)} \triangleq \frac{b_1}{s^2} = \frac{5.841}{s^2} \tag 7$$
$$\frac{\theta(s)}{U(s)} \triangleq \frac{b_2}{s^2 - a^2} = \frac{3.957}{s^2 - 6.807} \tag 8$$
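
The numerical coefficients can be reproduced directly from the plant parameters. The following is a minimal sketch under that assumption; the variable names are illustrative, and small differences in the last digit against the text come from the text rounding 0.3894 and 0.2638 before multiplying by the motor gain.

```python
import math

# Plant parameters from the table (SI units)
g, L, M, m, J, b = 9.81, 0.4, 2.4, 0.23, 0.099, 0.055
MOTOR_GAIN = 15.0  # voltage-to-force gain of the DC motor, as stated in the text

sigma_p = J * (M + m) + M * m * L**2          # sigma' of equation (3)

# Non-zero entries of the A and B matrices in (3)
a22 = -(J + m * L**2) * b / sigma_p           # friction term, neglected later
a23 = m**2 * L**2 * g / sigma_p
a42 = -m * L * b / sigma_p                    # friction term, neglected later
a43 = m * g * L * (M + m) / sigma_p           # equals a^2 in (8)
b_x = (J + m * L**2) / sigma_p
b_theta = m * L / sigma_p

# Gains of the voltage-driven transfer functions (7) and (8)
b1 = MOTOR_GAIN * b_x
b2 = MOTOR_GAIN * b_theta
a = math.sqrt(a43)                            # open-loop poles of (8) at +a and -a

print(f"sigma' = {sigma_p:.5f}")
print(f"a22 = {a22:.4f}, a23 = {a23:.4f}, a42 = {a42:.4f}, a43 = {a43:.4f}")
print(f"b1 = {b1:.3f}, b2 = {b2:.3f}, a = {a:.3f}")
```

The friction entries a22 and a42 come out roughly two orders of magnitude smaller than a43, which is consistent with neglecting b. The pole at s = +a in the right half-plane is the open-loop instability of the pendulum.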

Two-loop PID controller

The two-loop PID controller to be employed for the cart–pendulum system is shown in Fig. 4. Let the two PID controllers be
$$C_1 = \frac{k^1_d s^2 +k^1_p s +k^1_i}{s} \tag 9$$
$$C_2 = \frac{k^2_d s^2 +k^2_p s +k^2_i}{s} \tag {10}$$
where $$k^1_p,\ k^1_i,\ k^1_d$$ denote the proportional, integral and derivative gains of C₁, and $$k^2_p,\ k^2_i,\ k^2_d$$ denote the proportional, integral and derivative gains of C₂.

With the above controllers, the characteristic equation for the control scheme presented in Fig. 4 becomes
$$1 - P_1C_1 + P_2C_2 = 0 \tag {11}$$
Substituting P₁ = X(s)/U(s) and P₂ = θ(s)/U(s) from (7) and (8), and C₁, C₂ from (9) and (10), in (11), we get
$$1 - (\frac{b_1}{s^2}\frac{k^1_d s^2 +k^1_p s +k^1_i}{s})+ (\frac{b_2}{s^2 - a^2}\frac{k^2_d s^2 +k^2_p s +k^2_i}{s}) = 0 \tag {12}$$
which yields
$$s^5 + ( -b_1 k^1_d + b_2 k^2_d )s^4 + ( -a^2 - b_1 k^1_p + b_2 k^2_p )s^3 + ( -b_1 k^1_i + a^2b_1k^1_d + b_2k^2_i )s^2 + ( a^2b_1k^1_p )s + ( a^2b_1k^1_i ) = 0 \tag {13}$$
Since the above characteristic equation is of fifth order, let the desired characteristic equation be
$$s^5 + p_1s^4 + p_2s^3 + p_3s^2 + p_4s + p_5 = 0 \tag {14}$$
Comparing the coefficients of (13) and (14), the following matrix equation is obtained:
\begin{gather} \begin{bmatrix} -b_1 & 0 & 0 & b_2 & 0 & 0 \\ 0 & -b_1 & 0 & 0 & b_2 & 0 \\ a^2b_1 & 0 & -b_1 & 0 & 0 & b_2\\ 0 & a^2b_1 & 0 & 0 & 0 & 0\\ 0 & 0 & a^2b_1 & 0 & 0 & 0 \\ \end{bmatrix} \begin{bmatrix} k^1_d\\ k^1_p \\ k^1_i\\ k^2_d\\ k^2_p\\ k^2_i\\ \end{bmatrix} = \begin{bmatrix} p_1\\ p_2 + a^2\\ p_3\\ p_4\\ p_5\\ \end{bmatrix} \tag {15} \end{gather}
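
The step from (12) to (13) can be made explicit. Multiplying (12) through by the common denominator s³(s² − a²) gives
$$s^3(s^2 - a^2) - b_1(k^1_d s^2 +k^1_p s +k^1_i)(s^2 - a^2) + b_2(k^2_d s^2 +k^2_p s +k^2_i)s^2 = 0$$
Expanding the products and collecting powers of s then gives (13). For example, the s² coefficient collects −b₁k¹ᵢ and a²b₁k¹_d from the second term and b₂k²ᵢ from the third, while the s¹ and s⁰ coefficients come only from the second term, which is why the last two rows of (15) involve C₁ gains alone.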

Fig. 4. Two-loop PID controller for an inverted cart–pendulum system

LQR Design

The LQR is an optimal state-feedback controller designed to minimise a quadratic performance index that captures the design constraints. For an LTI system
$$\dot{X}= AX + BU$$ $$Y=CX \tag {16}$$
the performance index is taken as
$$J = \frac{1}{2}\int_{0}^{\infty}\{X^TQX + U^TRU \}dt \tag {17}$$
where Q is positive semi-definite (or positive definite) and R is positive definite. J is minimised by solving the algebraic Riccati equation
$$A^TP+PA-PBR^{-1}B^TP+Q = 0 \tag {18}$$
for P. With the control law U = −KX, the optimal state-feedback gain vector is
$$K = R^{-1}B^TP \tag {19}$$
The LQR design is now carried out for the inverted pendulum system. Substituting the system parameter values in (3) and (4) and comparing these equations with (16), we get
\begin{gather} A = \begin{bmatrix} 0 & 1 & 0 & 0\\ 0 & 0 & 0.238 & 0 \\ 0 & 0 & 0 & 1\\ 0 & 0 & 6.807 & 0\\ \end{bmatrix}, B = \begin{bmatrix} 0\\ 0.3894\\ 0\\ 0.2638\\ \end{bmatrix} \times 15, \ C = \begin{bmatrix} 1 & 0 & 0 & 0\\ 0 & 0 & 1 & 0 \\ \end{bmatrix} \tag {20} \end{gather}
where the factor 15 in B is the motor gain, so that the input is the control voltage U. Since A is of fourth order, Q may be chosen as Q = diag {q₁, q₂, q₃, q₄} such that q₁ >> q₂, q₂ >> q₄, q₃ >> q₄, where q₁, q₂, q₃, q₄ are the weights on the cart position $$x$$, cart velocity $$\dot{x}$$, pendulum angle $$\theta$$ and pendulum angular velocity $$\dot{\theta}$$ respectively. The optimal state-feedback gains are then found to be
$$K = [-2.2361 \ -2.7209 \ 17.5208 \ 6.7791] \tag {21}$$
Finally, the closed-loop poles, that is, the eigenvalues of (A − BK), are obtained as −2.8862 ± 2.1606i and −2.5800 ± 0.1461i.
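
The weights Q and R behind (21) are not listed, so the design itself cannot be rerun from the text alone. What can be checked is that the stated gain and the stated poles agree. The sketch below is such a consistency check, assuming the control law U = −KX. It forms A − BK from (20) and (21), computes its characteristic polynomial with the Faddeev–LeVerrier recursion, and compares the result with the polynomial built from the quoted poles. The helper names are illustrative.

```python
MOTOR_GAIN = 15.0
A = [[0.0, 1.0, 0.0, 0.0],
     [0.0, 0.0, 0.238, 0.0],
     [0.0, 0.0, 0.0, 1.0],
     [0.0, 0.0, 6.807, 0.0]]
B = [0.0, 0.3894 * MOTOR_GAIN, 0.0, 0.2638 * MOTOR_GAIN]
K = [-2.2361, -2.7209, 17.5208, 6.7791]


def char_poly(mat):
    """Coefficients of det(sI - mat), highest power first (Faddeev-LeVerrier)."""
    n = len(mat)
    aux = [[float(i == j) for j in range(n)] for i in range(n)]  # starts as I
    coeffs = [1.0]
    for k in range(1, n + 1):
        prod = [[sum(mat[i][r] * aux[r][j] for r in range(n)) for j in range(n)]
                for i in range(n)]
        c = -sum(prod[i][i] for i in range(n)) / k
        coeffs.append(c)
        aux = [[prod[i][j] + (c if i == j else 0.0) for j in range(n)]
               for i in range(n)]
    return coeffs


def poly_from_roots(roots):
    """Real coefficients of the monic polynomial with the given roots."""
    coeffs = [1 + 0j]
    for r in roots:
        new = coeffs + [0j]
        for i, c in enumerate(coeffs):
            new[i + 1] -= r * c
        coeffs = new
    return [c.real for c in coeffs]


# Closed-loop matrix A - BK (B is a column, K is a row)
A_cl = [[A[i][j] - B[i] * K[j] for j in range(4)] for i in range(4)]
from_gain = char_poly(A_cl)

stated_poles = [complex(-2.8862, 2.1606), complex(-2.8862, -2.1606),
                complex(-2.5800, 0.1461), complex(-2.5800, -0.1461)]
from_poles = poly_from_roots(stated_poles)

for c_gain, c_pole in zip(from_gain, from_poles):
    print(f"{c_gain:12.4f} {c_pole:12.4f}")
```

The two columns agree to within the rounding of the published values, which supports reading K with the sign convention U = −KX.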

Now, with the above four poles and choosing the fifth pole at six times the real part of the dominant one amongst these four poles (that is, at s = 6 × (−2.58) = −15.48), the coefficients of (14) are obtained as p₁ = 26.4, p₂ = 218.6, p₃ = 871.3, p₄ = 1721.8, p₅ = 1343.7
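
The coefficients can be reproduced by multiplying out the five pole factors. A minimal sketch, assuming the dominant pole is the one with the real part closest to zero (the function name is illustrative):

```python
# The four LQR closed-loop poles quoted above
poles = [complex(-2.8862, 2.1606), complex(-2.8862, -2.1606),
         complex(-2.5800, 0.1461), complex(-2.5800, -0.1461)]

# Dominant pole: real part closest to the imaginary axis
dominant_real = max(p.real for p in poles)
fifth_pole = complex(6.0 * dominant_real, 0.0)


def poly_from_roots(roots):
    """Real coefficients of the monic polynomial with the given roots."""
    coeffs = [1 + 0j]
    for r in roots:
        new = coeffs + [0j]
        for i, c in enumerate(coeffs):
            new[i + 1] -= r * c
        coeffs = new
    return [c.real for c in coeffs]


# coeffs[0] is the leading 1; coeffs[1:] are p1 ... p5 of equation (14)
coeffs = poly_from_roots(poles + [fifth_pole])
for index, value in enumerate(coeffs[1:], start=1):
    print(f"p{index} = {value:.1f}")
```

The computed values match the quoted ones to within rounding in the last printed digit.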

Next, by substituting these p₁, p₂, p₃, p₄, p₅ and b₁, b₂, a obtained from (7), (8) in (15), we get
\begin{gather} \begin{bmatrix} -5.841 & 0 & 0 & 3.957 & 0 & 0\\ 0 & -5.841 & 0 & 0 & 3.957 & 0\\ 39.759 & 0 & -5.841 & 0 & 0 & 3.957\\ 0 & 39.759 & 0 & 0 & 0 & 0\\ 0 & 0 & 39.759 & 0 & 0 & 0\\ \end{bmatrix} \begin{bmatrix} k^1_d\\ k^1_p \\ k^1_i\\ k^2_d\\ k^2_p\\ k^2_i\\ \end{bmatrix} = \begin{bmatrix} 26.4\\ 225.5\\ 871.3\\ 1721.8\\ 1343.7\\ \end{bmatrix} \tag {22} \end{gather}
In (22), five poles need to be placed and there are six parameters, so one parameter has to be fixed. On choosing $$k^2_d = 10$$ (say), the PID parameters are obtained as
$$k^1_p =43.3 $$ $$k^1_d =2.254 $$ $$k^1_i =33.796 $$ $$k^2_p =120.9 $$ $$k^2_i = 247.43$$
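
Because the last two rows of (22) involve only the C₁ gains, the system can be solved row by row once k²_d is fixed. The sketch below does this by back-substitution and then substitutes the gains into the coefficients of (13) as a check. Variable names are illustrative; the inputs are the rounded values quoted in the text, so the results agree with the published gains only to within rounding.

```python
# Values quoted in (7), (8) and after (14)
b1, b2, a_sq = 5.841, 3.957, 6.807
p1, p2, p3, p4, p5 = 26.4, 218.6, 871.3, 1721.8, 1343.7
kd2 = 10.0  # the free parameter, fixed as in the text

# Rows 4 and 5 of (15) give the C1 proportional and integral gains directly
kp1 = p4 / (a_sq * b1)
ki1 = p5 / (a_sq * b1)
# Row 1 gives kd1 once kd2 is fixed
kd1 = (b2 * kd2 - p1) / b1
# Rows 2 and 3 then give the remaining C2 gains
kp2 = (p2 + a_sq + b1 * kp1) / b2
ki2 = (p3 - a_sq * b1 * kd1 + b1 * ki1) / b2

# Check: substitute the gains back into the coefficients of (13)
check = [
    -b1 * kd1 + b2 * kd2,
    -a_sq - b1 * kp1 + b2 * kp2,
    -b1 * ki1 + a_sq * b1 * kd1 + b2 * ki2,
    a_sq * b1 * kp1,
    a_sq * b1 * ki1,
]

print(f"kp1 = {kp1:.3f}, ki1 = {ki1:.3f}, kd1 = {kd1:.3f}")
print(f"kp2 = {kp2:.3f}, ki2 = {ki2:.3f}, kd2 = {kd2:.3f}")
```

A different choice of k²_d shifts k¹_d through the first row and k²_i through the third, while k¹_p and k¹_i are fixed by p₄ and p₅ alone.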

Applications

The Segway
Human posture control systems
Rocket launching, etc.

Basically, any system that requires vertical stabilization has dynamics that are similar to an inverted pendulum. The work involved in modeling and controlling an inverted pendulum can be carried over to many engineering areas.