Lecture 2: Convex Sets

Convex Optimization
Convex Sets
Mathematical Foundations
Author

Chao Ma

Published

December 3, 2025

Convex Sets

Affine Combination

\[x = \theta x_1 + (1-\theta)x_2, \quad \theta \in \mathbb{R}\]

Affine combination is the special form of linear combination where the sum of coefficients equals 1.

Example: Solution set \(\{x \mid Ax = b\}\)

For any \(x_1, x_2\) which both are solutions of \(Ax=b\):

\[A(\theta x_1 + (1-\theta)x_2) = \theta A x_1 + (1-\theta) A x_2 = \theta b + (1-\theta)b = b\]

Affine combination example Figure: Affine combinations form a line through two points. Unlike linear combinations, affine combinations require coefficients to sum to 1, making them translation-invariant—the solution set of Ax=b forms an affine subspace.


Convex Set

Linear segment: \(x = \theta x_1 + (1-\theta)x_2, \quad 0 \leq \theta \leq 1\)

Convex set: A set \(C\) is convex if for all \(x_1, x_2 \in C\) and \(0 \leq \theta \leq 1\):

\[\theta x_1 + (1-\theta)x_2 \in C\]

Convex set visualization Figure: A convex set contains all line segments between any two points in the set. The left shape is convex (any line segment between two points stays inside), while the right shape is non-convex (some line segments exit the set).


Convex Combination and Convex Hull

Convex combination:

\[x = \theta_1 x_1 + \theta_2 x_2 + \ldots + \theta_k x_k\]

where \(\theta_1 + \theta_2 + \ldots + \theta_k = 1\) and \(\theta_i \geq 0\)

Convex hull: The set of all convex combinations of points in \(S\)—the smallest convex set containing \(S\).

Convex hull example Figure: The convex hull of a finite set of points forms a convex polygon (or polyhedron in higher dimensions) that encloses all the points. It’s like stretching a rubber band around the outermost points.


Convex Cone

\[x = \theta_1 x_1 + \theta_2 x_2, \quad \theta_1 \geq 0, \theta_2 \geq 0\]

Convex cone visualization Figure: A convex cone is closed under positive linear combinations. If you take any two vectors in the cone and scale them by non-negative coefficients, their sum remains in the cone. The cone extends infinitely from the origin.


Hyperplanes and Halfspaces

Hyperplane

Set of the form \(a^\top x = b\) where \(a \neq 0\)

  • \(a\) is the normal vector
  • Hyperplanes are both affine and convex

Halfspaces

\[a^\top x \leq b\]

  • Halfspaces are convex

Hyperplane and halfspace Figure: A hyperplane (a^x = b) divides space into two halfspaces: a^x ≤ b and a^x ≥ b. The normal vector a points perpendicular to the hyperplane, and b controls the offset from the origin.


Euclidean Balls and Ellipsoids

Euclidean Ball

\[B(x_c, r) = \{x \mid \|x - x_c\|_2 \leq r\} = \{x_c + ru \mid \|u\|_2 \leq 1\}\]

Ellipsoid

\[\{x \mid (x - x_c)^\top P^{-1}(x - x_c) \leq 1\}\]

where \(P \in \mathbb{S}_{++}^n\) (P is positive definite symmetric)

Alternative representation: \(x_c + Au\) where \(A\) is an invertible square matrix

Balls and ellipsoids Figure: Euclidean balls have uniform radius in all directions, while ellipsoids stretch the ball along different axes. The matrix P in the ellipsoid formula determines the shape and orientation of the stretching.


Norm Balls and Norm Cones

Norm Ball

A norm function \(\|\cdot\|\) satisfies:

  1. \(\|x\| \geq 0\), and \(\|x\| = 0\) only if \(x = 0\)
  2. \(\|tx\| = |t| \cdot \|x\|\) for \(t \in \mathbb{R}\)
  3. \(\|x + y\| \leq \|x\| + \|y\|\) (triangle inequality)

Norm ball: \(\{x \mid \|x - x_c\| \leq r\}\)

Norm ball examples Figure: Different norms create different ball shapes. L1 norm creates a diamond, L2 norm creates a circle, and L∞ norm creates a square. All satisfy the norm axioms but measure distance differently.


Norm Cone

\[\{(x, t) \mid \|x\| \leq t\}\]

The Euclidean norm cone is called the second-order cone.

Norm cone visualization Figure: A norm cone extends infinitely upward, containing all points (x,t) where the norm of x is bounded by t. The second-order cone (using L2 norm) is fundamental in conic optimization and has special properties for efficient optimization.


Polyhedra

Defined by inequality and equality constraints:

\[Ax \preceq b, \quad Cx = d\]

where \(A \in \mathbb{R}^{m \times n}\) and \(C \in \mathbb{R}^{p \times n}\)

Here \(Ax \preceq b\) denotes componentwise inequality, i.e., \((Ax)_i \leq b_i\) for all \(i\).

Polyhedra example Figure: A polyhedron is the intersection of finitely many halfspaces and hyperplanes. Linear programming optimizes over polyhedra, making them fundamental to optimization theory. Every vertex represents a basic feasible solution.


Positive Semidefinite Cone

Let \(\mathbb{S}^n\) denote the set of \(n \times n\) symmetric matrices.

  • \(\mathbb{S}_+^n = \{X \in \mathbb{S}^n \mid X \succeq 0\}\)positive semidefinite
    • \(\mathbb{S}_+^n\) is a convex cone
  • \(\mathbb{S}_{++}^n = \{X \in \mathbb{S}^n \mid X \succ 0\}\)positive definite

Positive semidefinite cone Figure: The positive semidefinite cone contains all symmetric matrices with non-negative eigenvalues. It’s a convex cone in the space of symmetric matrices, fundamental to semidefinite programming (SDP).


Preserving Convexity

Testing Convexity

To determine if a set \(C\) is convex, check whether every convex combination of any two points in \(C\) is also in \(C\):

\[x_1, x_2 \in C, \quad 0 \leq \theta \leq 1 \implies \theta x_1 + (1-\theta)x_2 \in C\]

Operations That Preserve Convexity

1. Intersection: The intersection of convex sets is convex.

Intersection preserves convexity Figure: Intersecting convex sets always produces a convex set. This property is crucial because it allows us to build complex convex sets by intersecting simpler ones, like defining polyhedra as intersections of halfspaces.

2. Affine functions: \(f(x) = Ax + b\)

If \(C\) is convex, then both \(f(C)\) and \(f^{-1}(C)\) are convex.

Affine function preservation Figure: Affine transformations (scaling, rotation, translation) preserve convexity. If you take a convex set and apply an affine function, the image is still convex. Similarly, the pre-image of a convex set under an affine function is convex.

3. Perspective functions: \(P: \mathbb{R}^{n+1} \to \mathbb{R}^n\)

\[P(x, t) = x/t, \quad \text{dom}\,P = \{(x, t) \mid t > 0\}\]

4. Linear-fractional functions:

\[f(x) = \frac{Ax + b}{c^\top x + d}, \quad \text{dom}\,f = \{x \mid c^\top x + d > 0\}\]

Example: \(f(x) = \frac{1}{x_1 + x_2 + 1}x\)


Generalized Inequalities

A convex cone \(K \subseteq \mathbb{R}^n\) is a proper cone if:

  • \(K\) is closed (contains its boundary)
  • \(K\) is solid (has nonempty interior)
  • \(K\) is pointed (contains no line)

Examples of Proper Cones

  1. Nonnegative orthant: \(K = \mathbb{R}_+^n = \{x \in \mathbb{R}^n \mid x_i \geq 0, \, i=1,\ldots,n\}\)

  2. Positive semidefinite cone: \(\mathbb{S}_+^n\)

  3. Nonnegative polynomials on [0,1]: \[K = \{x \in \mathbb{R}^n \mid x_1 + x_2t + x_3t^2 + \ldots + x_nt^{n-1} \geq 0 \text{ for } t \in [0,1]\}\]

Generalized Inequality Notation

\[x \preceq_K y \Longleftrightarrow y - x \in K\] \[x \prec_K y \Longleftrightarrow y - x \in \text{int}\,K\]

Examples

  • Componentwise inequality (\(K = \mathbb{R}_+^n\)): \[x \preceq_{\mathbb{R}_+^n} y \Longleftrightarrow x_i \leq y_i \text{ for } i=1,2,\ldots,n\]

  • Matrix inequality (\(K = \mathbb{S}_+^n\)): \[X \preceq_{\mathbb{S}_+^n} Y \Longleftrightarrow Y - X \text{ is positive semidefinite}\]

Properties Similar to \(\leq\) in \(\mathbb{R}\)

\[x \preceq_K y, \, u \preceq_K v \implies x + u \preceq_K y + v\]


Minimum and Minimal Elements

Important: \(\preceq_K\) is not a total ordering—we can have \(x \not\preceq_K y\) and \(y \not\preceq_K x\) simultaneously.

  • Minimum: \(x\) is the minimum element of \(S\) with respect to \(\preceq_K\) if: \[y \in S \implies x \preceq_K y\]

  • Minimal: \(x\) is a minimal element of \(S\) with respect to \(\preceq_K\) if: \[y \in S, \, y \preceq_K x \implies y = x\]

Minimum and minimal elements Figure: The minimum element is unique and comparable to all other elements (if it exists). Minimal elements are not dominated by any other element in the set, but there can be multiple minimal elements that are incomparable to each other.


Key Insight

Mathematical foundations of convex optimization rest on convex sets and generalized inequalities. Convex sets (balls, cones, polyhedra, semidefinite cones) form the geometric foundation. Affine functions, intersections, and perspective operations preserve convexity, allowing us to construct complex convex sets from simpler building blocks. Generalized inequalities (\(\preceq_K\)) extend scalar ordering to vectors and matrices, enabling matrix inequalities (semidefinite constraints) and componentwise constraints. While generalized orderings lack total comparability, they retain key properties like additivity, making them powerful tools for formulating optimization problems over cones. These mathematical structures—convex sets, proper cones, and their preservation under operations—form the bedrock of convex optimization theory.