Merge tag 'riscv-for-linus-7.0-mw1' of git://git.kernel.org/pub/scm/linux/kernel/git/riscv/linux (cee73b1e) · Commits · git / linux-nf

Documentation/admin-guide/kernel-parameters.txt

+8 −0

Original line number	Diff line number	Diff line
		@@ -6641,6 +6641,14 @@ Kernel parameters
		replacement properties are not found. See the Kconfig
		entry for RISCV_ISA_FALLBACK.

		riscv_nousercfi=
		all Disable user CFI ABI to userspace even if cpu extension
		are available.
		bcfi Disable user backward CFI ABI to userspace even if
		the shadow stack extension is available.
		fcfi Disable user forward CFI ABI to userspace even if the
		landing pad extension is available.

		ro [KNL] Mount root device read-only on boot

		rodata= [KNL,EARLY]

Documentation/arch/riscv/hwprobe.rst

+5 −1

Original line number	Diff line number	Diff line
		@@ -67,7 +67,7 @@ The following keys are defined:
		programs (it may still be executed in userspace via a
		kernel-controlled mechanism such as the vDSO).

		* :c:macro:`RISCV_HWPROBE_KEY_IMA_EXT_0`: A bitmask containing the extensions
		* :c:macro:`RISCV_HWPROBE_KEY_IMA_EXT_0`: A bitmask containing extensions
		that are compatible with the :c:macro:`RISCV_HWPROBE_BASE_BEHAVIOR_IMA`:
		base system behavior.

		@@ -387,3 +387,7 @@ The following keys are defined:

		* :c:macro:`RISCV_HWPROBE_KEY_ZICBOP_BLOCK_SIZE`: An unsigned int which
		represents the size of the Zicbop block in bytes.

		* :c:macro:`RISCV_HWPROBE_KEY_IMA_EXT_1`: A bitmask containing additional
		extensions that are compatible with the
		:c:macro:`RISCV_HWPROBE_BASE_BEHAVIOR_IMA`: base system behavior.

Documentation/arch/riscv/index.rst

+2 −0

Original line number	Diff line number	Diff line
		@@ -14,5 +14,7 @@ RISC-V architecture
		uabi
		vector
		cmodx
		zicfilp
		zicfiss

		features

Documentation/arch/riscv/zicfilp.rst

0 → 100644

+122 −0

Original line number	Diff line number	Diff line
		.. SPDX-License-Identifier: GPL-2.0

		:Author: Deepak Gupta <debug@rivosinc.com>
		:Date: 12 January 2024

		====================================================
		Tracking indirect control transfers on RISC-V Linux
		====================================================

		This document briefly describes the interface provided to userspace by Linux
		to enable indirect branch tracking for user mode applications on RISC-V.

		1. Feature Overview
		--------------------

		Memory corruption issues usually result in crashes. However, in the
		hands of a creative adversary, these can result in a variety of
		security issues.

		Some of those security issues can be code re-use attacks, where an
		adversary can use corrupt function pointers, chaining them together to
		perform jump oriented programming (JOP) or call oriented programming
		(COP) and thus compromise control flow integrity (CFI) of the program.

		Function pointers live in read-write memory and thus are susceptible
		to corruption. This can allow an adversary to control the program
		counter (PC) value. On RISC-V, the zicfilp extension enforces a
		restriction on such indirect control transfers:

		- Indirect control transfers must land on a landing pad instruction ``lpad``.
		There are two exceptions to this rule:

		- rs1 = x1 or rs1 = x5, i.e. a return from a function and returns are
		protected using shadow stack (see zicfiss.rst)

		- rs1 = x7. On RISC-V, the compiler usually does the following to reach a
		function which is beyond the offset of possible J-type instruction::

		auipc x7, <imm>
		jalr (x7)

		This form of indirect control transfer is immutable and doesn't
		rely on memory. Thus rs1=x7 is exempted from tracking and
		these are considered software guarded jumps.

		The ``lpad`` instruction is a pseudo-op of ``auipc rd, <imm_20bit>``
		with ``rd=x0``. This is a HINT op. The ``lpad`` instruction must be
		aligned on a 4 byte boundary. It compares the 20 bit immediate with
		x7. If ``imm_20bit`` == 0, the CPU doesn't perform any comparison with
		``x7``. If ``imm_20bit`` != 0, then ``imm_20bit`` must match ``x7``
		else CPU will raise ``software check exception`` (``cause=18``) with
		``*tval = 2``.

		The compiler can generate a hash over function signatures and set them
		up (truncated to 20 bits) in x7 at callsites. Function prologues can
		have ``lpad`` instructions encoded with the same function hash. This
		further reduces the number of valid program counter addresses a call
		site can reach.

		2. ELF and psABI
		-----------------

		The toolchain sets up :c:macro:`GNU_PROPERTY_RISCV_FEATURE_1_FCFI` for
		property :c:macro:`GNU_PROPERTY_RISCV_FEATURE_1_AND` in the notes
		section of the object file.

		3. Linux enabling
		------------------

		User space programs can have multiple shared objects loaded in their
		address spaces. It's a difficult task to make sure all the
		dependencies have been compiled with indirect branch support. Thus
		it's left to the dynamic loader to enable indirect branch tracking for
		the program.

		4. prctl() enabling
		--------------------

		:c:macro:`PR_SET_INDIR_BR_LP_STATUS` / :c:macro:`PR_GET_INDIR_BR_LP_STATUS` /
		:c:macro:`PR_LOCK_INDIR_BR_LP_STATUS` are three prctls added to manage indirect
		branch tracking. These prctls are architecture-agnostic and return -EINVAL if
		the underlying functionality is not supported.

		* prctl(PR_SET_INDIR_BR_LP_STATUS, unsigned long arg)

		If arg1 is :c:macro:`PR_INDIR_BR_LP_ENABLE` and if CPU supports
		``zicfilp`` then the kernel will enable indirect branch tracking for the
		task. The dynamic loader can issue this :c:macro:`prctl` once it has
		determined that all the objects loaded in the address space support
		indirect branch tracking. Additionally, if there is a `dlopen` to an
		object which wasn't compiled with ``zicfilp``, the dynamic loader can
		issue this prctl with arg1 set to 0 (i.e. :c:macro:`PR_INDIR_BR_LP_ENABLE`
		cleared).

		* prctl(PR_GET_INDIR_BR_LP_STATUS, unsigned long * arg)

		Returns the current status of indirect branch tracking. If enabled
		it'll return :c:macro:`PR_INDIR_BR_LP_ENABLE`

		* prctl(PR_LOCK_INDIR_BR_LP_STATUS, unsigned long arg)

		Locks the current status of indirect branch tracking on the task. User
		space may want to run with a strict security posture and wouldn't want
		loading of objects without ``zicfilp`` support in them, to disallow
		disabling of indirect branch tracking. In this case, user space can
		use this prctl to lock the current settings.

		5. violations related to indirect branch tracking
		--------------------------------------------------

		Pertaining to indirect branch tracking, the CPU raises a software
		check exception in the following conditions:

		- missing ``lpad`` after indirect call / jmp
		- ``lpad`` not on 4 byte boundary
		- ``imm_20bit`` embedded in ``lpad`` instruction doesn't match with ``x7``

		In all 3 cases, ``*tval = 2`` is captured and software check exception is
		raised (``cause=18``).

		The kernel will treat this as :c:macro:`SIGSEGV` with code =
		:c:macro:`SEGV_CPERR` and follow the normal course of signal delivery.

Documentation/arch/riscv/zicfiss.rst

0 → 100644

+194 −0

Original line number	Diff line number	Diff line
		.. SPDX-License-Identifier: GPL-2.0

		:Author: Deepak Gupta <debug@rivosinc.com>
		:Date: 12 January 2024

		=========================================================
		Shadow stack to protect function returns on RISC-V Linux
		=========================================================

		This document briefly describes the interface provided to userspace by Linux
		to enable shadow stacks for user mode applications on RISC-V.

		1. Feature Overview
		--------------------

		Memory corruption issues usually result in crashes. However, in the
		hands of a creative adversary, these issues can result in a variety of
		security problems.

		Some of those security issues can be code re-use attacks on programs
		where an adversary can use corrupt return addresses present on the
		stack. chaining them together to perform return oriented programming
		(ROP) and thus compromising the control flow integrity (CFI) of the
		program.

		Return addresses live on the stack in read-write memory. Therefore
		they are susceptible to corruption, which allows an adversary to
		control the program counter. On RISC-V, the ``zicfiss`` extension
		provides an alternate stack (the "shadow stack") on which return
		addresses can be safely placed in the prologue of the function and
		retrieved in the epilogue. The ``zicfiss`` extension makes the
		following changes:

		- PTE encodings for shadow stack virtual memory
		An earlier reserved encoding in first stage translation i.e.
		PTE.R=0, PTE.W=1, PTE.X=0 becomes the PTE encoding for shadow stack pages.

		- The ``sspush x1/x5`` instruction pushes (stores) ``x1/x5`` to shadow stack.

		- The ``sspopchk x1/x5`` instruction pops (loads) from shadow stack and compares
		with ``x1/x5`` and if not equal, the CPU raises a ``software check exception``
		with ``*tval = 3``

		The compiler toolchain ensures that function prologues have ``sspush
		x1/x5`` to save the return address on shadow stack in addition to the
		regular stack. Similarly, function epilogues have ``ld x5,
		offset(x2)`` followed by ``sspopchk x5`` to ensure that a popped value
		from the regular stack matches with the popped value from the shadow
		stack.

		2. Shadow stack protections and linux memory manager
		-----------------------------------------------------

		As mentioned earlier, shadow stacks get new page table encodings that
		have some special properties assigned to them, along with instructions
		that operate on the shadow stacks:

		- Regular stores to shadow stack memory raise store access faults. This
		protects shadow stack memory from stray writes.

		- Regular loads from shadow stack memory are allowed. This allows
		stack trace utilities or backtrace functions to read the true call
		stack and ensure that it has not been tampered with.

		- Only shadow stack instructions can generate shadow stack loads or
		shadow stack stores.

		- Shadow stack loads and stores on read-only memory raise AMO/store
		page faults. Thus both ``sspush x1/x5`` and ``sspopchk x1/x5`` will
		raise AMO/store page fault. This simplies COW handling in kernel
		during fork(). The kernel can convert shadow stack pages into
		read-only memory (as it does for regular read-write memory). As
		soon as subsequent ``sspush`` or ``sspopchk`` instructions in
		userspace are encountered, the kernel can perform COW.

		- Shadow stack loads and stores on read-write or read-write-execute
		memory raise an access fault. This is a fatal condition because
		shadow stack loads and stores should never be operating on
		read-write or read-write-execute memory.

		3. ELF and psABI
		-----------------

		The toolchain sets up :c:macro:`GNU_PROPERTY_RISCV_FEATURE_1_BCFI` for
		property :c:macro:`GNU_PROPERTY_RISCV_FEATURE_1_AND` in the notes
		section of the object file.

		4. Linux enabling
		------------------

		User space programs can have multiple shared objects loaded in their
		address space. It's a difficult task to make sure all the
		dependencies have been compiled with shadow stack support. Thus
		it's left to the dynamic loader to enable shadow stacks for the
		program.

		5. prctl() enabling
		--------------------

		:c:macro:`PR_SET_SHADOW_STACK_STATUS` / :c:macro:`PR_GET_SHADOW_STACK_STATUS` /
		:c:macro:`PR_LOCK_SHADOW_STACK_STATUS` are three prctls added to manage shadow
		stack enabling for tasks. These prctls are architecture-agnostic and return
		-EINVAL if not implemented.

		* prctl(PR_SET_SHADOW_STACK_STATUS, unsigned long arg)

		If arg = :c:macro:`PR_SHADOW_STACK_ENABLE` and if CPU supports
		``zicfiss`` then the kernel will enable shadow stacks for the task.
		The dynamic loader can issue this :c:macro:`prctl` once it has
		determined that all the objects loaded in address space have support
		for shadow stacks. Additionally, if there is a :c:macro:`dlopen` to
		an object which wasn't compiled with ``zicfiss``, the dynamic loader
		can issue this prctl with arg set to 0 (i.e.
		:c:macro:`PR_SHADOW_STACK_ENABLE` being clear)

		* prctl(PR_GET_SHADOW_STACK_STATUS, unsigned long * arg)

		Returns the current status of indirect branch tracking. If enabled
		it'll return :c:macro:`PR_SHADOW_STACK_ENABLE`.

		* prctl(PR_LOCK_SHADOW_STACK_STATUS, unsigned long arg)

		Locks the current status of shadow stack enabling on the
		task. Userspace may want to run with a strict security posture and
		wouldn't want loading of objects without ``zicfiss`` support. In this
		case userspace can use this prctl to disallow disabling of shadow
		stacks on the current task.

		5. violations related to returns with shadow stack enabled
		-----------------------------------------------------------

		Pertaining to shadow stacks, the CPU raises a ``software check
		exception`` upon executing ``sspopchk x1/x5`` if ``x1/x5`` doesn't
		match the top of shadow stack. If a mismatch happens, then the CPU
		sets ``*tval = 3`` and raises the exception.

		The Linux kernel will treat this as a :c:macro:`SIGSEGV` with code =
		:c:macro:`SEGV_CPERR` and follow the normal course of signal delivery.

		6. Shadow stack tokens
		-----------------------

		Regular stores on shadow stacks are not allowed and thus can't be
		tampered with via arbitrary stray writes. However, one method of
		pivoting / switching to a shadow stack is simply writing to the CSR
		``CSR_SSP``. This will change the active shadow stack for the
		program. Writes to ``CSR_SSP`` in the program should be mostly
		limited to context switches, stack unwinds, or longjmp or similar
		mechanisms (like context switching of Green Threads) in languages like
		Go and Rust. CSR_SSP writes can be problematic because an attacker can
		use memory corruption bugs and leverage context switching routines to
		pivot to any shadow stack. Shadow stack tokens can help mitigate this
		problem by making sure that:

		- When software is switching away from a shadow stack, the shadow
		stack pointer should be saved on the shadow stack itself (this is
		called the ``shadow stack token``).

		- When software is switching to a shadow stack, it should read the
		``shadow stack token`` from the shadow stack pointer and verify that
		the ``shadow stack token`` itself is a pointer to the shadow stack
		itself.

		- Once the token verification is done, software can perform the write
		to ``CSR_SSP`` to switch shadow stacks.

		Here "software" could refer to the user mode task runtime itself,
		managing various contexts as part of a single thread. Or "software"
		could refer to the kernel, when the kernel has to deliver a signal to
		a user task and must save the shadow stack pointer. The kernel can
		perform similar procedure itself by saving a token on the user mode
		task's shadow stack. This way, whenever :c:macro:`sigreturn` happens,
		the kernel can read and verify the token and then switch to the shadow
		stack. Using this mechanism, the kernel helps the user task so that
		any corruption issue in the user task is not exploited by adversaries
		arbitrarily using :c:macro:`sigreturn`. Adversaries will have to make
		sure that there is a valid ``shadow stack token`` in addition to
		invoking :c:macro:`sigreturn`.

		7. Signal shadow stack
		-----------------------
		The following structure has been added to sigcontext for RISC-V::

		struct __sc_riscv_cfi_state {
		unsigned long ss_ptr;
		};

		As part of signal delivery, the shadow stack token is saved on the
		current shadow stack itself. The updated pointer is saved away in the
		:c:macro:`ss_ptr` field in :c:macro:`__sc_riscv_cfi_state` under
		:c:macro:`sigcontext`. The existing shadow stack allocation is used
		for signal delivery. During :c:macro:`sigreturn`, kernel will obtain
		:c:macro:`ss_ptr` from :c:macro:`sigcontext`, verify the saved
		token on the shadow stack, and switch the shadow stack.