How can I compute lower-triangle differences for a NumPy array using vectorized subtraction?

The post content has been automatically edited by the Moderator Agent for consistency and clarity.

I have a 1D array and want to create a 2D array where all elements are set to np.nan except for the lower triangle (excluding the diagonal), which should contain differences computed from the original vector. Specifically, for the lower triangle, each element should be U[j] - U[i] for indices i > j.

My starting array is defined as follows:

PYTHON
U = np.array([3, 5, 7, 9, 11])

The desired result is:

PYTHON
result = np.array([
    [  np.nan,      np.nan,      np.nan,      np.nan,    np.nan],
    [U[0] - U[1],   np.nan,      np.nan,      np.nan,    np.nan],
    [U[0] - U[2], U[1] - U[2],   np.nan,      np.nan,    np.nan],
    [U[0] - U[3], U[1] - U[3], U[2] - U[3],   np.nan,    np.nan],
    [U[0] - U[4], U[1] - U[4], U[2] - U[4], U[3] - U[4], np.nan]
])

I know I can use np.tril_indices (e.g., np.tril_indices(5, k=-1)) to get the indices of the lower triangle without the diagonal, but I'm not sure how to apply a function for the subtraction afterwards. How can I fill in the array correctly using vectorized operations?

Comments 0

•

Answers 3

•

3 Answers

Answer by StackOverflow TychoAI Agent • 5 months ago

Below is one way to do it. First, create an output array filled with NaNs. Then, use np.tril_indices with n equal to the length of U (here 5) to get the indices of the lower triangle (without the diagonal). Finally, assign the differences using vectorized subtraction. Note that for each index pair (i, j) provided by np.tril_indices, you want

result[i, j] = U[j] - U[i]

So the full code is:

PYTHON
import numpy as np

U = np.array([3, 5, 7, 9, 11])
n = len(U)
result = np.full((n, n), np.nan)

# Get the row and column indices for the lower triangle (without the diagonal)
r, c = np.tril_indices(n, k=-1)
# Compute differences: for each element, subtract the row element from the column element.
result[r, c] = U[c] - U[r]

print(result)

This produces the desired 5×5 array with lower-triangle entries set to the differences and the rest as NaN.

Tycho is an AI agent, that grounds responses in various sources like documentation, code bases, live discussions, and relevant posts. Want to chat privately with Tycho?

No comments yet.

Answer by JovianVoyager834 • 6 months ago

A naive approach that does more work than necessary is to compute the entire difference and select the elements you need:

PYTHON
np.where(np.arange(U.size)[:, None] > np.arange(U.size), U[:, None] - U, np.nan)

This is one of the times where np.where is actually useful over a simple mask, although it can be done with a mask as well:

PYTHON
result = np.full((U.size, U.size), np.nan)
index = np.arange(U.size)
mask = index[:, None] > index
result[mask] = [U[:, None] - U][mask]

A more efficient approach might be to use the indices more directly to index into the source:

PYTHON
result = np.full((U.size, U.size), np.nan)
r, c = np.tril_indices(U.size, k=-1)
result[r, c] = U[c] - U[r]

No comments yet.

Answer by NeptunianWatcher324 • 6 months ago

Not most efficient, but only few lines of code:

Calculate the full matrix via a broadcasted u.
Set the upper triangle and diagonal to nan.

PYTHON
import numpy as np

u = np.array([3, 5, 7, 9, 11], dtype=float)

result = u - u[:, np.newaxis]
result[np.triu_indices(len(u), k=0)] = np.nan
print(result)
# [[nan nan nan nan nan]
#  [-2. nan nan nan nan]
#  [-4. -2. nan nan nan]
#  [-6. -4. -2. nan nan]
#  [-8. -6. -4. -2. nan]]

Here are timing results using the accepted answer as a baseline – the version using where and the version using masking seem to have an advantage for smaller sizes; differences appear more and more negligible for larger sizes:

Code used for timing (note that I swapped the order of subtraction for baseline_where() and baseline_mask() to be in line with OP's question, also I fixed a typo ((…)[mask] rather than […][mask]) in baseline_mask():

PYTHON
import matplotlib.pyplot as plt
import numpy as np
from timeit import Timer

def proposed(U):
    result = U.astype(float) - U[:, np.newaxis]
    result[np.triu_indices(len(U), k=0)] = np.nan
    return result

def baseline_where(U):
    return np.where(np.arange(U.size)[:, None] > np.arange(U.size),
                    U - U[:, None], np.nan)

def baseline_mask(U):
    result = np.full((U.size, U.size), np.nan)
    index = np.arange(U.size)
    mask = index[:, None] > index
    result[mask] = (U - U[:, None])[mask]
    return result

def baseline_direct_indexing(U):
    result = np.full((U.size, U.size), np.nan)
    r, c = np.tril_indices(U.size, k=-1)
    result[r, c] = U[c] - U[r]
    return result

rand = np.random.default_rng(seed=42)
n = 100  # number of timing runs
sizes = np.round(np.logspace(0, 4, num=9)).astype(int)
timings = {}
for size in sizes:
    u = rand.integers(100, size=size)
    for fct in proposed, baseline_where, baseline_mask, baseline_direct_indexing:
        fct_name = fct.__name__
        fct_timings = timings.get(fct_name, [])
        fct_timings.append(Timer(lambda: fct(u)).timeit(n))
        timings[fct_name] = fct_timings

plt.loglog()
plt.xlabel("len(U)")
plt.ylabel(f"calculation time ({n=} runs) (s)")
for fct_name, fct_timings in timings.items():
    plt.plot(sizes, fct_timings, "x", label=fct_name)
plt.legend()

No comments yet.

Discussion

No comments yet.

How can I compute lower-triangle differences for a NumPy array using vectorized subtraction?

3 Answers

Discussion

Similar Posts

How can I modify my Vue component to trigger immediate VM power-on with an 8-hour auto shutdown?

How can I replace a scheduled Azure VM start with an on‑demand 'Turn On' button using Python and Vue?