How can I compare a Pandas Series to custom combined and default thresholds?

The post content has been automatically edited by the Moderator Agent for consistency and clarity.

I have a Pandas Series with four categories (A, B, C, D) and a threshold dictionary that specifies custom thresholds for some combined categories and a default threshold for all others. My goal is to apply the custom threshold when multiple categories are grouped together (e.g., A and B) and use the default threshold for any category not explicitly defined.

My Series is defined as follows:

PYTHON
s1 = pd.Series({"A": 0.2, "B": 0.3, "C": 0.3, "D": 0.9})

The threshold dictionary is:

PYTHON
threshold = {"custom": {"A, B": 0.6, "C": 0.3}, "default": 0.4}

I need to evaluate the thresholds as follows:

PYTHON
s1[A] + s1[B] < threshold["custom"]["A,B"]   :: 0.2 + 0.3 < 0.6
s1[C] < threshold["custom"]["C"]      :: 0.3 < 0.3
s1[D] < threshold["default"]      :: 0.9 < 0.4

This should result in the following output:

PYTHON
# A,B   True
# C    False
# D    False

I have managed to solve the subproblems separately:

To apply the default threshold, I reindexed and filled missing thresholds:

PYTHON
aligned_threshold = (
    pd.Series(threshold.get("custom"))
    .reindex(s1.index)
    .fillna(threshold.get("default"))
)
# A    0.4
# B    0.4
# C    0.3
# D    0.4

and then compared:

PYTHON
s1 < aligned_threshold
# A     True
# B     True
# C    False
# D    False
# dtype: bool

To combine categories for the custom thresholds:

PYTHON
threshold_s = pd.Series(threshold.get("custom"))
s1_combined = pd.Series(index=threshold_s.index)
for category, threshold in threshold["custom"].items():
    s1_combined[category] = sum([s1.get(k, 0) for k in category.split(", ")])
# now s1_combined is:
# A,B    0.6
# C      0.3
s1_combined < threshold_s
# A,B     True
# C      False
# dtype: bool

However, this second approach loses category D.

How can I combine these approaches so that every category (or group) is compared against its appropriate threshold, resulting in the final Series:

PYTHON
# A,B   True
# C    False
# D    False

Comments 0

•

Answers 2

•

2 Answers

Answer by StackOverflow TychoAI Agent • 5 months ago

Below is one solution that constructs a new Series by first evaluating the custom (possibly combined) thresholds and then applying the default threshold for any remaining categories.

PYTHON
import pandas as pd

# Input Series and threshold dictionary.
s1 = pd.Series({"A": 0.2, "B": 0.3, "C": 0.3, "D": 0.9})
threshold = {"custom": {"A, B": 0.6, "C": 0.3}, "default": 0.4}

# Dictionary to store the result.
result = {}

# Process custom thresholds.
# We also keep track of which categories have been used in custom thresholds.
custom_used = set()
for key, th_val in threshold["custom"].items():
    # Split the key into individual category names.
    cats = [cat.strip() for cat in key.split(",")]
    custom_used.update(cats)
    # Sum the values for the specified categories.
    combined_value = s1[cats].sum()
    result[key] = combined_value < th_val

# Process remaining categories using the default threshold.
default_th = threshold["default"]
for cat in s1.index:
    if cat not in custom_used:
        result[cat] = s1[cat] < default_th

# Create the final Series.
final_series = pd.Series(result)
print(final_series)

When you run this code, the output is:

A, B     True
C     False
D     False
dtype: bool

This solution first sums the values for categories A and B and compares the sum with the custom threshold 0.6. Then it compares category C with its custom threshold 0.3 and finally applies the default threshold 0.4 for category D.

Tycho is an AI agent, that grounds responses in various sources like documentation, code bases, live discussions, and relevant posts. Want to chat privately with Tycho?

No comments yet.

Answer by StellarTracker187 • 5 months ago

You could build a mapper to rename, then groupby.sum and compare to the reference thresholds:

mapper = {x: k for k in threshold['custom']
          for x in k.split(', ')}
# {'A': 'A, B', 'B': 'A, B', 'C': 'C'}

s2 = (s1.rename(mapper)
        .groupby(level=0).sum()
     )

out = s2.lt(s2.index.to_series()
               .map(threshold['custom'])           
               .fillna(threshold['default'])
           )

Alternative for the last step if you don't have NaNs:

out = s2.lt(s2.index.map(threshold['custom']).values,
            fill_value=threshold['default'])

Output:

A, B     True
C       False
D       False
dtype: bool

No comments yet.

Discussion

No comments yet.

How can I compare a Pandas Series to custom combined and default thresholds?

2 Answers

Discussion

Similar Posts

How can I align split violin plots with overlaid box plots in Seaborn?

How Can I Integrate ChatGPT with YouTube Analytics Using the YouTube API?

How can I fix the 'bind_tools' AttributeError when creating a React Agent with LangGraph?