move fragment types to constant #263

GeorgWa · 2024-12-29T15:30:55Z

Until now, the definition of a valid fragment type was somewhat loose.
This PR:

Defines all valid fragment types as FRAGMENT_TYPES, which allows checking whether a type is supported
Drops partial support for non-charged fragment types
Issues deprecation warnings for old data structures
Defines a fixed order for charged fragment types, so that mismatches between prediction models and libraries are mitigated
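
To illustrate the first point, here is a minimal sketch of how a central FRAGMENT_TYPES registry allows a fragment type to be validated up front. The field names name, ref_ion and formula follow the diff under review, and the c_lossH and z_addH entries are derived from the removed string definitions. The empty formulas for b and y and the helper is_supported_frag_type are assumptions for illustration, not the PR's actual code.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass(frozen=True)
class FragmentType:
    """Minimal stand-in for the FragmentType used in the PR."""

    name: str
    ref_ion: str
    formula: str


# Illustrative subset only; the real constant lists every supported type.
FRAGMENT_TYPES: Dict[str, FragmentType] = {
    "b": FragmentType(name="b", ref_ion="b", formula=""),  # empty formula is an assumption
    "y": FragmentType(name="y", ref_ion="y", formula=""),  # empty formula is an assumption
    "c_lossH": FragmentType(name="c_lossH", ref_ion="b", formula="N(1)H(2)"),
    "z_addH": FragmentType(name="z_addH", ref_ion="y", formula="N(-1)H(-1)"),
}


def is_supported_frag_type(frag_type: str) -> bool:
    """Hypothetical helper: a type is supported if it is a registry key."""
    return frag_type in FRAGMENT_TYPES
```

With a single registry, callers can reject unknown types early instead of failing somewhere downstream.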

GeorgWa · 2024-12-29T15:41:16Z

This currently fails because the pinned version rdkit==2024.3.3 is being ignored

review-notebook-app · 2024-12-29T17:18:20Z

Check out this pull request on ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

jalew188

LGTM

jalew188 · 2024-12-30T02:20:33Z

alphabase/peptide/fragment.py

+    "y_modloss": FragmentType(
+        name="y_modloss",
+        ref_ion="y",
+        formula="N(-1)H(-2)",


loss formula?

Please check others

mschwoer · 2025-01-10T09:33:21Z

alphabase/peptide/fragment.py

-    "c_lossH": "b+N(1)H(2)",
-    "z_addH": "y+N(-1)H(-1)",
+
+class DIRECTION:


Direction (also LOSS, LOSS_INVERSE, ...)

mschwoer · 2025-01-10T09:35:24Z

alphabase/peptide/fragment.py

@@ -66,11 +302,20 @@ def parse_all_frag_type_representation():
 parse_all_frag_type_representation()


+def sort_charged_frag_types(charged_frag_types: List[str]) -> List[str]:
+    """charged frag types are sorted by (no-loss, loss) and then alphabetically"""
+    has_loss = [f.count("_") > 1 for f in charged_frag_types]


has_loss = [f.replace(FRAGMENT_CHARGE_SEPARATOR, "").count("_") > 0 for f in charged_frag_types]
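
A rough sketch of the sort with the suggested has_loss check, assuming FRAGMENT_CHARGE_SEPARATOR is "_z" (its value is not shown in this diff) and that "alphabetically" means plain string order within each group:

```python
from typing import List, Tuple

FRAGMENT_CHARGE_SEPARATOR = "_z"  # assumed value, not taken from the diff


def sort_charged_frag_types(charged_frag_types: List[str]) -> List[str]:
    """Sort no-loss types first, then loss types, each group alphabetically."""

    def sort_key(charged_frag_type: str) -> Tuple[bool, str]:
        # Remove the charge separator so only a loss suffix contributes "_".
        without_charge = charged_frag_type.replace(FRAGMENT_CHARGE_SEPARATOR, "")
        has_loss = without_charge.count("_") > 0
        # False sorts before True, so no-loss types come first.
        return (has_loss, charged_frag_type)

    return sorted(charged_frag_types, key=sort_key)
```

Unlike counting underscores on the raw string, this does not rely on the separator containing exactly one underscore.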

mschwoer · 2025-01-10T09:36:01Z

alphabase/peptide/fragment.py

@@ -93,9 +338,12 @@ def get_charged_frag_types(
    """
    charged_frag_types = []
    for _type in frag_types:


Rename _type -> frag_type and _ch -> charge?

mschwoer · 2025-01-10T09:38:30Z

alphabase/peptide/fragment.py

+
+    if FRAGMENT_CHARGE_SEPARATOR in charged_frag_type:
+        _type, _ch = charged_frag_type.split(FRAGMENT_CHARGE_SEPARATOR)
+        return _type, int(_ch)


Here you could raise if _ch is not an int.
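
Something along these lines, as a sketch only. The separator value "_z" is an assumption, and since the diff does not show what happens when the separator is missing, raising in that case is an assumption too. It uses rsplit with maxsplit=1 rather than the split from the diff, so that unpacking cannot fail on extra separators.

```python
from typing import Tuple

FRAGMENT_CHARGE_SEPARATOR = "_z"  # assumed value, not taken from the diff


def parse_charged_frag_type(charged_frag_type: str) -> Tuple[str, int]:
    """Split e.g. "b_modloss_z2" into ("b_modloss", 2), raising on bad input."""
    if FRAGMENT_CHARGE_SEPARATOR not in charged_frag_type:
        # Assumption: a missing charge is treated as an error in this sketch.
        raise ValueError(f"No charge found in '{charged_frag_type}'")

    frag_type, charge = charged_frag_type.rsplit(FRAGMENT_CHARGE_SEPARATOR, 1)
    try:
        return frag_type, int(charge)
    except ValueError as e:
        raise ValueError(
            f"Charge '{charge}' in '{charged_frag_type}' is not an integer"
        ) from e
```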

mschwoer · 2025-01-10T09:38:44Z

alphabase/peptide/fragment.py

-    return _type, int(_ch)
+
+    if FRAGMENT_CHARGE_SEPARATOR in charged_frag_type:
+        _type, _ch = charged_frag_type.split(FRAGMENT_CHARGE_SEPARATOR)


Rename _type -> frag_type and _ch -> charge?

mschwoer · 2025-01-10T10:02:36Z

alphabase/peptide/fragment.py

-        else:
-            frag_directions.append(0)
+    for charged_frag_type in fragment_mz_df.columns.values:
+        frag_type, charge = parse_charged_frag_type(charged_frag_type)


you knew it all along!

mschwoer · 2025-01-10T10:05:15Z

alphabase/peptide/fragment.py

+    for charged_frag_type in fragment_mz_df.columns.values:
+        frag_type, charge = parse_charged_frag_type(charged_frag_type)
+        frag_charges.append(charge)
+        frag_types.append(FRAGMENT_TYPES[frag_type].series)


the new implementation really offers enhanced maintainability and readability!

mschwoer · 2025-01-10T10:21:18Z

alphabase/utils.py


 import pandas as pd
 import tqdm


+# Create a warning class for deprecation


If I could improve only one thing in code generated by LLMs, it would be "drop this comment describing what the next line of code does".
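
For reference, a self-explanatory warning class needs no such comment. A minimal sketch, where LegacyStructureWarning and get_legacy_mapping are hypothetical names and not the ones used in alphabase/utils.py:

```python
import warnings


class LegacyStructureWarning(DeprecationWarning):
    """Issued when a deprecated data structure is accessed."""


def get_legacy_mapping() -> dict:
    """Hypothetical accessor for an old-style string definition."""
    warnings.warn(
        "This mapping is deprecated, use FRAGMENT_TYPES instead.",
        LegacyStructureWarning,
        stacklevel=2,  # attribute the warning to the caller
    )
    return {"c_lossH": "b+N(1)H(2)"}
```

Note that Python ignores DeprecationWarning by default unless it is triggered by code in __main__, so subclassing it means most users only see the warning when running tests or with warnings enabled.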

mschwoer · 2025-01-10T10:22:05Z

alphabase/peptide/fragment.py

+
+# because we dont know the loss, we assume every loss type is phospho
+class LOSS:
+    MODLOSS = 98


maybe give some context here what these numbers mean?

mschwoer · 2025-01-10T10:24:39Z

alphabase/peptide/fragment.py

+class LOSS:
+    MODLOSS = 98
+    H2O = 18
+    NH3 = 17
+    LOSSH = 1
+    ADDH = 2
+    NONE = 0
+
+
+LOSS_INVERSE = {
+    18: "H2O",
+    17: "NH3",
+    98: "modloss",
+    1: "lossH",
+    0: "",
+    2: "addH",
 }


Consider using this idiom:

class Losses:
    """String constants defining losses."""
    H2O = "H2O"
    ...

LOSS_MAPPING = {
    # Mapping of loss names to ... .
    Losses.H2O: 18,
    ...
}
LOSS_MAPPING_INV = {v: k for k, v in LOSS_MAPPING.items()}

(same for SERIES)
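
Spelled out with the values from the diff above, the idiom could look like the following sketch. The attribute names on Losses mirror the LOSS class in the diff. The uniqueness check at the end is an addition for illustration, since the inverse mapping is only well defined if no two losses share a value.

```python
class Losses:
    """String constants naming the supported losses."""

    MODLOSS = "modloss"
    H2O = "H2O"
    NH3 = "NH3"
    LOSSH = "lossH"
    ADDH = "addH"
    NONE = ""


# Loss name -> integer code, values copied from the LOSS class in the diff.
LOSS_MAPPING = {
    Losses.MODLOSS: 98,
    Losses.H2O: 18,
    Losses.NH3: 17,
    Losses.LOSSH: 1,
    Losses.ADDH: 2,
    Losses.NONE: 0,
}

# Derived automatically, so the two directions cannot drift apart.
LOSS_MAPPING_INV = {v: k for k, v in LOSS_MAPPING.items()}

# Duplicate codes would silently drop entries from the inverse mapping.
assert len(LOSS_MAPPING_INV) == len(LOSS_MAPPING)
```

The benefit over two hand-written structures is that adding a loss requires touching only Losses and LOSS_MAPPING.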

move fragment types to config

41b3b71

GeorgWa requested review from jalew188 and mschwoer December 29, 2024 15:30

add TODOs

528071c

update tests

7dcfb47

jalew188 approved these changes Dec 30, 2024

jalew188 reviewed Dec 30, 2024

update loss

0649c45

mschwoer reviewed Jan 10, 2025
