Compare agent bet histories with kelly strategy #419

evangriffiths · 2024-09-19T12:12:30Z

See output in the comments when running the modified examples/monitor/match_bets_with_langfuse_traces.py script.

TL;DR Kelly gives consistently better results! 🚀

coderabbitai · 2024-09-19T12:12:43Z

Walkthrough

The changes significantly enhance the simulation of betting strategies by introducing a new data model for simulated outcomes and expanding the logic for various betting strategies. A new function computes outcomes based on market traces, and the execution flow is modified to ensure all bets have corresponding traces. Additionally, a new class for dynamically adjusting bet amounts based on market conditions is introduced, while a sanity check is removed to simplify trace retrieval. A function to retrieve private keys from Google Cloud Platform's Secret Manager is also added.

Changes

File Path	Change Summary
examples/monitor/match_bets_with_langfuse_traces.py	Introduced a new data model `SimulatedOutcome`, added `get_outcome_for_trace` function for computing bet outcomes, and modified execution to handle multiple agents and strategies.
prediction_market_agent_tooling/deploy/betting_strategy.py	Added `MaxAccuracyWithKellyScaledBetsStrategy` class for adjusting bet amounts based on market conditions, including methods for calculating trades and string representation.
prediction_market_agent_tooling/tools/langfuse_client_utils.py	Removed a sanity check from the `get_trace_for_bet` function, allowing traces that occur before a bet's creation time to be returned.
prediction_market_agent_tooling/tools/utils.py	Added a new function `get_private_key_from_gcp_secret` to retrieve private keys from Google Cloud Platform's Secret Manager.

Possibly related issues

Refactor DeployableTraderAgent to 1. predict a probability for a market then 2. place a bet based on its BettingStrategy #338: The refactor of DeployableTraderAgent to better integrate betting strategies aligns with the changes made to enhance betting strategy simulations.

Possibly related PRs

Add simplified get_kelly_bet function, remove old #330: The introduction of the get_kelly_bet function and enhancements in the Kelly criterion betting strategy are relevant as the main PR also expands on betting strategies, particularly with the addition of multiple strategies including the Kelly criterion.
Updating langfuse integration #345: This PR updates the Langfuse integration, which is relevant because the main PR modifies how bets are matched with Langfuse traces, indicating a connection in their functionalities.
Refactor DeployableTraderAgent to 1. predict a probability for a market then 2. place a bet based on its BettingStrategy #352: The changes to the DeployableTraderAgent class and its betting strategy are related to the main PR's focus on simulating betting strategies and outcomes.
Improve matching of langfuse traces with resolved bets. Add example #418: The introduction of utilities for matching bets with Langfuse traces directly relates to the main PR's functionality of processing and matching bets with traces.
Remove timestamp check when matching bets with traces #421: The removal of the timestamp check when matching bets with traces is directly relevant to the main PR's focus on improving the matching logic between bets and traces.

Suggested reviewers

evangriffiths

Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media?

Share

Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
-- I pushed a fix in commit <commit_id>, please review it.
-- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
-- @coderabbitai generate unit testing code for this file.
-- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
-- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
-- @coderabbitai read src/utils.ts and generate unit testing code.
-- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
-- @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

evangriffiths · 2024-09-19T12:13:02Z

Agent Bet vs Theoretical Kelly Bet Comparison

DeployablePredictionProphetGPT4TurboFinalAgent

Number of bets since 2024-09-13 00:00:00: 35
Actual Bet: ROI=-24.97%, amount=36.18, profit=-9.03
Kelly Bet: ROI=-12.87%, amount=30.18, profit=-3.88

DeployablePredictionProphetGPT4TurboPreviewAgent

Number of bets since 2024-09-13 00:00:00: 27
Actual Bet: ROI=-10.90%, amount=27.00, profit=-2.94
Kelly Bet: ROI=14.03%, amount=24.28, profit=3.41

DeployablePredictionProphetGPT4oAgent

Number of bets since 2024-09-13 00:00:00: 34
Actual Bet: ROI=-14.82%, amount=34.00, profit=-5.04
Kelly Bet: ROI=4.54%, amount=25.20, profit=1.14

DeployableOlasEmbeddingOAAgent

Number of bets since 2024-09-13 00:00:00: 34
Actual Bet: ROI=10.94%, amount=34.00, profit=3.72
Kelly Bet: ROI=20.16%, amount=41.91, profit=8.45

DeployableKnownOutcomeAgent

Number of bets since 2024-09-13 00:00:00: 15
Actual Bet: ROI=33.87%, amount=15.00, profit=5.08
Kelly Bet: ROI=38.28%, amount=21.78, profit=8.34

gabrielfior · 2024-09-19T12:30:13Z

examples/monitor/match_bets_with_langfuse_traces.py

+    received_outcome_tokens = market.get_buy_token_amount(
+        bet_amount=market.get_bet_amount(kelly_bet.size),
+        direction=kelly_bet.direction,
+    ).amount


Wondering if we can get the received_outcome_tokens directly from the subgraph instead of calculating?
DId you sample a few datapoints to see if there is a diff?

I haven't but I'm confident that it's correct based on this test:

prediction-market-agent-tooling/tests/markets/omen/test_omen.py

Line 250 in 3794ce2

def test_get_buy_token_amount(direction: bool) -> None:

kongzii · 2024-09-20T08:17:47Z

I'm sorry, but I was still bugged a little. It seems that if kelly's max_bet == max accuracy constant bet, Kelly will yield better results, as in your experiments.

But if we are gonna do this, then we have a problem (experiment from latest commit I pushed, seems like it doesn't show up in closed PR):

# Agent Bet vs Theoretical Kelly Bet Comparison

## DeployablePredictionProphetGPT4TurboFinalAgent

Number of bets since 2024-09-13 00:00:00: 48
Actual Bet: ROI=-15.68%, amount=49.18, profit=-7.71
Kelly Bet: ROI=-19.23%, amount=202.94, profit=-39.02

## DeployablePredictionProphetGPT4TurboPreviewAgent

Number of bets since 2024-09-13 00:00:00: 37
Actual Bet: ROI=-12.37%, amount=37.00, profit=-4.58
Kelly Bet: ROI=-17.00%, amount=186.00, profit=-31.61

## DeployablePredictionProphetGPT4oAgent

Number of bets since 2024-09-13 00:00:00: 46
Actual Bet: ROI=-15.93%, amount=46.00, profit=-7.33
Kelly Bet: ROI=-23.94%, amount=169.28, profit=-40.53

## DeployableOlasEmbeddingOAAgent

Number of bets since 2024-09-13 00:00:00: 46
Actual Bet: ROI=5.04%, amount=46.00, profit=2.32
Kelly Bet: ROI=-18.12%, amount=165.55, profit=-29.99

## DeployableKnownOutcomeAgent

Number of bets since 2024-09-13 00:00:00: 28
Actual Bet: ROI=38.43%, amount=28.00, profit=10.76
Kelly Bet: ROI=16.81%, amount=376.06, profit=63.20, final agent balance: 113.20

coderabbitai

Actionable comments posted: 3

Outside diff range and nitpick comments (1)

examples/monitor/match_bets_with_langfuse_traces.py (1)
124-125: Use logging instead of print statements for better practice

Using the logging module instead of print statements provides better control over log levels and output formatting, which is essential for larger applications.

Import the logging module and replace print statements with appropriate logging calls:
+import logging
...
 if agent_balance <= 0:
-    print(f"Agent died with balance {agent_balance}.")
+    logging.info(f"Agent died with balance {agent_balance}.")
     break

Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between b7168ae and b78489f.

Files selected for processing (2)

examples/monitor/match_bets_with_langfuse_traces.py (1 hunks)
prediction_market_agent_tooling/tools/langfuse_client_utils.py (0 hunks)

Files not reviewed due to no reviewable changes (1)

prediction_market_agent_tooling/tools/langfuse_client_utils.py

coderabbitai · 2024-09-20T08:22:34Z

examples/monitor/match_bets_with_langfuse_traces.py

+        roi = 100 * total_bet_profit / total_bet_amount
+        kelly_roi = 100 * total_kelly_profit / total_kelly_amount


Prevent potential division by zero in ROI calculations

If total_bet_amount or total_kelly_amount is zero, calculating ROI will result in a ZeroDivisionError. To prevent this, add a check before performing the division.

Update the ROI calculations to handle zero amounts safely:

roi = 100 * total_bet_profit / total_bet_amount if total_bet_amount != 0 else 0 kelly_roi = 100 * total_kelly_profit / total_kelly_amount if total_kelly_amount != 0 else 0

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

roi = 100 * total_bet_profit / total_bet_amount

kelly_roi = 100 * total_kelly_profit / total_kelly_amount

roi = 100 * total_bet_profit / total_bet_amount if total_bet_amount != 0 else 0

kelly_roi = 100 * total_kelly_profit / total_kelly_amount if total_kelly_amount != 0 else 0

examples/monitor/match_bets_with_langfuse_traces.py

coderabbitai

Actionable comments posted: 2

Outside diff range and nitpick comments (1)

prediction_market_agent_tooling/tools/utils.py (1)

218-230: Add Docstring to Enhance Readability and Maintainability

Adding a docstring to the get_private_key_from_gcp_secret function will help other developers understand its purpose, parameters, and return value.

Include a descriptive docstring:

def get_private_key_from_gcp_secret(
    secret_id: str,
    project_id: str,
    version_id: str = "latest",
) -> PrivateKey:
    """
    Retrieves a private key from Google Cloud Secret Manager.

    Args:
        secret_id (str): The ID of the secret to retrieve.
        project_id (str): The Google Cloud project ID.
        version_id (str, optional): The version of the secret. Defaults to "latest".

    Returns:
        PrivateKey: The retrieved private key wrapped in a SecretStr.

    Raises:
        ValueError: If the private key is not found or if the secret cannot be accessed.
    """
    # Function implementation...

Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between 1212dca and c6ee7fe.

Files selected for processing (2)

examples/monitor/match_bets_with_langfuse_traces.py (1 hunks)
prediction_market_agent_tooling/tools/utils.py (3 hunks)

Additional comments not posted (12)

examples/monitor/match_bets_with_langfuse_traces.py (7)
4-4: LGTM!

The import statement for BaseModel from pydantic is correct.

9-11: LGTM!

The import statement for get_kelly_bet_full is correct.

18-21: LGTM!

The import statements for check_not_none and get_private_key_from_gcp_secret are correct.

24-28: LGTM!

The KellyBetOutcome data model is defined correctly using BaseModel from pydantic. The fields capture the essential information about a Kelly bet outcome.

31-60: LGTM!

The get_kelly_bet_outcome_for_trace function is implemented correctly. It calculates the Kelly bet size, direction, correctness, and profit based on the input trace and market outcome using the get_kelly_bet_full function and the market information from the trace. The function logic and implementation look good.

64-160: LGTM!

The main execution block of the script is well-structured and follows a logical flow. It retrieves private keys for agents from GCP secrets, iterates over each agent, retrieves their traces and resolved bets, and compares the actual bets with theoretical Kelly bets. The simulation of the betting process using the agent's balance provides a realistic comparison between actual bets and Kelly bets. The ROI calculations for actual bets and Kelly bets provide a meaningful comparison of their performance. The code segment looks good overall.

120-124: Handle missing traces without interrupting execution

Raising a ValueError if some bets do not have a corresponding trace may halt the analysis unnecessarily. Since it's possible for bets to exist without traces, consider handling this scenario gracefully, perhaps by logging a warning and proceeding with the available data.

Modify the code to log a warning instead of raising an exception:
 if len(bets_with_traces) != len(bets):
-    raise ValueError(
-        f"{len(bets) - len(bets_with_traces)} bets do not have a corresponding trace"
-    )
+    print(
+        f"Warning: {len(bets) - len(bets_with_traces)} bets do not have a corresponding trace"
+    )
Likely invalid or redundant comment.
prediction_market_agent_tooling/tools/utils.py (5)
228-229: Avoid Exposing Sensitive Information in Error Messages

The error message includes secret_id, which may be sensitive. To prevent potential information leakage, avoid revealing sensitive details in exception messages.

[security]

Modify the error message to exclude secret_id:
-        raise ValueError(f"Private key not found in gcp secret {secret_id}")
+        raise ValueError("Private key not found in GCP secret")
223-230: Ensure Secure Handling of Private Keys

When dealing with private keys, it's crucial to handle them securely to prevent unauthorized access. Ensure that the key is not logged, printed, or exposed in any way throughout the codebase.

[security]

1-1: Import of json Module is Appropriate

The json module is imported and used correctly to parse the secret payload. No issues found.

10-10: Import of secretmanager Module is Correct

The import statement from google.cloud import secretmanager is necessary for accessing GCP Secret Manager services. No issues found.

17-17: Import of PrivateKey is Necessary

The PrivateKey class is imported from prediction_market_agent_tooling.gtypes and used as the return type for the new function. No issues found.

coderabbitai · 2024-09-20T12:35:41Z

prediction_market_agent_tooling/tools/utils.py

+    client = secretmanager.SecretManagerServiceClient()
+    name = f"projects/{project_id}/secrets/{secret_id}/versions/{version_id}"
+    response = client.access_secret_version(request={"name": name})


Add Exception Handling for Secret Manager Operations

Accessing secrets from GCP Secret Manager can raise exceptions due to network errors, authentication issues, or missing permissions. To improve robustness, add exception handling to gracefully manage potential errors.

Wrap the secret access code in a try-except block:

try: client = secretmanager.SecretManagerServiceClient() name = f"projects/{project_id}/secrets/{secret_id}/versions/{version_id}" response = client.access_secret_version(request={"name": name}) except Exception as e: raise ValueError(f"Failed to access secret: {e}")

coderabbitai · 2024-09-20T12:35:41Z

prediction_market_agent_tooling/tools/utils.py

+
+def get_private_key_from_gcp_secret(
+    secret_id: str,
+    project_id: str = "582587111398",  # Gnosis AI default project_id


Consider Removing Hardcoded Default project_id

Hardcoding the project_id as a default parameter reduces the function's flexibility and may lead to issues in different environments or projects. It's better to require project_id as a mandatory parameter or retrieve it from an environment variable or configuration file.

Apply this diff to make project_id a required parameter:

- project_id: str = "582587111398", # Gnosis AI default project_id + project_id: str,

Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

project_id: str = "582587111398", # Gnosis AI default project_id

project_id: str,

coderabbitai

Actionable comments posted: 5

Outside diff range and nitpick comments (4)

examples/monitor/match_bets_with_langfuse_traces.py (3)
Line range hint 1-34: LGTM! Consider adding docstrings for better documentation.

The new imports and the SimulatedOutcome class are well-structured and appropriate for the added functionality. The use of pydantic for the SimulatedOutcome class ensures type safety.

Consider adding a docstring to the SimulatedOutcome class to describe its purpose and attributes. For example:
class SimulatedOutcome(BaseModel):
    """
    Represents the outcome of a simulated bet.

    Attributes:
        size (float): The size of the bet.
        direction (bool): The direction of the bet (True for Yes, False for No).
        correct (bool): Whether the bet was correct.
        profit (float): The profit or loss from the bet.
    """
    size: float
    direction: bool
    correct: bool
    profit: float
36-80: LGTM! Consider adding type hints for better clarity.

The get_outcome_for_trace function is well-implemented and handles various scenarios correctly. The assertions help ensure the expected behavior, and the profit calculation logic is sound.

Consider adding type hints to the local variables for better clarity. For example:
def get_outcome_for_trace(
    strategy: BettingStrategy,
    trace: ProcessMarketTrace,
    market_outcome: bool,
) -> SimulatedOutcome | None:
    market: OmenAgentMarket = trace.market
    answer: ProbabilisticAnswer = trace.answer

    trades: list[Trade] = strategy.calculate_trades(
        existing_position=None,
        answer=ProbabilisticAnswer(
            p_yes=answer.p_yes,
            confidence=answer.confidence,
        ),
        market=market,
    )
    # ... rest of the function
This change would make the types of market, answer, and trades more explicit, improving code readability.

83-111: LGTM! Consider parameterizing the start date for increased flexibility.

The setup for the main execution block is comprehensive and well-structured. The use of GCP Secret Manager for retrieving private keys is a good security practice. The defined strategies cover a good range of betting approaches.

Consider parameterizing the start date (currently hardcoded as September 13, 2024) to make the script more flexible. You could add a command-line argument or environment variable to set this date. For example:
import argparse
from datetime import datetime

# At the beginning of the main block
parser = argparse.ArgumentParser(description='Analyze betting strategies')
parser.add_argument('--start-date', type=str, default='2024-09-13',
                    help='Start date for analysis (YYYY-MM-DD)')
args = parser.parse_args()

start_time = datetime.strptime(args.start_date, '%Y-%m-%d')
This change would allow users to easily specify different start dates when running the script.
prediction_market_agent_tooling/deploy/betting_strategy.py (1)

213-271: Add docstrings for the new class and its methods

Including docstrings for MaxAccuracyWithKellyScaledBetsStrategy and its methods will improve code readability and help others understand the strategy's purpose and implementation.

Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

Commits

Files that changed from the base of the PR and between c6ee7fe and 82c103b.

Files selected for processing (2)

examples/monitor/match_bets_with_langfuse_traces.py (2 hunks)
prediction_market_agent_tooling/deploy/betting_strategy.py (2 hunks)

Additional comments not posted (2)

prediction_market_agent_tooling/deploy/betting_strategy.py (2)

144-146: Good addition of __repr__ method for better debugging

The __repr__ method provides a useful string representation of the object, enhancing debugging and logging capabilities.

209-211: __repr__ method improves object representation

Adding the __repr__ method makes it easier to understand instances when printed or logged.

coderabbitai · 2024-09-23T13:44:03Z

examples/monitor/match_bets_with_langfuse_traces.py

+    print("# Agent Bet vs Simulated Bet Comparison")
+    for agent_name, private_key in agent_pkey_map.items():
+        print(f"\n## {agent_name}\n")
+        api_keys = APIKeys(BET_FROM_PRIVATE_KEY=private_key)
+
+        # Pick a time after pool token number is stored in OmenAgentMarket
+        start_time = datetime(2024, 9, 13)
+
+        langfuse = Langfuse(
+            secret_key=api_keys.langfuse_secret_key.get_secret_value(),
+            public_key=api_keys.langfuse_public_key,
+            host=api_keys.langfuse_host,
        )
+
+        traces = get_traces_for_agent(
+            agent_name=agent_name,
+            trace_name="process_market",
+            from_timestamp=start_time,
+            has_output=True,
+            client=langfuse,
+        )
+        process_market_traces: list[ProcessMarketTrace] = []
+        for trace in traces:
+            if process_market_trace := ProcessMarketTrace.from_langfuse_trace(trace):
+                process_market_traces.append(process_market_trace)
+
+        bets: list[ResolvedBet] = OmenAgentMarket.get_resolved_bets_made_since(
+            better_address=api_keys.bet_from_address,
+            start_time=start_time,
+            end_time=None,
+        )
+
+        # All bets should have a trace, but not all traces should have a bet
+        # (e.g. if all markets are deemed unpredictable), so iterate over bets
+        bets_with_traces: list[ResolvedBetWithTrace] = []
+        for bet in bets:
+            trace = get_trace_for_bet(bet, process_market_traces)
+            if trace:
+                bets_with_traces.append(ResolvedBetWithTrace(bet=bet, trace=trace))
+
+        print(f"Number of bets since {start_time}: {len(bets_with_traces)}\n")
+        if len(bets_with_traces) != len(bets):
+            raise ValueError(
+                f"{len(bets) - len(bets_with_traces)} bets do not have a corresponding trace"
+            )
+


LGTM! Consider improving error handling for mismatched bets and traces.

The data retrieval and processing logic is well-implemented. Matching bets with corresponding traces and checking for data integrity are crucial steps.

The current error handling for mismatched bets and traces could be improved. Instead of raising a ValueError, consider logging a warning and continuing with the available data. This approach would allow the script to process as much data as possible, even if some bets are missing traces.

Replace the following code:

if len(bets_with_traces) != len(bets): raise ValueError( f"{len(bets) - len(bets_with_traces)} bets do not have a corresponding trace" )

with:

import logging # ... (earlier in the file, set up logging) if len(bets_with_traces) != len(bets): missing_traces = len(bets) - len(bets_with_traces) logging.warning(f"{missing_traces} bets do not have a corresponding trace. Continuing with available data.") print(f"Warning: {missing_traces} bets do not have a corresponding trace. Continuing with available data.")

This change will allow the script to continue processing even if some bets are missing traces, providing more robust behavior.

coderabbitai · 2024-09-23T13:44:03Z

examples/monitor/match_bets_with_langfuse_traces.py

+        simulations: list[dict[str, Any]] = []
+
+        for strategy_idx, strategy in enumerate(strategies):
+            # "Born" agent with initial funding, simulate as if he was doing bets one by one.
+            starting_balance = 50.0
+            agent_balance = starting_balance
+            simulated_outcomes: list[SimulatedOutcome] = []
+
+            for bet_with_trace in bets_with_traces:
+                bet = bet_with_trace.bet
+                trace = bet_with_trace.trace
+                simulated_outcome = get_outcome_for_trace(
+                    strategy=strategy, trace=trace, market_outcome=bet.market_outcome
+                )
+                if simulated_outcome is None:
+                    continue
+                simulated_outcomes.append(simulated_outcome)
+                agent_balance += simulated_outcome.profit
+
+            total_bet_amount = sum([bt.bet.amount.amount for bt in bets_with_traces])
+            total_bet_profit = sum([bt.bet.profit.amount for bt in bets_with_traces])
+            total_simulated_amount = sum([so.size for so in simulated_outcomes])
+            total_simulated_profit = sum([so.profit for so in simulated_outcomes])
+            roi = 100 * total_bet_profit / total_bet_amount
+            simulated_roi = 100 * total_simulated_profit / total_simulated_amount
+
+            # At the beginning, add also the agent's current strategy.
+            if strategy_idx == 0:
+                simulations.append(
+                    {
+                        "strategy": "original",
+                        "bet_amount": total_bet_amount,
+                        "bet_profit": total_bet_profit,
+                        "roi": roi,
+                        # We don't know these for the original run.
+                        "start_balance": None,
+                        "end_balance": None,
+                    }
+                )
+
+            simulations.append(
+                {
+                    "strategy": repr(strategy),
+                    "bet_amount": total_simulated_amount,
+                    "bet_profit": total_simulated_profit,
+                    "roi": simulated_roi,
+                    "start_balance": starting_balance,
+                    "end_balance": agent_balance,
+                }
+            )
+
+        print(pd.DataFrame.from_records(simulations).to_markdown(index=False))


LGTM! Consider adding summary statistics for easier comparison.

The simulation logic and results presentation are well-implemented. The comparison of ROI between actual bets and various simulated strategies provides valuable insights.

To enhance the analysis of results, consider adding summary statistics to compare the performance of different strategies more easily. You could add the following code after the DataFrame is created:

df = pd.DataFrame.from_records(simulations) print(df.to_markdown(index=False)) # Add summary statistics print("\nSummary Statistics:") summary = df.groupby('strategy').agg({ 'roi': ['mean', 'std', 'min', 'max'], 'bet_amount': 'sum', 'bet_profit': 'sum' }).reset_index() summary.columns = ['strategy', 'mean_roi', 'std_roi', 'min_roi', 'max_roi', 'total_bet_amount', 'total_profit'] print(summary.to_markdown(index=False))

This addition will provide a concise overview of each strategy's performance, making it easier to compare and identify the most effective approaches.

coderabbitai · 2024-09-23T13:44:03Z

prediction_market_agent_tooling/deploy/betting_strategy.py

+    def adjust_bet_amount(
+        self, existing_position: Position | None, market: AgentMarket
+    ) -> float:
+        existing_position_total_amount = (
+            existing_position.total_amount.amount if existing_position else 0
+        )
+        return self.max_bet_amount + existing_position_total_amount
+


Ensure max_bet_amount enforces the intended maximum

In the adjust_bet_amount method, adding existing_position_total_amount to self.max_bet_amount might result in a total bet exceeding max_bet_amount when there's an existing position. If max_bet_amount is intended to cap the total bet amount, consider revising the calculation to enforce this limit.

coderabbitai · 2024-09-23T13:44:04Z

prediction_market_agent_tooling/deploy/betting_strategy.py

+        estimated_p_yes = float(answer.p_yes > 0.5)
+        confidence = 1.0
+
+        kelly_bet = (


Consider using actual estimated probabilities and confidence

Currently, estimated_p_yes is set to 1.0 or 0.0 based on whether answer.p_yes > 0.5, and confidence is hardcoded to 1.0. This may not fully utilize the model's probability estimates and confidence levels. To leverage the Kelly criterion effectively, consider using the actual answer.p_yes and answer.confidence values.

coderabbitai · 2024-09-23T13:44:04Z

prediction_market_agent_tooling/deploy/betting_strategy.py

+        adjusted_bet_amount = self.adjust_bet_amount(existing_position, market)
+        outcome_token_pool = check_not_none(market.outcome_token_pool)
+
+        # Fixed direction of bet, only use Kelly to adjust the bet size based on market's outcome pool size.
+        estimated_p_yes = float(answer.p_yes > 0.5)
+        confidence = 1.0
+
+        kelly_bet = (
+            get_kelly_bet_full(
+                yes_outcome_pool_size=outcome_token_pool[
+                    market.get_outcome_str_from_bool(True)
+                ],
+                no_outcome_pool_size=outcome_token_pool[
+                    market.get_outcome_str_from_bool(False)
+                ],
+                estimated_p_yes=estimated_p_yes,
+                max_bet=adjusted_bet_amount,
+                confidence=confidence,
+            )
+            if market.has_token_pool()
+            else get_kelly_bet_simplified(
+                adjusted_bet_amount,
+                market.current_p_yes,
+                estimated_p_yes,
+                confidence,
+            )
+        )
+
+        amounts = {
+            market.get_outcome_str_from_bool(kelly_bet.direction): TokenAmount(
+                amount=kelly_bet.size, currency=market.currency
+            ),
+        }
+        target_position = Position(market_id=market.id, amounts=amounts)
+        trades = self._build_rebalance_trades_from_positions(
+            existing_position, target_position, market=market
+        )
+        return trades


Refactor shared logic into a common method

The calculate_trades method contains logic similar to that in KellyBettingStrategy. To promote code reusability and reduce duplication, consider refactoring the common code for calculating the Kelly bet into a shared method or utility function.

Compare agent bet histories with kelly strategy

e66572a

evangriffiths marked this pull request as draft September 19, 2024 12:12

gabrielfior reviewed Sep 19, 2024

View reviewed changes

evangriffiths added 3 commits September 19, 2024 15:19

bugfix

e48aea3

mypy

48e8993

mypy

2325c4b

evangriffiths mentioned this pull request Sep 19, 2024

Simulate payout of prediction-prophet-gpt4 if we were using Kelly #381

Closed

evangriffiths marked this pull request as ready for review September 19, 2024 14:48

evangriffiths closed this Sep 19, 2024

Use dynamic max bet amount

b78489f

kongzii reopened this Sep 20, 2024

print final balance

0ab2ef5

coderabbitai bot reviewed Sep 20, 2024

View reviewed changes

evangriffiths added 4 commits September 20, 2024 10:14

Merge main

aeffaa9

tidy

2cbb236

comment out debug print

1212dca

Add util function to get pkeys from GCP

c6ee7fe

coderabbitai bot reviewed Sep 20, 2024

View reviewed changes

kongzii mentioned this pull request Sep 20, 2024

BettingStrategy.calculate_bet_amount_and_direction should be able to take wallet_balance arg #377

Closed

Simulate all strategies we have with different bet sizes (#424)

82c103b

kongzii approved these changes Sep 23, 2024

View reviewed changes

coderabbitai bot reviewed Sep 23, 2024

View reviewed changes

Merge branch 'main' into evan/simulate-kelly-bets

b34020b

evangriffiths merged commit e63f3e0 into main Sep 23, 2024
8 checks passed

evangriffiths deleted the evan/simulate-kelly-bets branch September 23, 2024 15:46

This was referenced Sep 24, 2024

Save more detailed bet details in strategy simulation script #429

Merged

Allow dynamic bet amounts based on market - remove 'adjust_bet_amount' #430

Merged

This was referenced Sep 24, 2024

Remove dynamic 'strategy getter function' on DeployableTraderAgent, add boolean params to each strategy to dynamically update bet amount #431

Closed

Fix examples/monitor/match_bets_with_langfuse_traces.py #450

Merged

This was referenced Oct 1, 2024

Add o1 agents to bet simulation script #453

Merged

Add max_price_impact parameter to KellyBettingStrategy #433

Merged

coderabbitai bot mentioned this pull request Oct 10, 2024

Auto-verify that buy trades wouldn't cause guaranted loss #488

Merged

coderabbitai bot mentioned this pull request Nov 7, 2024

545 strategy backtests add financial metrics #548

Merged

coderabbitai bot mentioned this pull request Nov 19, 2024

Use Optuna and KFold to find best strategy for agent #561

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compare agent bet histories with kelly strategy #419

Compare agent bet histories with kelly strategy #419

evangriffiths commented Sep 19, 2024 •

edited

Loading

coderabbitai bot commented Sep 19, 2024 •

edited

Loading

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Documentation and Community

evangriffiths commented Sep 19, 2024 •

edited

Loading

gabrielfior Sep 19, 2024

evangriffiths Sep 19, 2024

kongzii commented Sep 20, 2024 •

edited by evangriffiths

Loading

coderabbitai bot left a comment

coderabbitai bot Sep 20, 2024

coderabbitai bot left a comment

coderabbitai bot Sep 20, 2024

coderabbitai bot Sep 20, 2024

coderabbitai bot left a comment

coderabbitai bot Sep 23, 2024

coderabbitai bot Sep 23, 2024

coderabbitai bot Sep 23, 2024

coderabbitai bot Sep 23, 2024

coderabbitai bot Sep 23, 2024

		roi = 100 * total_bet_profit / total_bet_amount
		kelly_roi = 100 * total_kelly_profit / total_kelly_amount

	project_id: str = "582587111398", # Gnosis AI default project_id
	project_id: str,

Compare agent bet histories with kelly strategy #419

Compare agent bet histories with kelly strategy #419

Conversation

evangriffiths commented Sep 19, 2024 • edited Loading

coderabbitai bot commented Sep 19, 2024 • edited Loading

Walkthrough

Changes

Possibly related issues

Possibly related PRs

Suggested reviewers

Chat

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

evangriffiths commented Sep 19, 2024 • edited Loading

Agent Bet vs Theoretical Kelly Bet Comparison

DeployablePredictionProphetGPT4TurboFinalAgent

DeployablePredictionProphetGPT4TurboPreviewAgent

DeployablePredictionProphetGPT4oAgent

DeployableOlasEmbeddingOAAgent

DeployableKnownOutcomeAgent

gabrielfior Sep 19, 2024

Choose a reason for hiding this comment

evangriffiths Sep 19, 2024

Choose a reason for hiding this comment

kongzii commented Sep 20, 2024 • edited by evangriffiths Loading

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Sep 20, 2024

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Sep 20, 2024

Choose a reason for hiding this comment

coderabbitai bot Sep 20, 2024

Choose a reason for hiding this comment

coderabbitai bot left a comment

Choose a reason for hiding this comment

coderabbitai bot Sep 23, 2024

Choose a reason for hiding this comment

coderabbitai bot Sep 23, 2024

Choose a reason for hiding this comment

coderabbitai bot Sep 23, 2024

Choose a reason for hiding this comment

coderabbitai bot Sep 23, 2024

Choose a reason for hiding this comment

coderabbitai bot Sep 23, 2024

Choose a reason for hiding this comment

evangriffiths commented Sep 19, 2024 •

edited

Loading

coderabbitai bot commented Sep 19, 2024 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)

evangriffiths commented Sep 19, 2024 •

edited

Loading

kongzii commented Sep 20, 2024 •

edited by evangriffiths

Loading