Player Search Feature Implementation Instructions

Context

You are an expert at UI/UX design and software front-end development and architecture. You are allowed to not know an answer, be uncertain, or disagree with your task. If any of these occur, halt your current process and notify the user immediately. You should not hallucinate. If you are unable to remember information, you are allowed to look it up again.

You are not allowed to hallucinate. You may only use data that exists in the files specified. You are not allowed to create new data if it does not exist in those files.

You MUST plan extensively before each function call, and reflect extensively on the outcomes of the previous function calls. DO NOT do this entire process by making function calls only, as this can impair your ability to solve the problem and think insightfully.

When writing code, your focus should be on creating new functionality that builds on the existing code base without breaking things that are already working. If you need to rewrite how existing code works in order to develop a new feature, please check your work carefully, and also pause your work and tell me (the human) for review before going ahead. We want to avoid software regression as much as possible.

I WILL REPEAT, WHEN UPDATING EXISTING CODE FILES, PLEASE DO NOT OVERWRITE EXISTING CODE, PLEASE ADD OR MODIFY COMPONENTS TO ALIGN WITH THE NEW FUNCTIONALITY. THIS INCLUDES SMALL DETAILS LIKE FUNCTION ARGUMENTS AND LIBRARY IMPORTS. REGRESSIONS IN THESE AREAS HAVE CAUSED UNNECESSARY DELAYS AND WE WANT TO AVOID THEM GOING FORWARD.

When you need to modify existing code (in accordance with the instruction above), please present your recommendation to the user before taking action, and explain your rationale.

If the data files and code you need to use as inputs to complete your task do not conform to the structure you expected based on the instructions, please pause your work and ask the human for review and guidance on how to proceed.

If you have difficulty finding mission critical updates in the codebase (e.g. .env files, data files) ask the user for help in finding the path and directory.

Objective

Follow the step-by-step process to build the Player Search feature (Task 1.2.2 from requirements.md). Start with a simple use case of displaying a UI component with the player's headshot, Instagram handle link, and a summary of their roster info. The goal is for the user to ask the app a question about a specific player and receive both a text summary and a visual UI component with information for that player.

Implementation Steps

Review Code Base: Familiarize yourself with the current project structure, particularly the Gradio app (gradio_app.py), existing components (components/), services, and utilities. Pay close attention to how the Game Recap feature was integrated.
Neo4j Update Script Creation:
- Create a new subfolder within ifx-sandbox/data/april_11_multimedia_data_collect/new_final_april 11/ specifically for the player data update script (e.g., neo4j_player_update/).
- Create a Python script (update_player_nodes.py) within this new subfolder.
- Use the existing script ifx-sandbox/data/april_11_multimedia_data_collect/new_final_april 11/neo4j_update/update_game_nodes.py as a reference for connecting to Neo4j and performing updates.
Neo4j Database Update:
- The script should read player data from ifx-sandbox/data/april_11_multimedia_data_collect/new_final_april 11/roster_april_11.csv.
- Update existing Player nodes in the Neo4j database. Do not create new nodes.
- Use the Player_id attribute as the primary key to match records in the CSV file with nodes in the graph database.
- Add the following new attributes to the corresponding Player nodes:
  - headshot_url
  - instagram_url
  - highlight_video_url (Note: Confirm if this specific column name exists in roster_april_11.csv or if it needs mapping).
- Implement verification steps within the script to confirm successful updates for each player.
- Report the number of updated nodes and any errors encountered.
- Pause and request user confirmation that the update completed successfully in the cloud interface before proceeding.
Player Component Development:
- Create a new component file (e.g., components/player_card_component.py).
- Design the component structure based on the requirements (headshot, name, potentially key stats, Instagram link). Use components/game_recap_component.py as a structural reference for creating a dynamic Gradio component.
- Ensure the component accepts player data (retrieved from Neo4j) as input.
- Implement responsive design and apply the established 49ers theme CSS.
LangChain Integration:
- Review existing LangChain integration in gradio_agent.py and cypher.py (and potentially tools/game_recap.py).
- Create a new file, potentially tools/player_search.py, for the player-specific LangChain logic.
- Define a new LangChain tool specifically for player search with a clear description so the agent recognizes when to use it.
- Implement text-to-Cypher query generation to retrieve player information based on natural language queries (e.g., searching by name, jersey number).
- Ensure the Cypher query retrieves all necessary attributes (name, headshot_url, instagram_url, relevant stats, etc.) using Player_id or Name for matching.
- The tool function should return both a text summary (generated by the LLM based on retrieved data) and the structured data needed for the UI component.
Gradio App Integration:
- Propose changes first: Before modifying gradio_app.py or related files, outline the necessary changes (e.g., adding a new placeholder for the player component, updating the chat processing function to handle player data, modifying event handlers) and request user approval.
- Import the new player search tool into gradio_agent.py and add it to the agent's tool list.
- Import the new player card component into gradio_app.py.
- Modify the main chat/response function in gradio_app.py to:
  - Recognize when the agent returns player data.
  - Extract the text summary and structured data.
  - Update the Gradio UI to display the player card component with the structured data.
  - Display the text summary in the chat interface.
- Ensure the player card component is initially hidden and only displayed when relevant data is available (similar to the game recap component).
- Update the "Clear Chat" functionality to also hide/reset the player card component.
Testing and Validation:
- Test the Neo4j update script thoroughly.
- Verify the LangChain tool correctly identifies player queries and generates appropriate Cypher.
- Test retrieving data for various players.
- Validate that the player card component renders correctly with different player data.
- Test the end-to-end flow in the Gradio app with various natural language queries about players.
- Check error handling for cases like player not found or ambiguous queries.

Data Flow Architecture

User submits a natural language query about a specific player.
LangChain agent processes the query and selects the Player Search tool (likely implemented in tools/player_search.py).
The tool generates a Cypher query to retrieve player data from Neo4j based on the user's query.
Neo4j returns the player data including attributes like name, position, headshot URL, Instagram URL, etc.
The tool receives the data, potentially uses an LLM to generate a text summary, and structures the data for the UI component.
The tool returns the text summary and structured data to the agent/Gradio app.
The Gradio app receives the response.
The player card component function is called with the structured data, generating the visual UI.
The UI component is displayed to the user, and the text summary appears in the chat.

Error Handling Strategy

Implement specific error handling for:
- Player not found in the database.
- Ambiguous player identification (e.g., multiple players with similar names).
- Missing required attributes in Neo4j (e.g., missing headshot_url).
- Database connection issues during query.
- Failures in rendering the UI component.
Provide user-friendly error messages in the chat interface.
Implement graceful degradation (e.g., show text summary even if the visual component fails).
Add logging for debugging player search queries and component rendering.

Performance Optimization

Optimize Neo4j Cypher queries for player search.
Consider caching frequently accessed player data if performance becomes an issue.
Ensure efficient loading of player headshot images in the UI component.

Failure Condition

If you are unable to complete any step after 3 attempts, immediately halt the process, document the failure point and reason, and consult with the user on how to continue. Do not proceed without resolution.

Success Criteria

Neo4j database successfully updated with new player attributes (headshot_url, instagram_url, etc.).
LangChain correctly identifies player search queries and retrieves accurate data.
The Player Card component renders correctly in the Gradio UI, displaying headshot, relevant info, and links.
User can query specific players using natural language and receive both text and visual responses.
Integration does not cause regressions in existing functionality (like Game Recap search).
Error handling functions correctly for anticipated issues.

Notes

Prioritize non-destructive updates to the Neo4j database.
Confirm the exact column names in roster_april_11.csv before scripting the Neo4j update.
Reuse existing patterns for agent tools, component creation, and Gradio integration where possible.
Document all changes, especially modifications to existing files like gradio_agent.py and gradio_app.py.
Test thoroughly after each significant step.