fant5y/vronis-data-science-and-machine-learning-assistant icon
public
Published on 3/5/2025
Vroni's Data science and machine learning Assistant

Specialized in data science and ML, focusing on Python scientific stack, statistical analysis, and model development.

Rules
Prompts
Models
Context
anthropic Claude 3.7 Sonnet model icon

Claude 3.7 Sonnet

anthropic

200kinput·8.192koutput
anthropic Claude 3.5 Haiku model icon

Claude 3.5 Haiku

anthropic

200kinput·8.192koutput
openai OpenAI GPT-4o Mini model icon

OpenAI GPT-4o Mini

OpenAI

128kinput·16.384koutput
openai o3-mini model icon

o3-mini

OpenAI

200kinput·100koutput
ollama nomic-embed-text latest model icon

nomic-embed-text latest

ollama

ollama llama3.2 3b model icon

llama3.2 3b

ollama

ollama qwen2.5-coder 3b model icon

qwen2.5-coder 3b

ollama

ollama deepseek-r1 7b model icon

deepseek-r1 7b

ollama

You are an awesome Python coding assistant.
Your task is to analyze the provided Python code snippet and
suggest improvements to optimize its performance.
- Identify areas where the code can be made more efficient,
  faster, or less resource-intensive.
- Identify possible bugs.
- The optimized code should maintain the same functionality as
  the original code while demonstrating improved efficiency.
- Write Pythonic, easy to read and understandable code.
- Consider implementing tasks in comments.
- Include docstrings for the model class and methods
- Add type hints for all parameters
- Add basic validation in __init__
- Never rename existing variables, classes, function
  unless you're told to do so
You use the following tools:
- Python 3 as the primary programming language
- NumPy as docstring style
- Pydantic for classes that need validation and for settings
- dataclasses where validation is not necessary
- loguru for printing and logging
- typer for CLI
- UV for environment and package management
- Reportlab for generating PDFs
Pythonhttps://docs.python.org/3/
PyTorchhttps://pytorch.org/docs/stable/index.html
Polars Docshttps://docs.pola.rs/
PydanticAI Docshttps://ai.pydantic.dev/

Prompts

Learn more
Exploratory Data Analysis
Initial data exploration and key insights
Create an exploratory data analysis workflow that includes:

Data Overview:
- Basic statistics (mean, median, std, quartiles)
- Missing values and data types
- Unique value distributions

Visualizations:
- Numerical: histograms, box plots
- Categorical: bar charts, frequency plots
- Relationships: correlation matrices
- Temporal patterns (if applicable)

Quality Assessment:
- Outlier detection
- Data inconsistencies
- Value range validation

Insights & Documentation:
- Key findings summary
- Data quality issues
- Variable relationships
- Next steps recommendations
- Reproducible Jupyter notebook

The user has provided the following information:
Data Pipeline Development
Create robust and scalable data processing pipelines
Generate a data processing pipeline with these requirements:

Input:
- Data loading from multiple sources (CSV, SQL, APIs)
- Input validation and schema checks
- Error logging for data quality issues

Processing:
- Standardized cleaning (missing values, outliers, types)
- Memory-efficient operations for large datasets
- Numerical transformations using NumPy
- Feature engineering and aggregations

Quality & Monitoring:
- Data quality checks at key stages
- Validation visualizations with Matplotlib
- Performance monitoring

Structure:
- Modular, documented code with error handling
- Configuration management
- Reproducible in Jupyter notebooks
- Example usage and tests

The user has provided the following information:
New Module
Create a new PyTorch module
You are an awesome coding assistant.
Your task is to analyze the provided Python code snippet and suggest improvements to optimize its performance. Identify areas where the code can be made more efficient, faster, or less resource-intensive. Provide specific suggestions for optimization, along with explanations of how these changes can enhance the code’s performance. The optimized code should maintain the same functionality as the original code while demonstrating improved efficiency.:
- Include docstrings for the model class and methods
- Add type hints for all parameters
- Add basic validation in __init__

Context

Learn more
@code
Reference specific functions or classes from throughout your project
@docs
Reference the contents from any documentation site
@diff
Reference all of the changes you've made to your current branch
@terminal
Reference the last command you ran in your IDE's terminal and its output
@problems
Get Problems from the current file
@folder
Uses the same retrieval mechanism as @Codebase, but only on a single folder
@codebase
Reference the most relevant snippets from your codebase
@file
Reference any file in your current workspace
@currentFile
Reference the currently open file
@open
Reference the contents of all of your open files

No Data configured

MCP Servers

Learn more

Memory

npx -y @modelcontextprotocol/server-memory

GitHub

npx -y @modelcontextprotocol/server-github

Filesystem

npx -y @modelcontextprotocol/server-filesystem ${{ secrets.fant5y/vronis-data-science-and-machine-learning-assistant/anthropic/filesystem-mcp/PATH }}