Skip to the content.

💡 Watt-Seer-Compare

You stare at your energy bill. The numbers fluctuate month to month, but one question always lingers: Is this normal?”
You check the usage – hundreds, maybe thousands of kilowatt-hours (kWh). But what does that really mean? Compared to whom? Is your home an energy guzzler, a model of efficiency, or just… average?

It’s a question millions ask, often met with frustratingly little context. Raw smart meter data is overwhelming, and simple monthly totals hide the real story.

What if you could see how your home truly stacks up against others just like it? Maybe you live in a neighborhood with similar houses, but you don’t know your neighbors’ habits. Maybe the weather was weird last month. How does your specific energy usage pattern – driven by your appliances, your schedule, and your household size – compare to others like you?

That’s the quest behind Watt-Seer-Compare. We’re moving beyond simple averages to provide personalized, AI-powered energy benchmarking. Using detailed smart home data and the nuanced understanding of Google Gemini, this project finds your household’s “Energy Twins” and tells you why your usage patterns are similar or different.

👥 Team Members


🔍 The Problem: Drowning in Data, Starving for Context

Smart meters promise granular insights, tracking energy use down to the hour or minute. We get data points enriched with details:

But, rows upon rows of this data often lead to more confusion than clarity. How do you know if your AC usage on a hot August day was typical for a family of four in your climate zone?

Traditional comparisons often rely on crude filters (e.g., same month), missing the subtle interplay of factors that define a household’s energy signature.


🤖 Solution: Finding Your “Energy Twins” with AI

Watt-Seer-Compare tackles this challenge head-on. It doesn’t just filter data; it understands it. Here’s how:

Screenshot 2025-04-19 at 3 23 27 PM

1. Smart Summaries: We first distill the flood of raw data into concise, informative monthly summaries for each home, capturing total usage, temperature context, seasonality, household size, and key appliance contributions.

2. Semantic Understanding (Gemini Embeddings): This is where the magic happens. We use Google Gemini’s text-embedding-004 model to convert these textual summaries into rich numerical representations (embeddings). These aren’t just numbers; they capture the semantic meaning and context of each household’s monthly energy story.

3. Finding Your Peers (Cosine Similarity): With embeddings, we can mathematically compare how “alike” different households are based on their entire energy profile for the month using cosine similarity. This allows us to find the homes that are most similar to yours – your “Energy Twins”

4. Visual Comparisons and AI Explanation (Gemini LLM): We generate clear, intuitive plots. Here numbers and plots are great, but why are you similar or different? We feed your summary and your Twins’ summaries into Gemini (gemini-2.0-flash), prompting it to analyze the nuances and generate a plain-English explanation comparing your profile to your closest peers.

Screenshot 2025-04-20 at 8 05 35 AM


📊 Example Output:

Gain Unprecedented Clarity With Visual Insights!

Let’s follow Home 34’s journey in June 2023. After inputting the ID and month, Watt-Seer-Compare gets to work and presents…

  1. First, let’s look at target home’s individual profile and Total usage vs. the average of its Energy Twins:
Pie Chart
Caption: Target Appliance Breakdown
Avg Comparison
Caption: Target vs Avg Similar
  1. Comparison Context:
Individual Comparison
Caption: Target vs Individual Similar
Appliance Comparison
Caption: Top Appliance Comparison
  1. The AI Analyst’s Take:

LLM Comparison Summary:

Home 34’s energy consumption profile for June 2023 is highly similar to several other homes, primarily due to similar household sizes, seasonal context, and reliance on both heating and air conditioning. These homes exhibit total energy consumption within a relatively narrow range of 30 to 37 kWh. While the overall patterns are alike, variations in average temperature lead to differences in the specific usage of heating and cooling appliances. Furthermore, lifestyle differences are reflected in the varying consumption of appliances like lights, washing machines, and ovens, contributing to subtle distinctions between the homes despite their high similarity scores. The lower similarity homes have different household sizes, some being smaller and one being larger, and also have a wider range of temperatures and total consumption, contributing to their lower similarity scores.

Suddenly, your energy usage isn’t just a number; it’s a narrative placed within the context of genuinely comparable households.


🧪 Code Highlight: Generating the Embeddings

The core of finding “Energy Twins” lies in creating meaningful embeddings from the monthly summaries. Here’s a snippet of how we prepare the data and call the embedding model (using helper functions defined in the notebook):

1. Generating Monthly Text Summaries:

# (Inside create_monthly_summaries function)
summary_text = (
    f"Home ID {home_id} for Month {period}:\n"
    f"- Total Consumption: {total_kwh:.1f} kWh\n"
    f"- Household Size: {household_size}\n"
    f"- Average Temperature: {avg_temp}°C\n"
    f"- Season: {season}\n"
    f"- Appliance Breakdown: {appliance_summary}"
)
summary_records.append({..., "Summary Text": summary_text})

2. Finding Similar Homes via Embeddings:

# (Inside run_energy_comparison_analysis)
# Get embeddings for the month
embeddings, summaries = generate_embeddings_efficiently(...)

# Calculate similarity
top_similar, bottom_similar = get_embedding_similarities(
    target_home_id=target_home_id,
    all_embeddings=embeddings,
    k=k_similar
)

3. Prompting the LLM for Explanation:

# (Inside generate_llm_comparison)
prompt = f"""
**Context:** You are an expert energy analyst...
**Input Data:**
* **Target Home (ID: {target_home_id}) Summary:**\n    ```{target_summary}```
* **Most Similar Homes (Top {homes_added}):**\n{similar_homes_text}
**Task:** Analyze the provided summaries. **Think step-by-step**...
**Output:** Provide the final synthesized comparison summary...
"""
response = client.models.generate_content(model=model_name, contents=prompt, ...)

✨ Why This Matters: Beyond Simple Filters

This approach offers significant advantages:


🔮 Limitations & Future Directions..

Watt-Seer-Compare provides powerful context, but like any analysis, it has boundaries:

What’s Next?


🤝 From Numbers to Narratives:

Understand Your Energy Story

Stop wondering in the dark. By finding your “Energy Twins”, combining detailed data, semantic AI, visualizations, and natural language explanations, Watt-Seer-Compare demonstrates how AI can transform rows of energy data into a meaningful conversation about consumption patterns and empowers you to understand your unique energy story.

Ready to see how you stack up and meet your Energy Twins?


🚀 Dive Deeper: Explore the Code & Watch the Demo!

Explore the code, run it with the sample data, and imagine plugging in your own smart meter readings. Discover your own “Energy Twins” and gain clarity on your consumption.

image

👉 View the Watt-Seer-Compare Kaggle Notebook

📺 Watch our video walkthrough for a quick overview!

Your energy data holds insights waiting to be revealed. Let AI help you listen.


Dataset: Smart Home Energy Consumption Dataset on Kaggle

Concepts:


Questions/Feedback? Post your questions via Kaggle Comments or open an issue on our GitHubGitHub Issues