Spaces:

Kushagra13
/

InsightLoop

Sleeping

App Files Files Community

Kushagra13 commited on 21 days ago

Commit

aae8a37

verified ·

1 Parent(s): f4d8737

Upload 38 files

Browse files

Files changed (39) hide show

.gitattributes +6 -0
src/data.csv +31 -0
src/data_with_text.csv +77 -0
src/logo.png +3 -0
src/pages/newprod.py +451 -0
src/pages/persona.py +410 -0
src/pages/prt111.py +700 -0
src/personas.json +42 -0
src/review_files/ 1.txt +2 -0
src/review_files/ 2.txt +2 -0
src/review_files/ 3.txt +2 -0
src/review_files/ 4.txt +2 -0
src/review_files/ 5.txt +2 -0
src/review_files/ 6.txt +2 -0
src/review_files/ 7.txt +2 -0
src/review_files/ 8.txt +2 -0
src/review_files/ 9.txt +2 -0
src/review_files/.DS_Store +0 -0
src/review_files/10.txt +2 -0
src/review_files/11.txt +2 -0
src/review_files/12.txt +2 -0
src/review_files/13.txt +2 -0
src/review_files/14.txt +2 -0
src/review_files/15.txt +2 -0
src/review_files/16.txt +2 -0
src/review_files/17.txt +2 -0
src/review_files/18.txt +2 -0
src/review_files/19.txt +2 -0
src/review_files/20.txt +2 -0
src/review_files/21.png +0 -0
src/review_files/22.png +0 -0
src/review_files/23.png +0 -0
src/review_files/24.png +0 -0
src/review_files/25.png +0 -0
src/review_files/26.wav +3 -0
src/review_files/27.wav +3 -0
src/review_files/28.wav +3 -0
src/review_files/29.wav +3 -0
src/review_files/30.wav +3 -0

.gitattributes ADDED Viewed

	@@ -0,0 +1,6 @@

+src/logo.png filter=lfs diff=lfs merge=lfs -text
+src/review_files/26.wav filter=lfs diff=lfs merge=lfs -text
+src/review_files/27.wav filter=lfs diff=lfs merge=lfs -text
+src/review_files/28.wav filter=lfs diff=lfs merge=lfs -text
+src/review_files/29.wav filter=lfs diff=lfs merge=lfs -text
+src/review_files/30.wav filter=lfs diff=lfs merge=lfs -text

src/data.csv ADDED Viewed

	@@ -0,0 +1,31 @@

+unique_order_id,customer_id,product_name,price,review_file,timestamp
+1001,5001,Chocolate flavoured whey 1kg,1100,1.txt,2025-07-02 13:40:09
+1002,5002,Chocolate flavoured whey 1kg,1300,2.txt,2025-06-30 01:44:41
+1003,5003,Chocolate flavoured whey 1kg,1250,3.txt,2025-07-02 16:53:24
+1004,5004,Chocolate flavoured whey 1kg,1300,4.txt,2025-06-29 15:29:25
+1005,5005,Chocolate flavoured whey 1kg,1300,5.txt,2025-06-29 13:21:02
+1006,5006,Chocolate flavoured whey 1kg,1100,6.txt,2025-06-27 12:36:57
+1007,5007,Chocolate flavoured whey 1kg,1250,7.txt,2025-07-01 20:21:00
+1008,5008,Chocolate flavoured whey 1kg,1100,8.txt,2025-07-02 04:35:40
+1009,5009,Chocolate flavoured whey 1kg,1250,9.txt,2025-07-02 12:56:36
+1010,5010,Chocolate flavoured whey 1kg,1300,10.txt,2025-06-30 12:45:43
+1011,5011,Chocolate flavoured whey 1kg,1250,11.txt,2025-06-27 08:43:25
+1012,5012,Chocolate flavoured whey 1kg,1300,12.txt,2025-06-30 12:02:19
+1013,5013,Chocolate flavoured whey 1kg,1200,13.txt,2025-07-02 07:31:49
+1014,5014,Chocolate flavoured whey 1kg,1250,14.txt,2025-06-27 04:10:51
+1015,5015,Chocolate flavoured whey 1kg,1100,15.txt,2025-06-28 06:01:28
+1016,5016,Chocolate flavoured whey 1kg,1100,16.txt,2025-07-01 13:51:04
+1017,5017,Chocolate flavoured whey 1kg,1250,17.txt,2025-06-26 20:17:45
+1018,5018,Chocolate flavoured whey 1kg,1200,18.txt,2025-06-30 08:12:20
+1019,5019,Chocolate flavoured whey 1kg,1250,19.txt,2025-07-01 23:20:05
+1020,5020,Chocolate flavoured whey 1kg,1250,20.txt,2025-06-26 12:46:36
+1021,5021,Chocolate flavoured whey 1kg,1200,21.png,2025-06-29 12:06:12
+1022,5022,Chocolate flavoured whey 1kg,1200,22.png,2025-06-28 12:41:00
+1023,5023,Chocolate flavoured whey 1kg,1100,23.png,2025-06-30 00:08:33
+1024,5024,Chocolate flavoured whey 1kg,1300,24.png,2025-07-01 06:10:49
+1025,5025,Chocolate flavoured whey 1kg,1300,25.png,2025-06-26 05:24:20
+1026,5026,Chocolate flavoured whey 1kg,1200,26.wav,2025-06-28 15:00:32
+1027,5027,Chocolate flavoured whey 1kg,1200,27.wav,2025-06-28 06:51:25
+1028,5028,Chocolate flavoured whey 1kg,1300,28.wav,2025-06-26 01:31:07
+1029,5029,Chocolate flavoured whey 1kg,1100,29.wav,2025-06-30 04:45:59
+1030,5030,Chocolate flavoured whey 1kg,1250,30.wav,2025-06-26 15:09:14

src/data_with_text.csv ADDED Viewed

	@@ -0,0 +1,77 @@

+unique_order_id,customer_id,product_name,price,review_file,timestamp,review_text,polarity
+1001,5001,Chocolate flavoured whey 1kg,1100,1.txt,2025-07-02 13:40:09,"Mary says:
+Improved my stamina noticeably. I feel more energetic during workouts. Good consistency and not too sweet.",0.9996833801269531
+1002,5002,Chocolate flavoured whey 1kg,1300,2.txt,2025-06-30 01:44:41,"Christopher says:
+Great taste and mixes well. The flavor is okay, nothing special.",0.9987167119979858
+1003,5003,Chocolate flavoured whey 1kg,1250,3.txt,2025-07-02 16:53:24,"Maria says:
+I feel more energetic during workouts. Noticeable muscle recovery improvement.",0.9971129894256592
+1004,5004,Chocolate flavoured whey 1kg,1300,4.txt,2025-06-29 15:29:25,"Dawn says:
+Blends easily with water or milk. Very effective for post-workout nutrition. Slight aftertaste but manageable. Improved my stamina noticeably.",0.9991833567619324
+1005,5005,Chocolate flavoured whey 1kg,1300,5.txt,2025-06-29 13:21:02,"Amy says:
+Perfect for daily supplementation. Could have a few more flavor options. No digestive issues so far. Very effective for post-workout nutrition.",0.999066174030304
+1006,5006,Chocolate flavoured whey 1kg,1100,6.txt,2025-06-27 12:36:57,"Stephanie says:
+Bit pricey for the quantity. I feel more energetic during workouts. The flavor is okay, nothing special.",0.9735181331634521
+1007,5007,Chocolate flavoured whey 1kg,1250,7.txt,2025-07-01 20:21:00,"Janet says:
+Mild smell, not unpleasant though. Noticeable muscle recovery improvement. Very effective for post-workout nutrition. I feel more energetic during workouts.",0.997998058795929
+1008,5008,Chocolate flavoured whey 1kg,1100,8.txt,2025-07-02 04:35:40,"Robin says:
+Good consistency and not too sweet. Works fine if you’re consistent.",0.9997614026069641
+1009,5009,Chocolate flavoured whey 1kg,1250,9.txt,2025-07-02 12:56:36,"Cynthia says:
+Not as filling as expected. Serving scoop could be better marked. Noticeable muscle recovery improvement. Improved my stamina noticeably.",0.7413376569747925
+1010,5010,Chocolate flavoured whey 1kg,1300,10.txt,2025-06-30 12:45:43,"Stanley says:
+Good consistency and not too sweet. Very effective for post-workout nutrition. Packaging could be more durable.",0.9992110729217529
+1011,5011,Chocolate flavoured whey 1kg,1250,11.txt,2025-06-27 08:43:25,"Kyle says:
+Good consistency and not too sweet. Blends easily with water or milk.",0.9993370175361633
+1012,5012,Chocolate flavoured whey 1kg,1300,12.txt,2025-06-30 12:02:19,"Angela says:
+Bit pricey for the quantity. Good consistency and not too sweet. Texture is decent, not too gritty. Helped me maintain my protein intake.",0.9987452030181885
+1013,5013,Chocolate flavoured whey 1kg,1200,13.txt,2025-07-02 07:31:49,"Todd says:
+Could have a few more flavor options. Perfect for daily supplementation.",0.9952136278152466
+1014,5014,Chocolate flavoured whey 1kg,1250,14.txt,2025-06-27 04:10:51,"Thomas says:
+No digestive issues so far. Bit pricey for the quantity. Could have a few more flavor options.",-0.9891797304153442
+1015,5015,Chocolate flavoured whey 1kg,1100,15.txt,2025-06-28 06:01:28,"Julia says:
+Noticeable muscle recovery improvement. Blends easily with water or milk. Blends easily with water or milk. Seems effective but too early to judge.",-0.798577070236206
+1016,5016,Chocolate flavoured whey 1kg,1100,16.txt,2025-07-01 13:51:04,"Evelyn says:
+Perfect for daily supplementation. Perfect for daily supplementation. The flavor is okay, nothing special. Blends easily with water or milk.",0.962000846862793
+1017,5017,Chocolate flavoured whey 1kg,1250,17.txt,2025-06-26 20:17:45,"Bob says:
+Blends easily with water or milk. Perfect for daily supplementation. Clumps if not shaken properly.",0.986446738243103
+1018,5018,Chocolate flavoured whey 1kg,1200,18.txt,2025-06-30 08:12:20,"Richard says:
+Great taste and mixes well. Not as filling as expected. Helped me maintain my protein intake. Very effective for post-workout nutrition.",0.9984906911849976
+1019,5019,Chocolate flavoured whey 1kg,1250,19.txt,2025-07-01 23:20:05,"Aaron says:
+Good consistency and not too sweet. Great taste and mixes well.",0.9998080134391785
+1020,5020,Chocolate flavoured whey 1kg,1250,20.txt,2025-06-26 12:46:36,"Shelby says:
+Noticeable muscle recovery improvement. Noticeable muscle recovery improvement.",0.9959927797317505
+1021,5021,Chocolate flavoured whey 1kg,1200,21.png,2025-06-29 12:06:12,"Must buy!
+It's a genuine product . I'm a beginner to to the gym . It's been 6 months since | have joined the gym . | have used it twice on the day it got delivered as
+the taste is just awesome . Best ever taste as if I'm having some chocolate smoothie. It's just too good . Great product just goo for it",0.999661922454834
+1022,5022,Chocolate flavoured whey 1kg,1200,22.png,2025-06-28 12:41:00,"Wonderful
+Those who are complaining about taste i don't understand what is wrong it's too good i liked chocolate cream just gonna update result i feel gas after
+having 3 scoop but i won't consume this much now let me update results",-0.9602248668670654
+1023,5023,Chocolate flavoured whey 1kg,1100,23.png,2025-06-30 00:08:33,"Mind-blowing purchase
+Dude this is amazing, not only gives tha macros also keeps you tummy full for sometimes. | take it 200 ml of milk one scoop daily. | am a Mixed Martial
+Artists so i don't really have goal to show my muscles, so | can't point out goods and bads about this specific protein from others. Although works good
+for me.",0.9985721111297607
+1024,5024,Chocolate flavoured whey 1kg,1300,24.png,2025-07-01 06:10:49,"Value-for-money
+So this protein powder is best for who's looking to lean bulk
+Mixabiltity - 9/10
+| bought chocolate flavor it's got a good taste - 8/10
+You won't get results quickly, you need to maintain the food also and balanced nutrition
+And it's also lab tested and also hygiene without any side effects and no digestive problems with the product
+Yeah It maintains body at a balanced weight
+And sure thing is you can buy this without hesitating",0.8845929503440857
+1025,5025,Chocolate flavoured whey 1kg,1300,25.png,2025-06-26 05:24:20,"Terrific purchase
+Very nice product with authenticity code, complete amino profile mentioned, filtration method mentioned.taste is very nice slightly sweeter. Overall good
+product for beginners. But it has blotting and stomach upset issue",0.9798041582107544
+1026,5026,Chocolate flavoured whey 1kg,1200,26.wav,2025-06-28 15:00:32,good morning sir thank you for calling what can I assist you with today I have been using the protein powder for a little over a week and I just wanted to give him give some quick feedback about it because I would like to hear experience on it so first of I appreciate the fact that it's not too heavy some powder is make me feel very over full or uncomfortable this but this one feels light and easy to digest but there are any other areas on which we could improve packaging it's very hard to scope the poverty and the container starts running low a wider container or the longest cook can handle it thanks for your valuable suggestions anything else that's it thank you,-0.9288781881332397
+1027,5027,Chocolate flavoured whey 1kg,1200,27.wav,2025-06-28 06:51:25,good morning sir thank you for calling how can I assist you today I just wanted to give some feedback about the delivery experience it wasn't great honestly I am sorry to hear that what happened exactly can you elaborate so I placed the order and I got an estimated delivery date but it arrive nearly five days late with no updates in between I kept checking the tracking and it didn't move for 3 days I'm really sorry for the inconvenience cost that's definitely not ideal for our system it's ok understand the delay is happen but I think better communication would have helped it email or a text update would made a big difference if the order and come through properly we really appreciate your feedback and will pass this on to a Logistic team,-0.9972971081733704
+1028,5028,Chocolate flavoured whey 1kg,1300,28.wav,2025-06-26 01:31:07,good afternoon how may I help you today looking for Protein powder for a couple of weeks now and wanted to share some feedback it was about the pricing of course please go ahead the product itself is decent the taste is taste and flexibility of fine but to be honest I feel it's a bit on the expensive site for what it offers I will similar products before but they were slightly more affordable and better as well I see would you say the quality justify the price that is the thing it's not bad but not exceptional either but you could have offered the bundle pricing or a loyalty discount I would probably continue but at the current price it's a bit hard to manage all the expenses with it thank you for really helpful,0.9790260195732117
+1029,5029,Chocolate flavoured whey 1kg,1100,29.wav,2025-06-30 04:45:59,good morning sir thank you for calling what can I assist you with today I have been using the protein powder for a little over a week and I just wanted to give him give some quick feedback about it because I would like to hear experience on it so first of I appreciate the fact that it's not too heavy some powder is make me feel very over full or uncomfortable this but this one feels light and easy to digest but there are any other areas on which we could improve packaging it's very hard to scope the poverty and the container starts running low a wider container or the longest cook can handle it thanks for your valuable suggestions anything else that's it thank you,-0.9288781881332397
+1030,5030,Chocolate flavoured whey 1kg,1250,30.wav,2025-06-26 15:09:14,thank you for calling customer support how may I help you today looking for a Protein powder for about a week now and I just wanted to share a bit of a feedback it's not a major issue but I think it's worth mentioning of course I am happy to hear thoughts about it what seems to be a concern while the protein powder doesn't make as smoothly as I expected even when I use the Shaker bottle I still get a few small lumps it's not the end of the world but it makes the texture a bit of I understand your problem have you tried using a Blender or adding it gradually in the proper amount a little but not so much but I think the protein powder itself could be a little more final all the taste is alright not too bad but just a bit too sweet for me personally I know that subjective but maybe offering an unsuitable version could also be nice thank you for a valuable feedback anything else know that the working of find otherwise not digesting just thought I would share it could be more improved in the future batches thank you,0.9952450394630432

src/logo.png ADDED Viewed

Git LFS Details

SHA256: b8f4548b636974ef247e23a937a93173b731ee5a1c1edefd290737cc1c34da95
Pointer size: 131 Bytes
Size of remote file: 125 kB

src/pages/newprod.py ADDED Viewed

	@@ -0,0 +1,451 @@

+import streamlit as st
+import json
+import os
+import numpy as np
+import plotly.graph_objs as go
+from groq import Groq
+from dotenv import load_dotenv
+load_dotenv()  # load .env file
+GROQ_API_KEY = os.environ.get("GROQ_API_KEY")
+# --- CONFIG ---
+GROQ_MODEL = "llama3-70b-8192"
+groq_client = Groq(api_key=GROQ_API_KEY)
+PERSONA_PATH = "personas.json"
+# --- THEME COLORS ---
+neon_blue = "#00fff7"
+neon_green = "#7CFC00"
+neon_pink = "#F72585"
+neon_cyan = "#0ffcff"
+neon_bg = "#181830"
+neon_orange = "#FFB347"
+neon_shadow = "#2dfdff44"
+font_main = "Inter, Segoe UI, Arial, sans-serif"
+st.set_page_config(page_title="🚀 New Launch Studio", layout="wide", initial_sidebar_state="collapsed")
+# --- STYLE ---
+st.markdown(f"""
+    <style>
+    html, body, [class*="css"] {{
+        background-color: {neon_bg} !important;
+        font-family: {font_main} !important;
+    }}
+    /* HEADERS */
+    .neon-title {{
+        font-size:2.8rem; font-weight:900; color:{neon_blue};
+        letter-spacing:0.02em; margin-bottom:7px; margin-top:6px;
+        text-shadow:0 2px 24px {neon_blue}33;
+    }}
+    .neon-sub {{
+        font-size:1.25rem;font-weight:600;color:#fff;
+        margin-bottom:2px;margin-top:0px;
+    }}
+    .neon-heads-up {{
+        font-size:1.08rem;color:{neon_pink};font-weight:700;margin-bottom:32px;
+        margin-top:7px;
+    }}
+    /* BUTTONS */
+    .neon-btn {{
+        display:inline-block;
+        font-weight:bold;
+        padding:13px 32px;
+        border:none;
+        border-radius:13px;
+        font-size:1.10em;
+        margin-right:16px;
+        cursor:pointer;
+        box-shadow:0 0 13px {neon_blue}55;
+        color:#1d1d1d !important;
+        background:linear-gradient(90deg,{neon_green}, {neon_blue});
+        text-decoration:none !important;
+        transition:transform 0.10s;
+    }}
+    .neon-btn-pink {{
+        background:linear-gradient(90deg,{neon_pink}, {neon_blue});
+        color:#fff !important;
+        box-shadow:0 0 16px {neon_pink}88;
+    }}
+    .neon-btn:hover {{ transform:scale(1.06); }}
+    /* PERSONA NAME BOX */
+    .persona-name-box {{
+        background: linear-gradient(90deg, {neon_blue}, {neon_pink} 80%);
+        color: #15192A;
+        font-size:2.2rem;
+        font-weight:900;
+        border-radius:28px;
+        padding: 12px 40px 10px 25px;
+        margin-bottom:15px;
+        display: inline-block;
+        box-shadow: 0 2px 26px {neon_cyan}99;
+        letter-spacing:0.01em;
+        margin-top:18px;
+    }}
+    /* PERSONA CARD CONTENTS */
+    .persona-section-row {{
+        display: flex;
+        gap: 2.5em;
+        margin-bottom: 0;
+    }}
+    .persona-section-col {{
+        flex: 1;
+        min-width: 340px;
+    }}
+    /* LABELS */
+    .block-label {{
+        font-weight:900;
+        font-size:1.15em;
+        margin-bottom:8px;
+        margin-top:8px;
+        letter-spacing:0.01em;
+        display:flex;
+        align-items:center;
+        gap:0.6em;
+    }}
+    .label-blue {{ color:{neon_blue}; }}
+    .label-green {{ color:{neon_green}; }}
+    .label-pink {{ color:{neon_pink}; }}
+    .label-orange {{ color:{neon_orange}; }}
+    .label-cyan {{ color:{neon_cyan}; }}
+    /* BULLET LISTS */
+    ul.insight-list {{
+        margin-top:7px; margin-bottom:16px;
+        padding-left:22px;
+    }}
+    ul.insight-list li {{
+        font-size:1.11em; font-weight:500; color:#fff;
+        margin-bottom:5px; line-height:1.53;
+    }}
+    /* INTEREST & NOTIF */
+    .interest-badge {{
+        display:inline-block;
+        background:linear-gradient(90deg, {neon_green}, {neon_blue} 90%);
+        color:#15192A; font-size:1.09em; font-weight:900;
+        border-radius:15px; padding:8px 30px 7px 18px;
+        margin-right:14px;
+        box-shadow:0 0 17px {neon_green}2c;
+        margin-top:10px;
+    }}
+    .notification-block {{
+        background:linear-gradient(90deg,{neon_cyan}44,#232344 96%);
+        border-left:5px solid {neon_blue};
+        padding:17px 23px 17px 23px;
+        border-radius:14px;
+        font-weight:700;
+        color:{neon_blue};
+        font-size:1.06em;
+        line-height:1.45;
+        box-shadow:0 2px 18px {neon_cyan}1a;
+        margin-bottom:8px;
+        margin-top:10px;
+        letter-spacing:0.01em;
+        max-width:430px;
+        min-width: 240px;
+        display: inline-block;
+    }}
+    /* CHART/INSIGHT CARDS */
+    .section-card {{
+        background:rgba(23,28,49,0.97);
+        border-radius: 17px;
+        box-shadow:0 0 22px {neon_cyan}32;
+        padding: 34px 42px 22px 42px;
+        margin-bottom:36px;
+        margin-top:16px;
+    }}
+    /* COMBINED INSIGHTS */
+    .insight-box {{
+        background:rgba(23,28,49,0.98);
+        border-radius: 18px;
+        box-shadow:0 0 26px {neon_blue}45;
+        padding: 32px 34px 18px 34px;
+        margin-bottom:33px;
+        margin-top:20px;
+    }}
+    /* SUMMARY BOX */
+    .summary-box {{
+        background:rgba(23,28,49,0.97);
+        border-radius: 15px;
+        box-shadow:0 0 22px {neon_green}32;
+        padding: 32px 38px 22px 38px;
+        margin-bottom:36px;
+        margin-top:14px;
+        color:#fff;
+        font-size:1.17em;
+    }}
+    /* RESPONSIVE */
+    @media (max-width: 1000px) {{
+      .persona-section-row {{ flex-direction: column; }}
+      .persona-section-col {{ min-width: 100%; }}
+    }}
+    </style>
+""", unsafe_allow_html=True)
+# --- TITLE & DESCRIPTION ---
+st.markdown(f"<div class='neon-title'>🚀 New Launch Studio</div>", unsafe_allow_html=True)
+st.markdown(f"<div class='neon-sub'>Will your next product idea actually vibe with your audience? Pop your concept below and instantly see what your customer personas think—no fluff, just punchy, actionable feedback and a reality check on your launch.</div>", unsafe_allow_html=True)
+st.markdown(f"<div class='neon-heads-up'>⚡ Heads up: Our demo and market data is based on protein powder reviews—so for best results, enter a health, nutrition, or supplement product!</div>", unsafe_allow_html=True)
+# --- NAVIGATION BUTTONS ---
+st.markdown(f"""
+<div style="display:flex;gap:2em;justify-content:flex-start;margin-bottom:6px;">
+    <a href="/prt111" class="neon-btn" target="_self">🏠 Home</a>
+    <a href="/persona" class="neon-btn neon-btn-pink" target="_self">👤 Persona Analysis</a>
+</div>
+""", unsafe_allow_html=True)
+# --- PRODUCT DESCRIPTION INPUT ---
+st.markdown(f"<h2 style='color:{neon_blue};font-size:2.04rem;font-weight:900;margin-top:30px;margin-bottom:7px;'>1. Describe Your New Product</h2>", unsafe_allow_html=True)
+product_desc = st.text_area(
+    "",
+    height=110,
+    placeholder="E.g. Introducing VanillaWhey: zero sugar, 25g protein, added digestive enzymes, eco-packaging, smooth vanilla flavor, perfect for fitness and daily wellness."
+)
+# --- LOAD PERSONAS ---
+if os.path.exists(PERSONA_PATH):
+    with open(PERSONA_PATH, "r", encoding="utf-8") as f:
+        personas = json.load(f)
+else:
+    personas = []
+    st.warning("No personas found. Please generate personas first in the Persona Analysis page.")
+def clean_points(text, max_points=2):
+    lines = [l for l in text.replace('\r', '\n').split('\n') if l.strip() and not l.strip().lower().startswith(
+        ('here is', 'here are', 'persona:', 'this is', 'for this persona', 'concerns:', 'the following', 'alignment:', '*', 'point'))]
+    points = []
+    for l in lines:
+        l = l.lstrip('-•1234567890. ').strip()
+        if l and len(points) < max_points:
+            points.append(l)
+    return points if points else [text.strip()]
+def ai_points(prompt, max_points=2, max_tokens=120):
+    try:
+        chat_completion = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system",
+                 "content": f"You are a market research strategist. Reply with ONLY exactly {max_points} very brief, but fully written bullet points—no intros, no repetition, no generic phrases. Each point should be a full, clear sentence. Never add 'Here are' or any extra intro. Dont mention any names."},
+                {"role": "user", "content": prompt}
+            ],
+            max_tokens=max_tokens, temperature=0.7, stop=None
+        )
+        return clean_points(chat_completion.choices[0].message.content.strip(), max_points)
+    except Exception as e:
+        return [f"Error: {e}"]
+def ai_notification(prompt, max_tokens=44):
+    try:
+        chat_completion = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system",
+                 "content": "You are a copywriter. Write a single, short, energetic notification or email (max 30 words, no names, no symbols), ending with a call-to-action. Make it stand out and complete."},
+                {"role": "user", "content": prompt}
+            ],
+            max_tokens=max_tokens, temperature=0.72, stop=None
+        )
+        return chat_completion.choices[0].message.content.strip().replace("**", "")
+    except Exception as e:
+        return f"Error: {e}"
+def ai_percent(prompt):
+    try:
+        chat_completion = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[{"role": "system", "content": "You are a market research strategist."}, {"role": "user", "content": prompt}],
+            max_tokens=8, temperature=0.25
+        )
+        s = chat_completion.choices[0].message.content.strip()
+        percent = ''.join([c for c in s if c.isdigit()])
+        return percent + "%" if percent else s
+    except Exception as e:
+        return "?"
+def ai_graph_insights(prompt, max_tokens=160):
+    try:
+        chat_completion = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system",
+                 "content": "You are a market analyst. Give only 4 numbered, very concise but meaningful insights in separate sentences, no intro line or extra formatting, no 'Here are', no asterisks or stars, just the facts."},
+                {"role": "user", "content": prompt}
+            ],
+            max_tokens=max_tokens, temperature=0.7, stop=None
+        )
+        # Always keep only 4, no prefix text
+        lines = [l.lstrip('-•1234567890. ').strip().replace("**", "") for l in chat_completion.choices[0].message.content.strip().split('\n') if l.strip()]
+        return lines[:4]
+    except Exception as e:
+        return [f"Error: {e}"]
+def ai_summary(prompt, max_tokens=90):
+    try:
+        chat_completion = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system",
+                 "content": "Write a concise, professional executive summary in 3 sentences. No intro lines, no 'Here is', no asterisks. Be direct and to the point."},
+                {"role": "user", "content": prompt}
+            ],
+            max_tokens=max_tokens, temperature=0.7, stop=None
+        )
+        return chat_completion.choices[0].message.content.strip().replace("**", "")
+    except Exception as e:
+        return f"Error: {e}"
+st.markdown("<div style='height:16px;'></div>", unsafe_allow_html=True)
+# --- GENERATE BUTTON ---
+test_btn = st.button(
+    "🚦 Run Persona–Product Fit Check",
+    help="Instantly see AI-powered feedback from every persona's perspective!",
+    use_container_width=True
+)
+st.markdown("<div style='height:12px;'></div>", unsafe_allow_html=True)
+if test_btn and product_desc and personas:
+    st.markdown(f"<h2 style='color:{neon_blue};font-size:2.23rem;font-weight:900;margin-bottom:12px;margin-top:17px;'>2. Persona-by-Persona Results</h2>", unsafe_allow_html=True)
+    persona_colors = [neon_blue, neon_green, neon_pink, neon_orange, neon_cyan]
+    persona_cycle = iter(persona_colors)
+    section_icons = {
+        "Probable Reaction": "💡",
+        "Alignment with Persona": "✅",
+        "Potential Mismatches or Concerns": "⚠️",
+        "Marketing Strategy": "📢",
+        "Personalized Notification": "🔔",
+    }
+    def persona_block(persona, color):
+        return st.container()
+    # Pair personas 2 per row
+    for i in range(0, len(personas), 2):
+        cols = st.columns(2, gap="large")
+        for j, col in enumerate(cols):
+            if i + j < len(personas):
+                persona = personas[i + j]
+                color = next(persona_cycle, neon_blue)
+                with col:
+                    st.markdown(f"<div class='persona-name-box' style='background:linear-gradient(90deg,{neon_blue},{neon_pink} 80%);margin-bottom:16px;'><span>{persona.get('icon','')} {persona['name']}</span></div>", unsafe_allow_html=True)
+                    st.markdown(f"<div style='height:4px;'></div>", unsafe_allow_html=True)
+                    st.markdown("<div class='persona-section-row'>", unsafe_allow_html=True)
+                    st.markdown("<div class='persona-section-col'>", unsafe_allow_html=True)
+                    st.markdown(f"<div class='block-label label-blue'>{section_icons['Probable Reaction']} Probable Reaction</div>", unsafe_allow_html=True)
+                    reactions = ai_points(
+                        f"Summarize two brief but complete points for this persona's likely reaction to the product: {product_desc}. Use clear, direct language.",
+                        max_points=2, max_tokens=90
+                    )
+                    st.markdown(f"<ul class='insight-list'>" + "".join([f"<li>{r}</li>" for r in reactions]) + "</ul>", unsafe_allow_html=True)
+                    st.markdown(f"<div class='block-label label-green'>{section_icons['Alignment with Persona']} Alignment with Persona</div>", unsafe_allow_html=True)
+                    aligns = ai_points(
+                        f"List two specific ways this persona's characteristics or needs will match with the features or benefits of the product: {product_desc}. "
+                        f"Be explicit: mention which part of the persona is satisfied by which product feature. Use clear, direct language.",
+                        max_points=2, max_tokens=100
+                    )
+                    st.markdown(f"<ul class='insight-list'>" + "".join([f"<li>{a}</li>" for a in aligns]) + "</ul>", unsafe_allow_html=True)
+                    st.markdown("</div>", unsafe_allow_html=True)
+                    st.markdown("<div class='persona-section-col'>", unsafe_allow_html=True)
+                    st.markdown(f"<div class='block-label label-pink'>{section_icons['Potential Mismatches or Concerns']} Potential Mismatches or Concerns</div>", unsafe_allow_html=True)
+                    mismatches = ai_points(
+                        f"List two precise concerns or mismatches: Which features or aspects of the {product_desc} may NOT align with this persona's preferences or needs? "
+                        f"Be explicit: mention which product feature is likely to be a turn-off or ignored by this persona.",
+                        max_points=2, max_tokens=100
+                    )
+                    st.markdown(f"<ul class='insight-list'>" + "".join([f"<li>{m}</li>" for m in mismatches]) + "</ul>", unsafe_allow_html=True)
+                    st.markdown(f"<div class='block-label label-orange'>{section_icons['Marketing Strategy']} Marketing Strategy</div>", unsafe_allow_html=True)
+                    strategy = ai_points(
+                        f"Suggest two creative, product-specific marketing strategies targeted at this persona for this product: {product_desc}. "
+                        f"Each point must clearly connect a product feature with a unique marketing approach for this persona.",
+                        max_points=2, max_tokens=100
+                    )
+                    st.markdown(f"<ul class='insight-list'>" + "".join([f"<li>{s}</li>" for s in strategy]) + "</ul>", unsafe_allow_html=True)
+                    st.markdown("</div>", unsafe_allow_html=True)
+                    st.markdown("</div>", unsafe_allow_html=True)
+                    st.markdown(
+                        f"""
+                        <div style='display:flex;align-items:center;gap:22px;margin-top:12px;margin-bottom:22px;'>
+                            <span class='interest-badge'>Interest Likelihood: {ai_percent('Estimate the likelihood (percent) that '+persona['name']+' would be interested in this product. Just the number and % sign, nothing else.')}</span>
+                            <div>
+                                <div class='block-label label-cyan' style='margin-bottom:3px;'>{section_icons['Personalized Notification']} Personalized Notification</div>
+                                <div class='notification-block'>{ai_notification(
+                                    f"Write a concise, energetic notification or email about this product: {product_desc} aimed specifically at the persona {persona['name']}. "
+                                    f"Address their top motivations and finish with a strong call-to-action. No names, no symbols."
+                                )}</div>
+                        </div>
+                        """, unsafe_allow_html=True
+                    )
+                    st.markdown("<div style='height:4px;'></div>", unsafe_allow_html=True)
+    # --- CHARTS (Demo) ---
+    st.markdown(f"<h2 style='color:{neon_cyan};font-size:2.1rem;font-weight:800;margin-top:32px;'>3. Projected Market Impact</h2>", unsafe_allow_html=True)
+    persona_names = [p['name'] for p in personas]
+    np.random.seed(42)
+    projected_market_share = np.random.dirichlet(np.ones(len(persona_names)), size=1)[0]
+    projected_sentiment = projected_market_share * 0.6 + np.random.rand(len(persona_names)) * 0.4  # correlation
+    c1, c2 = st.columns(2)
+    with c1:
+        st.markdown(f"<div style='font-size:1.17em;color:{neon_blue};font-weight:700;margin-bottom:6px;'>Projected Market Share by Persona</div>", unsafe_allow_html=True)
+        fig1 = go.Figure(data=[go.Pie(labels=persona_names, values=projected_market_share, hole=0.45)])
+        fig1.update_traces(textinfo='percent+label')
+        fig1.update_layout(margin=dict(l=14, r=14, b=14, t=14), showlegend=True)
+        st.plotly_chart(fig1, use_container_width=True)
+    with c2:
+        st.markdown(f"<div style='font-size:1.17em;color:{neon_orange};font-weight:700;margin-bottom:6px;'>Projected Sentiment by Persona</div>", unsafe_allow_html=True)
+        fig2 = go.Figure(data=[go.Bar(x=persona_names, y=projected_sentiment,
+                                      marker=dict(color=[neon_green, neon_blue, neon_pink, neon_orange, neon_cyan][:len(persona_names)]))])
+        fig2.update_layout(xaxis_title="Persona", yaxis_title="Projected Sentiment", font=dict(size=15))
+        st.plotly_chart(fig2, use_container_width=True)
+    # --- Combined Chart Insights ---
+    combined_prompt = (
+        f"Given the projected market share {list(np.round(projected_market_share*100,1))} percent and projected sentiment {list(np.round(projected_sentiment*100,1))} for these personas: {', '.join(persona_names)}, "
+        "summarize 4 concise points that correlate the two charts and reveal the most important market insights. Each point should be in a new line and fully written."
+    )
+    insights = ai_graph_insights(combined_prompt, max_tokens=200)
+    st.markdown(
+        f"<div class='insight-box'><div style='font-size:1.18em;color:{neon_blue};font-weight:700;margin-bottom:10px;'>Key Combined Insights</div>"
+        f"<ul class='insight-list'>" + "".join([f"<li>{bp}</li>" for bp in insights]) + "</ul></div>", unsafe_allow_html=True
+    )
+    # --- OVERALL SUMMARY ---
+    st.markdown(f"<h2 style='color:{neon_green};font-size:2rem;font-weight:900;margin-top:18px;'>4. Overall Summary</h2>", unsafe_allow_html=True)
+    overall_prompt = (
+        f"Given these personas: {', '.join([p['name'] for p in personas])}, and the new product: {product_desc}, "
+        "write a concise executive summary (3 sentences, no intro, no asterisks), focusing on overall fit, the main challenge, and the best next move for launch."
+    )
+    summary_text = ai_summary(overall_prompt, max_tokens=1000)
+    st.markdown(
+        f"<div class='summary-box'>{summary_text}</div>",
+        unsafe_allow_html=True
+    )
+    st.markdown("---")
+elif test_btn:
+    st.warning("Please enter your product description to see the results.")
+# --- FOOTER ---
+st.markdown(
+    f"<small style='color:{neon_pink};font-size:1.09em;'>Powered by Bugs Fring</small>",
+    unsafe_allow_html=True
+)

src/pages/persona.py ADDED Viewed

	@@ -0,0 +1,410 @@

+import streamlit as st
+import pandas as pd
+import numpy as np
+import os
+import re
+from groq import Groq
+import plotly.graph_objs as go
+from collections import defaultdict
+from itertools import cycle
+import json
+from dotenv import load_dotenv
+PERSONA_PATH = "personas.json"
+# --- THEME COLORS ---
+neon_blue = "#00fff7"
+neon_green = "#7CFC00"
+neon_pink = "#F72585"
+neon_yellow = "#FFF600"
+neon_bg = "#181830"
+neon_orange = "#FFB347"
+neon_dark = "#202037"
+load_dotenv()  # load .env file
+GROQ_API_KEY = os.environ.get("GROQ_API_KEY")
+# --- CONFIG ---
+GROQ_MODEL = "llama3-70b-8192"
+groq_client = Groq(api_key=GROQ_API_KEY)
+PRODUCT_CONTEXT = (
+    "You are an AI market research expert analyzing customer reviews for a chocolate-flavoured whey protein powder. "
+    "Generate user personas based on patterns and diversity in the reviews."
+)
+CSV_PATH = "/Users/kushagraaatre/Downloads/Texpedition/data_with_text.csv"
+st.set_page_config(page_title="Persona Lab", layout="wide", initial_sidebar_state="collapsed")
+st.markdown(
+    "<h1 style='color:#00fff7;font-size:2.6rem;font-weight:900;letter-spacing:0.01em;margin-bottom:5px;'>🎭 Persona Lab</h1>",
+    unsafe_allow_html=True
+)
+st.markdown(
+    f"""
+    <div style="font-size:1.21rem; color:#AC7CFF; font-weight:600; margin-top:-13px; margin-bottom:14px; line-height:1.5;">
+        Ready to peek inside the minds of your customers?
+        This is your sandbox for uncovering who buys, why they rave, and what they crave—powered by real reviews and sharp AI.
+        Dive in, explore the personas that drive your market, and see your brand through their eyes (and taste buds)!
+    </div>
+    """,
+    unsafe_allow_html=True
+)
+# --- NAVIGATION BUTTONS ---
+st.markdown("""
+    <style>
+    .neon-btn {
+        display:inline-block;
+        font-weight:bold;
+        padding:14px 32px;
+        border:none;
+        border-radius:12px;
+        font-size:1.1em;
+        margin-right:18px;
+        cursor:pointer;
+        box-shadow:0 0 14px #00fff777;
+        color:#222 !important;
+        background:linear-gradient(90deg,#7CFC00,#00fff7);
+        text-decoration:none !important;
+        transition: transform 0.08s;
+    }
+    .neon-btn-pink {
+        background:linear-gradient(90deg,#F72585,#00fff7);
+        color:#fff !important;
+        box-shadow:0 0 14px #F7258577;
+    }
+    .neon-btn:hover {
+        transform:scale(1.04);
+        box-shadow:0 0 24px #00fff799;
+    }
+    .neon-btn-pink:hover {
+        box-shadow:0 0 24px #F7258599;
+    }
+    </style>
+""", unsafe_allow_html=True)
+st.markdown("""
+<div style="display:flex;gap:2em;justify-content:flex-start;">
+    <a href="/prt111" class="neon-btn"target="_self">🏠 Home</a>
+    <a href="/newprod" class="neon-btn neon-btn-pink"target="_self">🚀 New Product Launch</a>
+</div>
+<br>
+""", unsafe_allow_html=True)
+def block_markdown(text, color):
+    text = text.replace('\n', '<br>')
+    return (
+        f'<div style="background:linear-gradient(90deg,{color}22,#181830 90%);'
+        f'padding:16px 22px;border-radius:16px;margin:10px 0 24px 0;'
+        f'font-weight:600;color:#fff;font-size:1.04em;line-height:1.6;box-shadow:0 2px 24px {color}19;">'
+        f'{text}</div>'
+    )
+@st.cache_data(show_spinner=True)
+def load_reviews(csv_path):
+    if not os.path.exists(csv_path):
+        st.error(f"CSV file not found: {csv_path}")
+        return pd.DataFrame()
+    df = pd.read_csv(csv_path)
+    if "polarity" not in df.columns:
+        try:
+            from transformers import pipeline
+            sa = pipeline("sentiment-analysis", model="distilbert-base-uncased-finetuned-sst-2-english")
+            df["polarity"] = df["review_text"].apply(lambda x: 1 if sa(x)[0]["label"] == "POSITIVE" else -1)
+        except Exception as e:
+            st.warning("Could not compute sentiment scores. All reviews set to neutral (0).")
+            df["polarity"] = 0
+    if "review_length" not in df.columns:
+        df["review_length"] = df["review_text"].apply(lambda x: len(str(x).split()))
+    return df
+def generate_personas(review_texts, n_personas=4):
+    prompt = (
+        f"Read the following customer reviews for a chocolate-flavored whey protein powder. "
+        f"Based on the language, interests, and context, segment these users into {n_personas} distinct personas. "
+        "For each persona, provide:\n"
+        "1. Persona Name starting with emoji\n"
+        "2. A one-line summary\n"
+        "3. Five detailed bullet points describing their characteristics, needs, goals, or behaviors (each bullet should be specific and insightful, not generic).\n"
+        "Give the answer as a numbered list, one for each persona. Format:\n"
+        "1. [Emoji] Persona Name\nSummary: ...\n- ...\n- ...\n- ...\n- ...\n- ...\n"
+        "\nREVIEWS:\n" +
+        "\n".join(review_texts[:120])[:3600]
+    )
+    try:
+        chat_completion = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system", "content": PRODUCT_CONTEXT},
+                {"role": "user", "content": prompt}
+            ],
+            max_tokens=900,
+            temperature=0.6,
+        )
+        return chat_completion.choices[0].message.content.strip()
+    except Exception as e:
+        return f"Error generating personas: {e}"
+def parse_personas_bulletproof(llm_output, n=4):
+    lines = llm_output.splitlines()
+    persona_headers = []
+    for i, line in enumerate(lines):
+        if re.match(r"^([0-9]{1,2}[.)-]?\s*)?[\U0001F300-\U0001FAFF]", line.strip()):
+            persona_headers.append(i)
+    persona_blocks = []
+    for idx, start in enumerate(persona_headers):
+        end = persona_headers[idx+1] if idx+1 < len(persona_headers) else len(lines)
+        persona_blocks.append(lines[start:end])
+    personas = []
+    for block in persona_blocks[:n]:
+        name_line = re.sub(r"^([0-9]{1,2}[.)-]?\s*)?", "", block[0]).strip().replace("**", "")
+        summary = ""
+        bullets = []
+        for l in block[1:]:
+            l = l.strip()
+            if not l: continue
+            if not summary and ("summary" in l.lower() or not l.startswith(("-", "•", "*", "+"))):
+                summary = re.sub(r"^summary[:\- ]*", "", l, flags=re.I)
+            elif l.startswith(("-", "•", "*", "+")) or re.match(r"^[0-9]{1,2}[.)-]", l):
+                b = re.sub(r"^[-•*+0-9. ]+", "", l)
+                if b: bullets.append(b)
+        personas.append({
+            "name": name_line,
+            "summary": summary,
+            "bullets": bullets[:5]
+        })
+    return personas
+def assign_review_to_persona_tfidf(df, persona_defs):
+    # Use TF-IDF cosine similarity for assignment (faster than LLM for large data)
+    from sklearn.feature_extraction.text import TfidfVectorizer
+    persona_texts = [p["summary"] + " " + " ".join(p["bullets"]) for p in persona_defs]
+    tfidf = TfidfVectorizer(stop_words='english')
+    X = tfidf.fit_transform(df["review_text"].tolist() + persona_texts)
+    review_vecs = X[:-len(persona_texts)]
+    persona_vecs = X[-len(persona_texts):]
+    assignments = []
+    for i in range(review_vecs.shape[0]):
+        sims = review_vecs[i].dot(persona_vecs.T).toarray().flatten()
+        idx = np.argmax(sims)
+        assignments.append(persona_defs[idx]["name"])
+    return assignments
+def groq_bullets_persona(chart_desc, chart_data_text):
+    user_prompt = (
+        f"Summarize as exactly two bullet points the main insights for this chart: {chart_desc}. "
+        f"Here is the data: {chart_data_text}. "
+        "Provide a percentage if applicable. Just facts."
+    )
+    try:
+        chat_completion = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system", "content": PRODUCT_CONTEXT},
+                {"role": "user", "content": user_prompt}
+            ],
+            max_tokens=80,
+            temperature=0.5,
+        )
+        bullets = chat_completion.choices[0].message.content.strip()
+        points = [line for line in bullets.splitlines() if line.strip().startswith(("-", "•"))]
+        return "\n".join(points[:2]) if len(points) >= 2 else "- " + bullets
+    except Exception:
+        return "- Summary not available.\n- (LLM error)"
+# --- EMOTION PIPELINE (optional) ---
+def emotion_pipeline(df):
+    try:
+        from transformers import pipeline
+        emo = pipeline(
+            "text-classification",
+            model="finiteautomata/bertweet-base-emotion-analysis",  # much smaller than roberta-base!
+            top_k=None,
+            device=-1  # always use CPU, avoid meta-tensor bug
+        )
+    except Exception as e:
+        st.warning(f"Could not load emotion model, skipping emotion analysis: {e}")
+        df["main_emotion"] = "neutral"
+        return df
+    all_emotions = []
+    for t in df["review_text"]:
+        try:
+            emotions = emo(t[:512])
+            if isinstance(emotions, list) and len(emotions) and isinstance(emotions[0], list):
+                # Sometimes returns list of lists
+                emotions = emotions[0]
+            main_emo = sorted(emotions, key=lambda x: -x["score"])[0]["label"]
+        except Exception:
+            main_emo = "neutral"
+        all_emotions.append(main_emo)
+    df["main_emotion"] = all_emotions
+    return df
+# ========== MAIN PIPELINE ========== #
+with st.spinner("🔎 Analyzing your data... Please wait a few moments."):
+    df = load_reviews(CSV_PATH)
+    reviews = df["review_text"].dropna().tolist() if not df.empty else []
+    reviews = [t for t in reviews if "unreadable" not in t and "missing" not in t and t.strip()]
+    if reviews:
+        personas_raw = generate_personas(reviews, 4)
+        personas = parse_personas_bulletproof(personas_raw, 4)
+        if personas:
+            with open(PERSONA_PATH, "w", encoding="utf-8") as f:
+                json.dump(personas, f, ensure_ascii=False, indent=2)
+            st.session_state['personas'] = personas
+            st.success(f"{len(personas)} personas saved for next use.")
+    else:
+        personas = []
+    persona_colors = [neon_green, neon_blue, neon_pink, neon_orange]
+    persona_cycler = cycle(persona_colors)
+    persona_blocks = []
+    persona_names = []
+    # Persona grid (left-right)
+    if personas:
+        st.markdown("<br>", unsafe_allow_html=True)
+        grid_cols = st.columns(2)
+        for i, p in enumerate(personas):
+            c = next(persona_cycler)
+            col = grid_cols[i%2]
+            with col:
+                st.markdown(
+                    f"<div style='background:linear-gradient(90deg,{c}18,#181830 95%);"
+                    "padding:24px 26px 16px 26px;border-radius:18px;margin-bottom:24px;"
+                    f"box-shadow:0 2px 22px {c}22;'>"
+                    f"<h2 style='color:{c};margin-bottom:0.18em'>{p['name']}</h2>"
+                    f"<div style='color:#fff;font-size:1.15em;font-weight:500;margin-bottom:10px'>Summary: {p['summary']}</div>"
+                    f"<div style='color:{neon_pink};font-weight:700;font-size:1.08em;margin-bottom:2px'>Characteristics</div>"
+                    f"<ul style='font-size:1.02em;margin-top:3px'>{''.join([f'<li>{b}</li>' for b in p['bullets']])}</ul>"
+                    "</div>", unsafe_allow_html=True
+                )
+            persona_names.append(p["name"])
+        st.markdown("<hr>", unsafe_allow_html=True)
+    if personas and len(reviews) > 0:
+        # Assign reviews to persona via TF-IDF (fast)
+        persona_for_review = assign_review_to_persona_tfidf(df, personas)
+        df_reviews = df.copy()
+        df_reviews = df_reviews.iloc[:len(persona_for_review)].copy()
+        df_reviews["persona"] = persona_for_review
+        # --- Generate all summary stats for new graphs
+        # 1. Persona Review Share
+        persona_counts = df_reviews["persona"].value_counts()
+        # 2. Persona Sentiment
+        avg_sentiment = df_reviews.groupby("persona")["polarity"].mean()
+        # 3. Persona Review Length
+        avg_length = df_reviews.groupby("persona")["review_length"].mean()
+        # 4. Persona Emotion (optional)
+        if "main_emotion" not in df_reviews.columns:
+            df_reviews = emotion_pipeline(df_reviews)
+        emo_dist = df_reviews.groupby("persona")["main_emotion"].value_counts().unstack().fillna(0)
+        # --- Row 1: Pie and Sentiment Bar
+        c1, c2 = st.columns(2)
+        with c1:
+            st.markdown("<h3 style='color:#fff;font-size:2rem;font-weight:700;'>Sales/Review Share by Persona</h3>", unsafe_allow_html=True)
+            fig = go.Figure(data=[go.Pie(labels=persona_counts.index, values=persona_counts.values, hole=0.45)])
+            fig.update_traces(textinfo='percent+label')
+            st.plotly_chart(fig, use_container_width=True)
+            st.markdown(block_markdown(
+                groq_bullets_persona("Sales/Review Share by Persona", persona_counts.to_dict()), neon_green
+            ), unsafe_allow_html=True)
+        with c2:
+            st.markdown("<h3 style='color:#fff;font-size:2rem;font-weight:700;'>Average Sentiment by Persona</h3>", unsafe_allow_html=True)
+            fig2 = go.Figure(data=[go.Bar(x=avg_sentiment.index, y=avg_sentiment.values, marker=dict(color=[neon_green, neon_blue, neon_pink, neon_orange]))])
+            fig2.update_layout(xaxis_title="Persona", yaxis_title="Avg Sentiment", font=dict(size=15))
+            st.plotly_chart(fig2, use_container_width=True)
+            st.markdown(block_markdown(
+                groq_bullets_persona("Average Sentiment by Persona", avg_sentiment.to_dict()), neon_blue
+            ), unsafe_allow_html=True)
+        # --- Row 2: Review Length and Emotion Distribution
+        c3, c4 = st.columns(2)
+        with c3:
+            st.markdown("<h3 style='color:#fff;font-size:2rem;font-weight:700;'>Persona vs. Review Length Distribution</h3>", unsafe_allow_html=True)
+            fig3 = go.Figure(data=[go.Bar(x=avg_length.index, y=avg_length.values, marker=dict(color=[neon_green, neon_blue, neon_pink, neon_orange]))])
+            fig3.update_layout(xaxis_title="Persona", yaxis_title="Avg Review Length", font=dict(size=15))
+            st.plotly_chart(fig3, use_container_width=True)
+            st.markdown(block_markdown(
+                groq_bullets_persona("Average review length (words) by persona", avg_length.to_dict()), neon_orange
+            ), unsafe_allow_html=True)
+        with c4:
+            st.markdown("<h3 style='color:#fff;font-size:2rem;font-weight:700;'>Persona vs. Emotion Distribution</h3>", unsafe_allow_html=True)
+            fig4 = go.Figure()
+            for idx, em in enumerate(emo_dist.columns):
+                fig4.add_trace(go.Bar(name=em, x=emo_dist.index, y=emo_dist[em].values))
+            fig4.update_layout(barmode='stack', xaxis_title="Persona", yaxis_title="Emotion Count", font=dict(size=15))
+            st.plotly_chart(fig4, use_container_width=True)
+            st.markdown(block_markdown(
+                groq_bullets_persona("Distribution of primary emotions per persona", emo_dist.to_dict()), neon_pink
+            ), unsafe_allow_html=True)
+        # --- Persona-wise Highlights, grouped by persona with headings ---
+st.markdown("<hr><h2 style='color:#fff'>Persona-wise Sentiment Highlights & Recommendations</h2>", unsafe_allow_html=True)
+persona_grid = st.columns(2)
+for idx, p in enumerate(personas):
+    persona_df = df_reviews[df_reviews["persona"] == p["name"]]
+    top_pos = persona_df[persona_df["polarity"] > 0]["review_text"].head(2).tolist()
+    top_neg = persona_df[persona_df["polarity"] < 0]["review_text"].head(2).tolist()
+    pos_summary = groq_bullets_persona(
+        f"Summarize two main positive sentiment points, with percentage, for persona '{p['name']}'.",
+        " ".join(top_pos)
+    ) if top_pos else "No positive reviews."
+    neg_summary = groq_bullets_persona(
+        f"Summarize two main negative sentiment points, with percentage, for persona '{p['name']}'.",
+        " ".join(top_neg)
+    ) if top_neg else "No negative reviews."
+    rec_prompt = (
+    f"You are a product marketing strategist. "
+    f"Based on the review highlights and persona details for '{p['name']}' "
+    f"(do not repeat the characteristics), write one concise or mention name of user, actionable product or marketing recommendation. Dont put * anywhere "
+    f"for the company to better engage this persona. "
+    f"Focus on practical actions the business can take (such as messaging, offers, features, or campaigns). "
+    f"Reply with 1-2 sentences, avoid restating the persona’s traits."
+    )
+    try:
+        rec_out = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system", "content": PRODUCT_CONTEXT},
+                {"role": "user", "content": rec_prompt}
+            ],
+            max_tokens=80, temperature=0.5
+        ).choices[0].message.content.strip()
+    except:
+        rec_out = "No recommendation available."
+    with persona_grid[idx % 2]:
+        st.markdown(
+            f"<div style='margin-bottom:38px;padding:18px 20px 8px 20px;border-radius:18px;"
+            f"background:linear-gradient(90deg,{persona_colors[idx%4]}22,#181830 100%);box-shadow:0 2px 22px {persona_colors[idx%4]}18;'>"
+            f"<h2 style='color:{persona_colors[idx%4]};font-size:1.35em;margin-bottom:0.3em'>{p['name']}</h2>"
+            f"<div style='color:#fff;font-size:1.13em;font-weight:400;margin-bottom:14px;'>{p['summary']}</div>"
+            "<div style='margin-bottom:16px'>"
+            f"<b style='color:{neon_green};font-size:1.1em;'>Top Positive Sentiments:</b><br>{block_markdown(pos_summary, neon_green)}"
+            "</div>"
+            "<div style='margin-bottom:16px'>"
+            f"<b style='color:{neon_pink};font-size:1.1em;'>Top Negative Sentiments:</b><br>{block_markdown(neg_summary, neon_pink)}"
+            "</div>"
+            "<div>"
+            f"<b style='color:{neon_yellow};font-size:1.1em;'>Recommendation:</b><br>{block_markdown(rec_out, neon_yellow)}"
+            "</div>"
+            "</div>", unsafe_allow_html=True
+        )
+st.markdown("---")
+st.markdown(
+    f"<small style='color:{neon_yellow}'>Powered By Bugs Fring</small>",
+    unsafe_allow_html=True
+)

src/pages/prt111.py ADDED Viewed

	@@ -0,0 +1,700 @@

+import streamlit as st
+import pandas as pd
+import os
+from PIL import Image
+import pytesseract
+import speech_recognition as sr
+import re
+from collections import Counter
+from wordcloud import WordCloud
+from transformers import pipeline
+import plotly.graph_objs as go
+from sklearn.feature_extraction.text import TfidfVectorizer, CountVectorizer
+from groq import Groq
+import matplotlib.pyplot as plt
+import numpy as np
+from itertools import combinations
+import networkx as nx
+from sklearn.manifold import TSNE
+from dotenv import load_dotenv
+load_dotenv()  # load .env file
+GROQ_API_KEY = os.environ.get("GROQ_API_KEY")
+# --- CONFIG ---
+GROQ_MODEL = "llama3-70b-8192"
+groq_client = Groq(api_key=GROQ_API_KEY)
+PRODUCT_CONTEXT = (
+    "You are analyzing customer reviews for a chocolate-flavoured whey protein powder. "
+    "The product is aimed at fitness enthusiasts and helps with muscle growth and recovery."
+)
+RAW_CSV_PATH = "/Users/kushagraaatre/Downloads/Texpedition/data.csv"
+REVIEW_FOLDER = "/Users/kushagraaatre/Downloads/Texpedition/review_files"
+DEFAULT_CSV_PATH = "/Users/kushagraaatre/Downloads/Texpedition/data_with_text.csv"
+# Neon colors for blocks
+neon_blue = "#00fff7"
+neon_green = "#7CFC00"
+neon_pink = "#F72585"
+neon_yellow = "#FFF600"
+neon_bg = "#181830"
+neon_orange = "#FFB347"
+# --- UTILS ---
+def clean_name(name):
+    return (
+        str(name)
+        .strip()
+        .replace('\ufeff', '')
+        .replace('\n', '')
+        .replace('\r', '')
+        .replace('\t', '')
+        .lower()
+    )
+def extract_review_text(df, review_file_dict):
+    review_texts = []
+    for i, row in df.iterrows():
+        fname = clean_name(row['review_file'])
+        file = review_file_dict.get(fname)
+        text = ""
+        if file is None:
+            text = "(missing file)"
+        elif fname.endswith(".txt"):
+            try:
+                with open(file, "r", encoding="utf-8", errors="ignore") as f:
+                    text = f.read().strip()
+                if not text:
+                    text = "(text unreadable)"
+            except Exception:
+                text = "(text unreadable)"
+        elif fname.endswith(".png"):
+            try:
+                img = Image.open(file)
+                text = pytesseract.image_to_string(img).strip()
+                if not text:
+                    text = "(image unreadable)"
+            except Exception:
+                text = "(image unreadable)"
+        elif fname.endswith(".wav"):
+            r = sr.Recognizer()
+            try:
+                with sr.AudioFile(file) as source:
+                    audio = r.record(source)
+                text = r.recognize_google(audio)
+                if not text:
+                    text = "(audio unreadable)"
+            except Exception:
+                text = "(audio unreadable)"
+        else:
+            text = "(unsupported file)"
+        review_texts.append(text)
+    return review_texts
+@st.cache_resource(show_spinner=True)
+def get_sentiment_pipeline():
+    return pipeline("sentiment-analysis", model="distilbert-base-uncased-finetuned-sst-2-english")
+def hf_sentiment(text):
+    try:
+        result = sentiment_pipeline(text[:512])[0]
+        label = result['label']
+        score = result['score']
+        if score <= 0.6:
+            return ("Neutral", 0.0)
+        if label == "POSITIVE" and score > 0.8:
+            return ("Strongly Positive", score)
+        elif label == "POSITIVE":
+            return ("Positive", score)
+        elif label == "NEGATIVE" and score > 0.8:
+            return ("Strongly Negative", -score)
+        else:
+            return ("Negative", -score)
+    except Exception:
+        return ("Neutral", 0.0)
+def groq_bullets(chart_desc, chart_data_text):
+    user_prompt = (
+        f"Summarize as exactly two bullet points the main insights for a chocolate whey protein product, from this chart: {chart_desc}. "
+        f"Here is the relevant data or result: {chart_data_text}. "
+        "Do not use the words 'says', 'shows', 'suggests', 'tells', 'reveals', 'indicates', or any similar phrases. Just facts."
+    )
+    try:
+        chat_completion = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system", "content": PRODUCT_CONTEXT},
+                {"role": "user", "content": user_prompt}
+            ],
+            max_tokens=80,
+            temperature=0.6,
+        )
+        bullets = chat_completion.choices[0].message.content.strip()
+        points = [line for line in bullets.splitlines() if line.strip().startswith(("-", "•"))]
+        points = [pt.strip() for pt in points if pt.strip() and not pt.lower().startswith("summary")]
+        return "\n".join(points[:2]) if len(points) >= 2 else "- " + bullets
+    except Exception:
+        return "- Summary not available.\n- (LLM error)"
+def block_markdown(text, color):
+    text = text.replace('\n', '<br>')
+    return (
+        f'<div style="background:linear-gradient(90deg,{color}22,#181830 90%);'
+        f'padding:16px 22px;border-radius:14px;margin:10px 0 24px 0;'
+        f'font-weight:600;color:#fff;font-size:1.04em;line-height:1.6">'
+        f'{text}</div>'
+    )
+def groq_summary_block(prompt):
+    try:
+        resp = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system", "content": PRODUCT_CONTEXT},
+                {"role": "user", "content": prompt}
+            ],
+            max_tokens=100,
+            temperature=0.4,
+        )
+        return resp.choices[0].message.content.strip()
+    except Exception:
+        return "(Summary not available.)"
+def groq_top_sentiments(all_text, pos_or_neg="positive"):
+    prompt = (
+        f"Summarize the top 3 {pos_or_neg} sentiments from these customer reviews about a chocolate whey protein powder. "
+        f"Give each sentiment as a short, specific bullet point (not quotes)."
+        f"Reviews: {all_text[:4000]}"
+    )
+    try:
+        resp = groq_client.chat.completions.create(
+            model=GROQ_MODEL,
+            messages=[
+                {"role": "system", "content": PRODUCT_CONTEXT},
+                {"role": "user", "content": prompt}
+            ],
+            max_tokens=80,
+            temperature=0.5,
+        )
+        lines = [line for line in resp.choices[0].message.content.strip().split('\n') if line.strip().startswith(("-", "•"))]
+        return "\n".join(lines[:3])
+    except Exception:
+        return "- Not available.\n- (LLM error)"
+def top_n_reviews(df, sentiment, n=3):
+    if sentiment.lower().startswith("pos"):
+        filt = df["sentiment_label"].str.contains("Positive", case=False)
+        top = df.loc[filt].sort_values("polarity", ascending=False)
+    elif sentiment.lower().startswith("neg"):
+        filt = df["sentiment_label"].str.contains("Negative", case=False)
+        # Filter for .txt reviews only
+        if 'review_file' in df.columns:
+            txt_mask = df["review_file"].astype(str).str.endswith('.txt')
+            top = df.loc[filt & txt_mask].sort_values("polarity")
+        else:
+            top = df.loc[filt].sort_values("polarity")
+    else:
+        return []
+    return top["review_text"].head(n).tolist()
+# --- LAYOUT ---
+st.set_page_config(page_title="🌐 Insight Engine", layout="wide", initial_sidebar_state="collapsed")
+# --- AGE TITLE ---
+st.markdown(
+    "<h1 style='color:#00fff7;font-size:2.65rem;font-weight:900;letter-spacing:0.01em;margin-bottom:5px;'>🌐 Insight Engine</h1>",
+    unsafe_allow_html=True
+)
+# --- CHEEKY INTRO (PURPLE, Multimodal, Text to Insights) ---
+st.markdown("""
+<div style="font-size:1.22rem; color:#AC7CFF; font-weight:600; margin-top:-12px; margin-bottom:11px; line-height:1.56;">
+    🚀 Welcome to your all-in-one playground for market insight magic—supercharged with <b>multimodal skills</b>!
+    Drop in text, images, or even audio—we'll crunch it all and transform bland data into beautiful, actionable insights.
+    Curious what customers really think? Need to turn a wall of reviews into dazzling graphs, smart summaries, and aha-moments?
+</div>
+""", unsafe_allow_html=True)
+# --- EXPLANATION FOR THE DEMO DATASET (YELLOW/ORANGE) ---
+st.markdown("""
+<div style="font-size:1.12rem; color:#FFB347; font-weight:700; margin-bottom:14px; line-height:1.49;">
+    For this demo, we’ve loaded up a dataset of chocolate protein powder reviews—so you can see all features in action, no setup needed.
+    But hey, The magic works for everything from cookies to kettlebells.
+</div>
+""", unsafe_allow_html=True)
+# Add custom CSS for neon buttons
+st.markdown("""
+    <style>
+    .neon-btn {
+        display:inline-block;
+        font-weight:bold;
+        padding:14px 32px;
+        border:none;
+        border-radius:12px;
+        font-size:1.1em;
+        margin-right:18px;
+        cursor:pointer;
+        box-shadow:0 0 14px #00fff777;
+        color:#222 !important;
+        background:linear-gradient(90deg,#7CFC00,#00fff7);
+        text-decoration:none !important;
+        transition: transform 0.08s;
+    }
+    .neon-btn-pink {
+        background:linear-gradient(90deg,#F72585,#00fff7);
+        color:#fff !important;
+        box-shadow:0 0 14px #F7258577;
+    }
+    .neon-btn:hover {
+        transform:scale(1.04);
+        box-shadow:0 0 24px #00fff799;
+    }
+    .neon-btn-pink:hover {
+        box-shadow:0 0 24px #F7258599;
+    }
+    </style>
+""", unsafe_allow_html=True)
+# Place the links side by side
+st.markdown("""
+<div style="display:flex;gap:2em;">
+    <a href="/persona" class="neon-btn"target="_self">👤 Persona Analysis</a>
+    <a href="/newprod" class="neon-btn neon-btn-pink"target="_self">🚀 New Product Launch</a>
+</div>
+<br>
+""", unsafe_allow_html=True)
+# --- LOAD DATA & PREPROCESS ---
+csv_path = DEFAULT_CSV_PATH
+if not os.path.exists(csv_path):
+    st.warning(f"Preprocessed CSV not found at {csv_path}. Starting file extraction & text recognition...")
+    if not os.path.exists(RAW_CSV_PATH):
+        st.error(f"Raw CSV file not found at {RAW_CSV_PATH}")
+        st.stop()
+    df = pd.read_csv(RAW_CSV_PATH)
+    review_file_dict = {}
+    if not os.path.exists(REVIEW_FOLDER):
+        st.error(f"Review folder not found at {REVIEW_FOLDER}")
+        st.stop()
+    for fname in os.listdir(REVIEW_FOLDER):
+        key = clean_name(fname)
+        full_path = os.path.join(REVIEW_FOLDER, fname)
+        if os.path.isfile(full_path):
+            review_file_dict[key] = full_path
+    df["review_text"] = extract_review_text(df, review_file_dict)
+    df.to_csv(csv_path, index=False)
+    st.success("Preprocessing complete! Continuing with analysis...")
+else:
+    df = pd.read_csv(csv_path)
+df["review_text"] = df["review_text"].fillna("")
+sentiment_pipeline = get_sentiment_pipeline()
+with st.spinner("Running HuggingFace sentiment analysis on reviews... (first time may take a minute)"):
+    df[["sentiment_label", "polarity"]] = df["review_text"].apply(
+        lambda x: hf_sentiment(x) if x and "unreadable" not in x and "missing" not in x else ("Neutral", 0)
+    ).apply(pd.Series)
+df["review_length"] = df["review_text"].apply(lambda x: len(str(x).split()))
+df_valid = df[
+    ~df["review_text"].str.contains("unreadable|missing|unsupported", case=False, na=False)
+    & df["review_text"].str.strip().astype(bool)
+]
+all_reviews = " ".join(df_valid["review_text"])
+# ----------------- MAIN GRAPHS (numbered, with summaries in blocks) -------------------
+# --- 1 & 2. Sentiment Distribution + Top Themes ---
+c1, c2 = st.columns(2)
+with c1:
+    st.subheader("1. Sentiment Distribution")
+    sentiment_counts = df["sentiment_label"].value_counts()
+    color_dict = {
+        "Strongly Positive": neon_green,
+        "Positive": neon_blue,
+        "Neutral": neon_yellow,
+        "Negative": neon_pink,
+        "Strongly Negative": "#c1121f"
+    }
+    colors = [color_dict.get(lbl, "#a67b5b") for lbl in sentiment_counts.index]
+    fig_pie = go.Figure(data=[go.Pie(
+        labels=sentiment_counts.index,
+        values=sentiment_counts.values,
+        hole=0.4,
+        marker=dict(colors=colors),
+    )])
+    fig_pie.update_traces(textinfo='percent+label')
+    fig_pie.update_layout(showlegend=True, legend=dict(orientation="h"), font=dict(size=16))
+    st.plotly_chart(fig_pie, use_container_width=True)
+    st.markdown(block_markdown(groq_bullets("Sentiment distribution pie chart", f"Counts: {sentiment_counts.to_dict()}"), neon_blue), unsafe_allow_html=True)
+with c2:
+    st.subheader("2. Top Themes")
+    if len(df_valid) > 0:
+        vectorizer = TfidfVectorizer(stop_words="english", max_features=10)
+        X = vectorizer.fit_transform(df_valid["review_text"].fillna(""))
+        keywords = [w for w in vectorizer.get_feature_names_out() if len(w) > 2 and w.lower() not in ["says", "tells", "said", "like", "really"]]
+        counts = X.sum(axis=0).A1
+        theme_counts = sorted(zip(keywords, counts), key=lambda x: -x[1])
+        fig_theme = go.Figure(data=[
+            go.Bar(
+                x=[k for k, _ in theme_counts], y=[int(c) for _, c in theme_counts],
+                marker=dict(color=[neon_green, neon_pink, neon_blue, neon_yellow, neon_orange]*2)
+            )
+        ])
+        fig_theme.update_layout(xaxis_title='Theme/Keyword', yaxis_title='Frequency', font=dict(size=16))
+        st.plotly_chart(fig_theme, use_container_width=True)
+        st.markdown(block_markdown(
+            groq_bullets("Bar chart of frequency of top review themes",
+                         ', '.join([k for k,_ in theme_counts])), neon_orange), unsafe_allow_html=True)
+    else:
+        st.write("No valid reviews for theme extraction.")
+        st.markdown(block_markdown("- No data.\n- No chart.", neon_orange), unsafe_allow_html=True)
+st.markdown("---")
+# --- 3 & 4. Sentiment Trend Over Time + Aspect-Based Sentiment ---
+c3, c4 = st.columns(2)
+with c3:
+    st.subheader("3. Sentiment Trend Over Time")
+    df_valid = df_valid.reset_index()
+    df_valid["review_idx"] = df_valid.index + 1
+    df_valid_trend = df_valid.groupby("review_idx").agg({"polarity": "mean"}).reset_index()
+    fig_line = go.Figure(data=[
+        go.Scatter(
+            x=df_valid_trend["review_idx"], y=df_valid_trend["polarity"],
+            mode="lines+markers+text",
+            line=dict(color=neon_pink, width=4, dash='dash'),
+            marker=dict(size=8, color=neon_green, symbol="diamond"),
+        )
+    ])
+    fig_line.update_layout(
+        xaxis_title="Review Index (chronological)",
+        yaxis_title="Avg Sentiment",
+        font=dict(size=16, color=neon_pink),
+        plot_bgcolor=neon_bg
+    )
+    st.plotly_chart(fig_line, use_container_width=True)
+    st.markdown(block_markdown(
+        groq_bullets("Sentiment trend line over time (reviews in chronological order)",
+        f"Polarity: {list(df_valid_trend['polarity'][:30])}"), neon_pink), unsafe_allow_html=True)
+with c4:
+    st.subheader("4. Aspect-Based Sentiment")
+    aspects = ["price", "quality", "delivery", "taste", "mixability"]
+    aspect_scores = []
+    for aspect in aspects:
+        mask = df_valid["review_text"].str.contains(aspect, case=False, na=False)
+        pols = df_valid.loc[mask, "polarity"]
+        aspect_scores.append(pols.mean() if not pols.empty else 0)
+    fig_aspect = go.Figure(data=[
+        go.Bar(
+            x=aspects, y=aspect_scores,
+            marker=dict(color=[neon_blue, neon_green, neon_pink, neon_yellow, neon_orange])
+        )
+    ])
+    fig_aspect.update_layout(xaxis_title="Aspect", yaxis_title="Avg Sentiment", font=dict(size=16))
+    st.plotly_chart(fig_aspect, use_container_width=True)
+    st.markdown(block_markdown(
+        groq_bullets(
+            "Bar chart of sentiment for product aspects (price, quality, delivery, taste, mixability)",
+            str(dict(zip(aspects, [f"{x:.2f}" for x in aspect_scores])))
+        ), neon_green), unsafe_allow_html=True)
+st.markdown("---")
+# --- 5 & 6. Word Cloud + Review Length Trend ---
+# Add this above your word cloud and co-occurrence logic
+stopwords = set("""
+the and for with you that this are have from all has can will just get out too its on an is in it of to a i my says said tell tells also would could should not as if be do does did was were been being by he she they them their our we us his her its so or at more most some such only may might like one two first second every much well still own even many go goes gone didn't don't isn't aren't wasn't weren't doesn't haven't hadn't can't won't won't wouldn't mustn't protein powder review
+""".split())
+def filter_tokens(words):
+    return [w for w in words if w not in stopwords and len(w) > 2 and not w.isnumeric()]
+c5, c6 = st.columns(2)
+with c5:
+    st.subheader("5. Word Cloud")
+    if all_reviews.strip():
+        words = re.findall(r'\w+', all_reviews.lower())
+        filtered_words = filter_tokens(words)
+        filtered_text = " ".join(filtered_words)
+        wc = WordCloud(
+            width=900, height=400, background_color=neon_bg, colormap='winter',
+            max_words=80, random_state=42
+        ).generate(filtered_text)
+        st.image(wc.to_array(), use_column_width=True)
+        top_words = ", ".join([w for w, _ in Counter(filtered_words).most_common(12)])
+        st.markdown(block_markdown(groq_bullets("Word cloud of frequent review words", top_words), neon_yellow), unsafe_allow_html=True)
+    else:
+        st.write("No review text available.")
+        st.markdown(block_markdown("- No text for word cloud.", neon_yellow), unsafe_allow_html=True)
+with c6:
+    st.subheader("6. Review Length Trend")
+    if len(df_valid) > 0:
+        review_lengths = df_valid["review_length"].reset_index(drop=True)
+        fig_line_length = go.Figure(data=[
+            go.Scatter(
+                x=review_lengths.index + 1, y=review_lengths,
+                mode="lines+markers",
+                line=dict(color=neon_orange, width=3)
+            )
+        ])
+        fig_line_length.update_layout(
+            xaxis_title="Review (chronological order)",
+            yaxis_title="Review Length (words)",
+            font=dict(size=16), plot_bgcolor=neon_bg
+        )
+        st.plotly_chart(fig_line_length, use_container_width=True)
+        st.markdown(block_markdown(
+            groq_bullets("Line chart showing trend of review lengths (number of words) in chronological order",
+            f"Lengths: {list(review_lengths[:50])}"), neon_orange), unsafe_allow_html=True)
+    else:
+        st.write("No valid reviews for length trend.")
+        st.markdown(block_markdown("- No data.\n- No chart.", neon_orange), unsafe_allow_html=True)
+st.markdown("---")
+# --- 7 & 8. Sentiment Polarity Histogram + Emotion Analysis ---
+c7, c8 = st.columns(2)
+with c7:
+    st.subheader("7. Sentiment Polarity Histogram")
+    # Make histogram visually full by using kde line (density)
+    polarity_values = df_valid["polarity"].values
+    fig_hist, ax = plt.subplots(figsize=(7,3))
+    ax.hist(polarity_values, bins=8, color=neon_blue, alpha=0.88, edgecolor="#222", density=True)
+    ax.set_xlabel("Sentiment Polarity Score")
+    ax.set_ylabel("Density")
+    ax.set_title("Distribution of Sentiment Scores")
+    # KDE line
+    if len(polarity_values) > 1:
+        from scipy.stats import gaussian_kde
+        kde = gaussian_kde(polarity_values)
+        x_range = np.linspace(-1, 1, 200)
+        ax.plot(x_range, kde(x_range), color=neon_green, lw=2)
+    st.pyplot(fig_hist)
+    st.markdown(block_markdown(
+        groq_bullets("Histogram of sentiment scores", list(polarity_values[:50])), neon_blue
+    ), unsafe_allow_html=True)
+with c8:
+    st.subheader("8. Emotion Analysis Bar Chart")
+    @st.cache_resource(show_spinner=True)
+    def get_emotion_pipeline():
+        return pipeline("text-classification", model="j-hartmann/emotion-english-distilroberta-base", top_k=None)
+    emotion_pipeline = get_emotion_pipeline()
+    emotion_counts = {}
+    for review in df_valid["review_text"]:
+        try:
+            emotions = emotion_pipeline(review[:512])
+            for e in emotions:
+                for d in e:
+                    emotion = d['label']
+                    if d['score'] > 0.2:
+                        emotion_counts[emotion] = emotion_counts.get(emotion, 0) + 1
+        except Exception:
+            continue
+    if emotion_counts:
+        fig_emotion = go.Figure(data=[
+            go.Bar(
+                x=list(emotion_counts.keys()),
+                y=list(emotion_counts.values()),
+                marker=dict(color=[neon_pink, neon_green, neon_blue, neon_yellow, neon_orange])
+            )
+        ])
+        fig_emotion.update_layout(xaxis_title="Emotion", yaxis_title="Count", font=dict(size=16))
+        st.plotly_chart(fig_emotion, use_container_width=True)
+        st.markdown(block_markdown(
+            groq_bullets("Bar chart of detected emotions in reviews", str(emotion_counts)), neon_pink
+        ), unsafe_allow_html=True)
+    else:
+        st.write("No emotion results (try more reviews).")
+st.markdown("---")
+# --- 9 & 10. Bigram/Trigram Frequency + Co-occurrence Network ---
+c9, c10 = st.columns(2)
+with c9:
+    st.subheader("9. Bigram/Trigram Frequency")
+    # Use only meaningful ngrams (exclude numbers, names)
+    corpus = df_valid["review_text"].tolist()
+    vect = CountVectorizer(ngram_range=(2,3), stop_words='english', max_features=20, token_pattern=r'\b[a-zA-Z][a-zA-Z]+\b')
+    X_ngram = vect.fit_transform(corpus)
+    ngram_counts = X_ngram.sum(axis=0).A1
+    ngrams = vect.get_feature_names_out()
+    ngram_freq = sorted(zip(ngrams, ngram_counts), key=lambda x: -x[1])
+    fig_ngram = go.Figure(data=[
+        go.Bar(
+            y=[ng for ng,_ in ngram_freq],
+            x=[int(c) for _,c in ngram_freq],
+            orientation='h',
+            marker=dict(color=neon_blue)
+        )
+    ])
+    fig_ngram.update_layout(yaxis_title='Phrase', xaxis_title='Count', font=dict(size=15))
+    st.plotly_chart(fig_ngram, use_container_width=True)
+    st.markdown(block_markdown(
+        groq_bullets("Bar chart of most common bigrams/trigrams", ', '.join([f"{ng}: {c}" for ng,c in ngram_freq])), neon_blue
+    ), unsafe_allow_html=True)
+with c10:
+    st.subheader("10. Co-occurrence Network Graph")
+    def get_top_cooc_words(texts, top_n=12):
+        words = [filter_tokens(re.findall(r'\w+', t.lower())) for t in texts]
+        all_pairs = []
+        for wlist in words:
+            all_pairs.extend(list(combinations(set(wlist), 2)))
+        counter = Counter(all_pairs)
+        return counter.most_common(top_n)
+    top_pairs = get_top_cooc_words(df_valid["review_text"])
+    G = nx.Graph()
+    for (a, b), w in top_pairs:
+        G.add_edge(a, b, weight=w)
+    # Use Kamada-Kawai layout for more even node spacing
+    pos = nx.kamada_kawai_layout(G)
+    # Adjust node and font size for clarity
+    node_count = G.number_of_nodes()
+    base_node_size = 620 if node_count <= 10 else max(390, 1400 // (node_count + 1))
+    font_size = 15 if node_count <= 10 else max(9, 20 - node_count // 2)
+    plt.figure(figsize=(7.4, 6.1))
+    nx.draw_networkx_nodes(
+        G, pos, node_color=neon_orange, edgecolors="#fff", linewidths=2,
+        node_size=base_node_size, alpha=0.96
+    )
+    nx.draw_networkx_edges(
+        G, pos,
+        width=[2.2 + G[u][v]['weight'] / 2.4 for u, v in G.edges()],
+        edge_color=neon_blue, alpha=0.76
+    )
+    nx.draw_networkx_labels(
+        G, pos, font_size=font_size, font_color="#212121", font_weight="bold"
+    )
+    plt.axis('off')
+    plt.tight_layout(pad=0.3)
+    st.pyplot(plt.gcf())
+    plt.clf()
+    # --- GROQ SUMMARY (2 lines, info box style) ---
+    def groq_summary_graph(prompt):
+        try:
+            resp = groq_client.chat.completions.create(
+                model=GROQ_MODEL,
+                messages=[
+                    {"role": "system", "content": PRODUCT_CONTEXT},
+                    {"role": "user", "content": prompt}
+                ],
+                max_tokens=90,
+                temperature=0.55,
+            )
+            # Remove asterisks, intro, etc
+            lines = [
+                line.strip(" *-•1234567890.").replace("**", "")
+                for line in resp.choices[0].message.content.strip().split("\n")
+                if line.strip()
+            ]
+            # Only first 2 lines (you may get 1-3 lines, but only keep 2)
+            return "<br>".join(lines[:2])
+        except Exception:
+            return "Summary not available."
+    cooc_pairs_str = "; ".join([f"{a}-{b} ({w})" for (a, b), w in top_pairs])
+    graph_summary = groq_summary_graph(
+        f"Summarize the key relationships or surprising findings in exactly two punchy, non-repetitive lines from this co-occurrence network of customer review words. "
+        f"No generic intro, only crisp insights. Pairs: {cooc_pairs_str}"
+    )
+    st.markdown(
+        f"""
+        <div style='background:linear-gradient(90deg,{neon_blue}22,{neon_orange}22);border-radius:14px;padding:18px 22px 12px 22px;margin-top:14px;margin-bottom:14px;box-shadow:0 2px 18px {neon_blue}19;'>
+            <span style='color:{neon_orange};font-size:1.15em;font-weight:800;'>Quick Network Insights:</span><br>
+            <span style='color:#fff;font-size:1.09em;'>{graph_summary}</span>
+        </div>
+        """, unsafe_allow_html=True
+    )
+st.markdown("---")
+# --- 11. Review Cluster Visualization (t-SNE) ---
+st.subheader("11. Review Cluster Visualization (t-SNE)")
+vectorizer = TfidfVectorizer(stop_words="english", max_features=100)
+X = vectorizer.fit_transform(df_valid["review_text"].fillna("")).toarray()
+tsne = TSNE(n_components=2, random_state=42, perplexity=min(30, max(5, len(df_valid)//2)))
+X_tsne = tsne.fit_transform(X)
+fig_tsne = go.Figure(data=[
+    go.Scatter(
+        x=X_tsne[:,0], y=X_tsne[:,1], mode="markers",
+        marker=dict(color=df_valid["polarity"], colorscale="RdYlGn", size=12, showscale=True),
+        text=df_valid["sentiment_label"]
+    )
+])
+fig_tsne.update_layout(xaxis_title="t-SNE 1", yaxis_title="t-SNE 2", font=dict(size=16))
+st.plotly_chart(fig_tsne, use_container_width=True)
+st.markdown(block_markdown(
+    groq_bullets("2D scatterplot of review clusters by t-SNE", "points colored by sentiment"), neon_blue
+), unsafe_allow_html=True)
+st.markdown("---")
+# ----------- Final Neon Blocks: Top Quotes and Summaries -----------
+st.markdown("---")
+cl1, cl2 = st.columns(2)
+with cl1:
+    st.markdown(block_markdown(
+        "<b>Top 3 Enthusiastic Positive Reviews:</b><br>" + "<br><br>".join(
+            [f'<span style="color:{neon_green}">“{r}”</span>' for r in top_n_reviews(df_valid, "Positive", 3)]
+        ),
+        neon_green), unsafe_allow_html=True)
+with cl2:
+    st.markdown(block_markdown(
+        "<b>Top 3 Most Critical Negative Reviews:</b><br>" + "<br><br>".join(
+            [f'<span style="color:{neon_pink}">“{r}”</span>' for r in top_n_reviews(df_valid, "Negative", 3)]
+        ),
+        neon_pink), unsafe_allow_html=True)
+cl3, cl4 = st.columns(2)
+with cl3:
+    all_pos_text = " ".join(df_valid[df_valid["polarity"] > 0]["review_text"])
+    st.markdown(block_markdown(
+        "<b>Top 3 Positive Sentiments:</b><br>" + groq_top_sentiments(all_pos_text, "positive"),
+        neon_green), unsafe_allow_html=True)
+with cl4:
+    all_neg_text = " ".join(df_valid[df_valid["polarity"] < 0]["review_text"])
+    st.markdown(block_markdown(
+        "<b>Top 3 Negative Sentiments:</b><br>" + groq_top_sentiments(all_neg_text, "negative"),
+        neon_pink), unsafe_allow_html=True)
+cl5, cl6 = st.columns(2)
+with cl5:
+    sentiment_texts = groq_summary_block(
+        "List the top 3 overall customer sentiments about the chocolate whey protein product as short phrases (not sentences, not quotes, just phrases)."
+    )
+    st.markdown(block_markdown(
+        "<b>Top 3 Overall Sentiments:</b><br>" + sentiment_texts.replace('\n', '<br>'),
+        neon_yellow), unsafe_allow_html=True)
+with cl6:
+    trend_summary = groq_summary_block(
+        "Summarize trends in one short sentence for chocolate protein reviews. "
+        "What do people like most, and what do they dislike most?"
+    )
+    st.markdown(block_markdown(
+        "<b>Summary of Trends:</b><br>" + trend_summary,
+        neon_blue), unsafe_allow_html=True)
+st.markdown("---\n<small style='color:#7CFC00'>Bugs Fring — End of Report</small>", unsafe_allow_html=True)

src/personas.json ADDED Viewed

	@@ -0,0 +1,42 @@

+[
+  {
+    "name": "💪 Fitness Enthusiast",
+    "summary": "Avid gym-goers who prioritize performance and recovery.",
+    "bullets": [
+      "Focus on improving stamina, muscle recovery, and post-workout nutrition.",
+      "Value effectiveness, consistency, and ease of use.",
+      "Often mention \"noticeable improvement\" and \"feel more energetic\" in their reviews.",
+      "May be interested in optimizing their workout routine and nutrition plan."
+    ]
+  },
+  {
+    "name": "🤔 Practical Shoppers",
+    "summary": "Budget-conscious consumers who weigh price against product quality.",
+    "bullets": [
+      "Frequently mention the product's price, value, and quantity.",
+      "May be willing to compromise on flavor options or texture for a better price.",
+      "Still prioritize effectiveness and ease of use.",
+      "May be interested in finding the best deal for their money."
+    ]
+  },
+  {
+    "name": "👅 Flavor Fans",
+    "summary": "Customers who prioritize taste and texture in their supplements.",
+    "bullets": [
+      "Often mention the flavor, consistency, and mixability of the product.",
+      "May be willing to pay a premium for a product that tastes great.",
+      "May be interested in exploring different flavor options or brands.",
+      "May be influenced by reviews that mention taste and texture."
+    ]
+  },
+  {
+    "name": "🏋️‍♂️ Newbies",
+    "summary": "Beginners who are new to the world of fitness and supplements.",
+    "bullets": [
+      "May be enthusiastic and excited about their new fitness journey.",
+      "Often mention being a \"beginner\" or \"new to the gym.\"",
+      "May prioritize ease of use, convenience, and a gentle learning curve.",
+      "May be interested in educational resources or guidance on how to use the product effectively."
+    ]
+  }
+]

src/review_files/ 1.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Mary says:
2	+ Improved my stamina noticeably. I feel more energetic during workouts. Good consistency and not too sweet.

src/review_files/ 2.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Christopher says:
2	+ Great taste and mixes well. The flavor is okay, nothing special.

src/review_files/ 3.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Maria says:
2	+ I feel more energetic during workouts. Noticeable muscle recovery improvement.

src/review_files/ 4.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Dawn says:
2	+ Blends easily with water or milk. Very effective for post-workout nutrition. Slight aftertaste but manageable. Improved my stamina noticeably.

src/review_files/ 5.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Amy says:
2	+ Perfect for daily supplementation. Could have a few more flavor options. No digestive issues so far. Very effective for post-workout nutrition.

src/review_files/ 6.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Stephanie says:
2	+ Bit pricey for the quantity. I feel more energetic during workouts. The flavor is okay, nothing special.

src/review_files/ 7.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Janet says:
2	+ Mild smell, not unpleasant though. Noticeable muscle recovery improvement. Very effective for post-workout nutrition. I feel more energetic during workouts.

src/review_files/ 8.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Robin says:
2	+ Good consistency and not too sweet. Works fine if you’re consistent.

src/review_files/ 9.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Cynthia says:
2	+ Not as filling as expected. Serving scoop could be better marked. Noticeable muscle recovery improvement. Improved my stamina noticeably.

src/review_files/.DS_Store ADDED Viewed

Binary file (8.2 kB). View file

src/review_files/10.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Stanley says:
2	+ Good consistency and not too sweet. Very effective for post-workout nutrition. Packaging could be more durable.

src/review_files/11.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Kyle says:
2	+ Good consistency and not too sweet. Blends easily with water or milk.

src/review_files/12.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Angela says:
2	+ Bit pricey for the quantity. Good consistency and not too sweet. Texture is decent, not too gritty. Helped me maintain my protein intake.

src/review_files/13.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Todd says:
2	+ Could have a few more flavor options. Perfect for daily supplementation.

src/review_files/14.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Thomas says:
2	+ No digestive issues so far. Bit pricey for the quantity. Could have a few more flavor options.

src/review_files/15.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Julia says:
2	+ Noticeable muscle recovery improvement. Blends easily with water or milk. Blends easily with water or milk. Seems effective but too early to judge.

src/review_files/16.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Evelyn says:
2	+ Perfect for daily supplementation. Perfect for daily supplementation. The flavor is okay, nothing special. Blends easily with water or milk.

src/review_files/17.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Bob says:
2	+ Blends easily with water or milk. Perfect for daily supplementation. Clumps if not shaken properly.

src/review_files/18.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Richard says:
2	+ Great taste and mixes well. Not as filling as expected. Helped me maintain my protein intake. Very effective for post-workout nutrition.

src/review_files/19.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Aaron says:
2	+ Good consistency and not too sweet. Great taste and mixes well.

src/review_files/20.txt ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ Shelby says:
2	+ Noticeable muscle recovery improvement. Noticeable muscle recovery improvement.

src/review_files/21.png ADDED Viewed

src/review_files/22.png ADDED Viewed

src/review_files/23.png ADDED Viewed

src/review_files/24.png ADDED Viewed

src/review_files/25.png ADDED Viewed

src/review_files/26.wav ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e1313f099ee9871d96090cf579fd1dc0d0deffe7ca7aff9e91345b730fd3a79d
+size 4509774

src/review_files/27.wav ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1349862387c637a1b4b9bf4f3c6d1c882db0566770682edb03276ed916bb0c91
+size 4655182

src/review_files/28.wav ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6bb2308d55d5d6c13ec0114279a8544c445cca1a3944bef58ded76fb665b3744
+size 4143182

src/review_files/29.wav ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e1313f099ee9871d96090cf579fd1dc0d0deffe7ca7aff9e91345b730fd3a79d
+size 4509774

src/review_files/30.wav ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d1debef5f8b69d3b98c85ae545a937b6adfb3571bca6f5e66398970ff97496ba
+size 5750862