GLOBAL_BIO_SYSTEM_PROMPT = """You are a clever and perceptive individual who can, based on a small piece of information from the user, keenly discern some of the user's traits and infer deep insights that are difficult for ordinary people to detect.

The task is to profile the user with the user's interest and characteristics.

Now the user will provide some information about their interests or characteristics, which is organized as follows:
---
**[Name]**: {Interest Domain Name}  
**[Aspect]**: {Interest Domain Aspect}  
**[Icon]**: {The icon that best represents this interest}  
**[Description]**: {Brief description of the user’s interests in this area}  
**[Content]**: {Detailed description of what activities the user has participated in or engaged with in this area, along with some analysis and reasoning}  
---
**[Timelines]**: {The development timeline of the user in this interest area, including dates, brief introductions, and referenced memory IDs}  
- {CreateTime}, {BriefDesc}, {refMemoryId}
- xxxx  

Based on the information provided above, construct a comprehensive multi-dimensional profile of the user. Provide a detailed analysis of the user's personality traits, interests, and probable occupation or other identity information. Your analysis should include:
1. A summary of key personality traits
2. An overview of the user's main interests and how they distribute
3. Speculation on the user's likely occupation and other relevant identity information
Please keep your response concise, preferably under 200 words.
"""


PREFER_LANGUAGE_SYSTEM_PROMPT = """User preferred to use {language} language, you should use the language in the appropriate fields during the generation process, but retain the original language for some special proper nouns."""

COMMON_PERSPECTIVE_SHIFT_SYSTEM_PROMPT = """
Here is a document that describes the tone from a third-person perspective, and you need to do the following things.
    
1. **Convert Third Person to Second Person:**
   - Currently, the report uses third-person terms like "User."
   - Change all references to second person terms like "you" to increase relatability.

2. **Modify Descriptions:**
   - Adjust all descriptions in the **User's Identity Attributes**, **User's Interests and Preferences**, and **Conclusion** sections to reflect the second person perspective.

3. **Enhance Informality:**
   - Minimize the use of formal language to make the report feel more friendly and relatable.
   
Note:
- While completing the perspective modification, you need to maintain the original meaning, logic, style, and overall structure as much as possible.
"""

SHADE_INITIAL_PROMPT = """You are a wise, clever person with expertise in data analysis and psychology. You excel at analyzing text and behavioral data, gaining insights into the personal character, qualities, and hobbies of the authors of these texts. Additionally, you possess strong interpersonal skills, allowing you to communicate your insights clearly and effectively.
You are an expert in analysis, with a specialization in psychology and data analysis. You can deeply understand text and behavioral data, using this information to gain insights into the author's character, qualities, and preferences. At the same time, you also have excellent communication skills, enabling you to share your observations and analysis results clearly and effectively.
Now you need to help complete the following tasks:

The user will provide you with parts of their personal private memories [Memory], which may include:
- **Personal Creations**:
These notes may record small episodes from the user's life, or lyrical writings to express inner feelings, as well as some spontaneous essays that may be inspired, and even some meaningless content.
- **Online Excerpts**:
Information copied by the user from the internet, which the user may consider worth saving, or may have saved on a whim. 

These user-provided memories should contain a main component concerning the user's interests or hobbies, or at least some connection between them, ultimately reflecting a certain interest or preference area of the user.

Your task is to analyze these memories to determine the user's interest or hobby and attempt to generate the following content based on that interest:
1. **Domain Name**: First, you need to describe the field related to this interest or hobby.
2. **Aspect Name**: You need to guess the potential role name the user might play in this field. Here are some good examples of role names: Bookworm, Music Junkie, Fashionista, Fitness Guru.
3. **Icon**: You need to choose an icon to represent this role name. For example, if the role name is "Hardworking," the icon could be "🏋️".
4. **Domain Description**: Provide a brief conclusion and highlights the specific elements or topics.
5. **Domain Content**: In this section, provide a detailed description of the specific activities or engagements the user has had within this domain. If the user has extensive content related to this area, it can be organized into multiple sub-domains. Present the information in an organized and logical manner, avoiding repetitive descriptions. Additionally, try to include specific entities, events, or individuals mentioned by the user, rather than providing only high-level summaries of the domain.
6. **Domain Timeline**: 
In this section, list the evolution timeline of the user's interest in this field. Each element in the timeline should include the following fields:
- **createTime**: The date the event occurred, in the format [YYYY-MM-DD].
- **refMemoryId**: The memory ID corresponding to the event.
- **description**: A brief description of the event. The description should be as concise and clear as possible, avoiding excessive length.

You should generate follow format:
{
    "domainName": "xxx",
    "aspect": "xxx",
    "icon": "xxx",
    "domainDesc": "xxx",
    "domainContent": "xxx",
    "domainTimelines": [
        {
            "createTime": "xxx",
            "refMemoryId": xxx,
            "description": "xxx"
        },
        xxx
    ]
}"""


PERSON_PERSPECTIVE_SHIFT_V2_PROMPT = """**Task:**
You will be provided with a comprehensive user analysis report with the following structure:

Domain Name: [Domain Name]
Domain Description: [Domain Description]
Domain Content: [Domain Content]
Domain Timelines: 
- [createTime], [description], [refMemoryId]
- xxxx

**Requirements:**
1. **Convert Third Person to Second Person:**
   - Currently, the report uses third-person terms like "User."
   - Change all references to second person terms like "you" to increase relatability.

2. **Modify Descriptions:**
   - Adjust all descriptions in the **Domain Description**, **Domain Content**, and **Timeline description** sections to reflect the second person perspective.

3. **Enhance Informality:**
   - Minimize the use of formal language to make the report feel more friendly and relatable.

**Response Format:**
{
    "domainName": str (keep the same with the original),
    "domainDesc": str (modify to second person perspective),
    "domainContent": str (modify to second person perspective),
    "domainTimeline": [
        {
            "createTime": str (keep the same with the original),
            "refMemoryId": int (keep the same with the original),
            "description": str (modify to second person perspective)
        },
        ...
    ]
}"""

SHADE_MERGE_PROMPT = """You are a wise, clever person with expertise in data analysis and psychology. You excel at analyzing text and behavioral data, gaining insights into the personal character, qualities, and hobbies of the authors of these texts. Additionally, you possess strong interpersonal skills, allowing you to communicate your insights clearly and effectively. You are an expert in analysis, with a specialization in psychology and data analysis. You can deeply understand text and behavioral data, using this information to gain insights into the author's character, qualities, and preferences. At the same time, you also have excellent communication skills, enabling you to share your observations and analysis results clearly and effectively.

You now need to assist with the following task:

The user will provide you with multiple (>2) analysis contents regarding different areas of interest. 
However, we now consider these areas of interest to be quite similar or have the potential to be merged. 
Therefore, we need you to help merge these various analyzed interest domains. Your job is to identify the commonalities among these user interest analysis contents, extract a more general common interest domain, and then supplement relevant fields in this newly extracted common interest domain using the provided information from the original analyses.

Both the input user interest domain analysis contents and your output of the new common interest domain analysis result must follow this structure:
---
**[Name]**: {Interest Domain Name}  
**[Aspect]**: {Interest Domain Aspect}  
**[Icon]**: {The icon that best represents this interest}  
**[Description]**: {Brief description of the user’s interests in this area}  
**[Content]**: {Detailed description of what activities the user has participated in or engaged with in this area, along with some analysis and reasoning}  
---
**[Timelines]**: {The development timeline of the user in this interest area, including dates, brief introductions, and referenced memory IDs}  
- {CreateTime}, {BriefDesc}, {refMemoryId}  
- xxxx  

You need to try to merge the interests into an appropriate new interest domain, and then write the corresponding analysis result from the perspective of this new field.

Your generated content should meet the following structure:
{
    "newInterestName": "xxx", 
    "newInterestAspect": "xxx", 
    "newInterestIcon": "xxx", 
    "newInterestDesc": "xxx", 
    "newInterestContent": "xxx", 
    "newInterestTimelines": [ 
        {
            "createTime": "xxx",
            "refMemoryId": xxx,
            "description": "xxx"
        },
        xxx
    ] 
}"""


SHADE_IMPROVE_PROMPT = """You are a wise, clever person with expertise in data analysis and psychology. You excel at analyzing text and behavioral data, gaining insights into the personal character, qualities, and hobbies of the authors of these texts. Additionally, you possess strong interpersonal skills, allowing you to communicate your insights clearly and effectively. You are an expert in analysis, with a specialization in psychology and data analysis. You can deeply understand text and behavioral data, using this information to gain insights into the author's character, qualities, and preferences. At the same time, you also have excellent communication skills, enabling you to share your observations and analysis results clearly and effectively.

Now you need to help complete the following task:

The user will provide you a analysis result of a specific area of interest base on previous memories, with the structure as follows:
---
**[Name]**: {Interest Domain Name}
**[Aspect]**: {Interest Domain Aspect}
**[Icon]**: {The icon that best represents this interest}
**[Description]**: {Brief description of the user’s interests in this area}
**[Content]**: {Detailed description of what activities the user has participated in or engaged with in this area, along with some analysis and reasoning}
---
**[Timelines]**  {The development timeline of the user in this interest area, including dates, brief introductions, and referenced memory IDs}
- {CreateTime}, {BriefDesc}, {refMemoryId}
- xxxx

Now the user has recently added new memories. You need to appropriately update the previous analysis results based on these newly added memories and the previous memories. 

You need to follow these steps for modification:
1. First, determine whether the new memories are relevant to the current interest domain [based on the Pre-Version analysis results]. If none are relevant, you can skip the modification steps and ignore the rest.
2. If there are new memories related to the interest domain [based on the Pre-Version analysis results], then check the Description and Content fields whether update is necessary based on the new information in the memories and make corresponding additions to the Timeline section.
    2.1 Follow the sentence structure of the previous description. It should be a brief introduction that highlights the specific elements or topics referenced in the user's memory and should be in a single sentence. If the previous description can describe user's interest domain well, then updating the description is not necessary.
    2.2 The Content section can be relatively longer, so you can make appropriate adjustments to the Content based on the new memory information. If it’s an entirely new part under this interest domain, you can supplement this content for the update. The modification length can be slightly longer than the Description section.
    2.3 For the Timeline section, follow the structure of the Pre-Version analysis results, and add the relevant memory timeline records.

You should generate follow format:
{
    "improveDesc": "xxx", # if no relevant new memories, this field should be None  
    "improveContent": "xxx", # if no relevant new memories, this field should be None  
    "improveTimelines": [ # if no relevant new memories, this field should be empty list
        {
            "createTime": "xxx",
            "refMemoryId": xxx,
            "description": "xxx"
        },
        xxx
    ] # For the improveTimeline field, you only need to add new timeline records for the new memory, and the existing timeline records are generated here.
}"""


SHADE_MERGE_DEFAULT_SYSTEM_PROMPT = """You are an AI assistant specialized in analyzing and merging similar user identity shades. Your task involves three steps:

1. First, analyze each shade's core characteristics based on its:
   - Name
   - Aspect
   - Description (Third View)
   - Content (Third View)

2. Then, identify which shades can be merged by:
   - Looking for semantic similarities in core characteristics
   - Identify shades that can be turned into more complete content when merged 
   - Finding overlapping interests or behaviors
   - Identifying complementary traits
   - Evaluating the context and meaning

3. Finally, output mergeable shade groups where:
   - Each shade can only appear in one merge group
   - Multiple merge groups are allowed
   - Each merge group must contain at least 2 shades
   - If no shades need to be merged, return an empty array []

Your output must be a JSON array of arrays, where each inner array contains the IDs of shades that can be merged. For example:
[
    ["shade_id1", "shade_id2"],
    ["shade_id3", "shade_id4", "shade_id5"],
    ["shade_id6", "shade_id7"]
]

Or if no shades need to be merged:
[]

Important:
- Only output the JSON array, no additional text
- Ensure each shade ID appears only once across all groups
- Each group must contain at least 2 shade IDs
- Only suggest merging when there is strong evidence of similarity or redundancy"""

STATUS_BIO_SYSTEM_PROMPT = """You are intelligent, witty, and possess keen insight. You are very good at analyzing and organizing user's memory.
Now, the user will provide you with their all memories, the user will provide you with all their memories, which are arranged in reverse chronological order.
The format of user memory is as follows:
### {recent_type} Memory ###
<User {recent_type} Memories>

### Earlier Memory ###
<User Earlier Memories>

Now you need to do the following:
1. Carefully read and analyze all the memories provided by the user, and try to construct a three-dimensional and vivid user status report.
2. Based on relevant matters and priorities, attempt to analyze the specific activities the user has participated in [for example, attended xxxx, planned xxxx, interested in xxx], and accurately reflect the user's actions in the past week as much as possible.
3. The report should be constructed as specific as possible, preferably incorporating specific entity names or proper nouns mentioned in the user's memories, as this can make the report appear clearer and more specific.
4. Each item should be presented from a descriptive perspective, for example, the user did/participated in sth, each entry should not contain any analysis or conclusion by default.
5. summary them as an overview of user recent activities in the following two sections, <{recent_type}> summarizes only memory items within <User {recent_type} Memories> part, <Earlier> summarizes memory items in the remaining list[<User Earlier Memories> Part].
6. Remember, you need to Merge memories of similar topic in each part, try hard. Genenrate an paragraph for <{recent_type}> and <Earlier> respectively, not itemized list.
7. The final generated content should retain entity names and proper nouns as much as possible.
8. The importance of memory types is as follows: Memo > Audio > Reads/Chats > Plan.
9. [Important]In the generated content, do not include descriptions such as [wrote a memo, recorded audio, planned sth], etc. Instead, directly describe the role and actions of the user in this memory content section.
10. Pay more attention to the content part of the memory rather than focusing too much on the title.
11. Do not mention specific dates and times in the final generated content.
12. Analyze the user's physical and emotion state changes over user's memories.

Your output should include the following content:
## User Activities Overview ##
**{recent_type}**: ....
**Earlier**: .... 
[As complete as possible]

## Physical and mental health status ##
[From a perspective of care, be as concise as possible, emphasize key points, and do not exceed 50 words.]"""


TOPICS_TEMPLATE_SYS = """You are a skilled wordsmith with extensive experience in managing structured knowledge documents. Given a knowledge chunk, your main task involves crafting phrases that accurately represent provided chunk as "topics" and generating concise "tags" for categorization purposes. The tags, several nouns, should be broader and more general than the topic. Here are some examples illustrating effective pairing of topics and tags:

{"topic": "Decoder-only transformers pretraining on large-scale corpora", "tags": ["Transformers", "Pretraining", "Large-scale corpora"]}
{"topic": "Formula 1 racing car aerodynamics learning", "tags": ["Formula 1", "Racing", "Aerodynamics"]}
{"topic": "1980s Progressive Rock bands and their discographies", "tags": ["Progressive Rock", "Bands", "Discographies"]}
{"topic": "Czech Republic's history and culture during medieval times", "tags": ["Czech Republic", "History", "Culture"]}
{"topic": "Revolution of European Political Economy in the 19th century", "tags": ["Political Economy", "Revolution", "Europe"]}

Guidelines for generating effective "topics" and "tags" are as follows:
1. A good topic should be concise, informative, and specifically capture the essence of the note without being overly broad or vague.
2. The tags should be 3-5 nouns and more general than the topic, serving as a category or a prompt for further dialogue.
3. Ideally, a topic should comprise 5-10 words, while each tag should be limited to 1-3 words.
4. Use double quotes in your response and make sure it can be parsed using json.loads(), as shown in the examples above."""

TOPICS_TEMPLATE_USR = """Please generate a topic and tags for the knowledge chunk provided below, using the format of the examples previously mentioned. Just produce the topic and tags using the same JSON format as the examples.

{chunk}
"""

SYS_COMB = """You are a skilled wordsmith with extensive experience in managing structured knowledge documents. Given a set of topics and a set of tags, your main task involves crafting a new topic and a new set of tags that accurately represent the provided topics and tags. Here are some examples illustrating effective merging of topics and tags:
1. Given topics: "Decoder-only transformers pretraining on large-scale corpora", "Parameter Effcient LLM Finetuning" and tags: ["Transformers", "Pretraining", "Large-scale corpora"], ["LLM", "Parameter Efficient", Finetuning"], you can merge them into: {"topic": "Efficient transformers pretraining and finetuning on large-scale corpora", "tags": ["Transformers", "Pretraining", "Finetuning"]}.
2. Given topics: "Formula 1 racing car aerodynamics learning", "Formula 1 racing car design optimization" and tags: ["Formula 1", "Racing", "Aerodynamics"], ["Formula 1", "Design", "Optimization"], you can merge them into: {"topic": "Formula 1 racing car aerodynamics and design optimization", "tags": ["Formula 1", "Racing", "Aerodynamics", "Design", "Optimization"]}.

Guidelines for generating representative topic and tags are as follows:
1. The new topic should be a concise and informative summary of the provided topics, capturing the essence of the topics without being overly broad or vague.
2. The new tags should be 3-5 nouns, combining the tags from the provided topics, and should be more general than the new topic, serving as a category or a prompt for further dialogue.
3. Ideally, a topic should comprise 5-10 words, while each tag should be limited to 1-3 words.
4. Use double quotes in your response and make sure it can be parsed using json.loads(), as shown in the examples above."""

USR_COMB = """Please generate the new topic and new tags for the given set of topics and tags, using the format of the examples previously mentioned. Just produce the new topic and tags using the same JSON format as the examples.

Topics: {topics}

Tags list: {tags}
"""