We may earn compensation from some providers below.

How We Test & Score AI Girlfriend Apps

Updated on: October 19, 2024

Update Schedule Overview

Every 3 months: We check our information for accuracy, update pricing, and make small adjustments as needed.

Every 6 months: We conduct a comprehensive review of all content to ensure it remains relevant, making major updates where necessary.

This schedule ensures you receive the most current and accurate information. For more details, visit our update process.

Content Review Process

This piece of content is reviewed by one of our experts, ensuring the information you receive is accurate and trustworthy.

For more details, learn about our fact-checking process.

Our team has spent countless hours refining our AI girlfriend testing process. Our system is designed to create a clear and fair assessment for each AI girlfriend app.

If you use our scores as guidelines you can quickly discover which AI girlfriend app is best for you.

Below, we go factor by factor so you can see exactly how we test and score AI girlfriend apps.

We currently use this rating system for the following types of products:

  • AI roleplay apps
  • AI girlfriend apps
  • (NSFW) Chatbots

Scoring Factors

Our overall score is based on a weighted average of 6 scoring factors.

  • 15% – Character Diversity
  • 15% – Customization
  • 20% – Conversation
  • 20% – Image Quality
  • 10% – Privacy
  • 20% – Value for Money

Understanding Our Scores

We score each factor, weight it by its importance, and add the weighted scores to get the overall score. The more points an AI girlfriend app scores on any given criterion, the better its overall score will be.
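
As a rough illustration, the weighted average could be computed as in the sketch below. The weights come from the Scoring Factors list; the function name, the example scores, and the rounding to one decimal are our own assumptions for this example, not a description of an internal tool.

```python
# Weights taken from the "Scoring Factors" list; they add up to 100%.
WEIGHTS = {
    "character_diversity": 0.15,
    "customization": 0.15,
    "conversation": 0.20,
    "image_quality": 0.20,
    "privacy": 0.10,
    "value_for_money": 0.20,
}


def overall_score(factor_scores: dict[str, float]) -> float:
    """Return the weighted average of the six factor scores (1.0 to 5.0)."""
    missing = WEIGHTS.keys() - factor_scores.keys()
    if missing:
        raise ValueError(f"missing factor scores: {sorted(missing)}")
    total = sum(WEIGHTS[name] * factor_scores[name] for name in WEIGHTS)
    # Assumption: the overall score is shown with one decimal.
    return round(total, 1)


# Hypothetical factor scores for an imaginary app, for illustration only.
example = {
    "character_diversity": 4.0,
    "customization": 4.5,
    "conversation": 5.0,
    "image_quality": 4.0,
    "privacy": 3.5,
    "value_for_money": 4.5,
}
print(overall_score(example))  # 4.3
```

Because Conversation, Image Quality, and Value for Money each carry 20%, a half-star change in one of them moves the overall score twice as much as the same change in Privacy.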

Here’s a breakdown of what each overall score stands for:

  • 4.6 to 5.0: Excellent score. The AI girlfriend app is top-notch with exceptional features.
  • 4.0 to 4.5: Good score. The app meets basic requirements but isn’t flawless.
  • 3.0 to 3.9: Bad score. The app significantly underperforms compared to similar competitors.

We rate individual aspects using a star system from 1.0 to 5.0, in 0.5 steps. The total score may include decimals.
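
For readers who want to apply the bands programmatically, here is a minimal sketch. The short labels ("top", "good", "bad") are our shorthand for the three bands described above, and the handling of scores below 3.0 is an assumption, since no band is published for that range.

```python
def score_band(overall: float) -> str:
    """Map an overall score to one of the three published bands."""
    score = round(overall, 1)  # the bands are defined to one decimal
    if not 1.0 <= score <= 5.0:
        raise ValueError("scores run from 1.0 to 5.0")
    if score >= 4.6:
        return "top"   # 4.6 to 5.0
    if score >= 4.0:
        return "good"  # 4.0 to 4.5
    if score >= 3.0:
        return "bad"   # 3.0 to 3.9
    # Assumption: no band is published below 3.0, so we say so explicitly.
    return "below published bands"


for value in (4.8, 4.5, 3.9, 2.5):
    print(value, score_band(value))
```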

Scoring Methodology

Over time, we may find ways to improve our testing process or even add or remove criteria. These changes aim to make sure our ratings give you the best advice on AI girlfriend apps.

Right now, our evaluation system is at version 1.2, designed from insights gathered through a survey of AI girlfriend app users. This survey asked users to identify the features they value most in an AI girlfriend app.

What features do you value most in an ai girlfriend app

If we make any updates to how we rate apps, we’ll also go back and update our existing reviews to match. That way, you’ll always know which version of our methods was used to rate an app, as we’ll note it at the start of every review.

Character Diversity – 15%

Our character diversity score is an objective assessment based on:

  • Variety of AI Characters Available – 3.75%
  • Availability of Anime-Styled Characters – 3.75%
  • Inclusion of Male Characters – 3.75%
  • Diversity in Ages, Ethnicities, and Personalities – 3.75%

To see how varied the personalities are, we look at the types of AI girlfriends the app offers.

If there aren’t specific types, we randomly pick some characters and read their stories to check for different personalities.

Naomi Carter personality (candy.ai)
Character Biography

Here’s the scale we use to determine character diversity scores:

  • 4.6 to 5.0: Excellent range of characters
  • 4.0 to 4.5: Good character selection
  • 3.0 to 3.9: Very minimal character diversity

Customization – 15%

Our evaluation of customization capabilities measures how much users can personalize their AI girlfriend. We explore customization options across multiple dimensions:

Identity & Physical Traits – 3.75%

  • Ethnicity: Variety of ethnic backgrounds.
  • Age: Range of age options.
  • Body Type: Diversity in body shapes.
  • Face Style: Different facial structures and looks.
  • Hair Color & Style: Assortment of hair colors and styles.
  • Breast & Butt: Options for various sizes.

Appearance & Style – 3.75%

  • Clothes: Selection of clothing styles.
  • Tattoos: Availability of tattoo designs.
  • Photo Style: Various photographic aesthetics.

Personal Preferences & Characteristics – 3.75%

  • Name Selection: Ability to choose or input names.
  • Relationship Status: Options for defining the nature of the AI relationship.
  • Occupation: Variety of career backgrounds.
  • Hobbies: Range of interests and activities.
  • Personality Type: Diversity in personality traits.
  • Voice: Options for different voice tones and accents.

Setting & Context – 3.75%

  • Environment: Choices for different backgrounds or settings.

Apps that enable users to input their own customization prompts will receive additional bonus points, reflecting the enhanced personalization capability.
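
To show how the four equally weighted groups and the bonus could combine, here is a hedged sketch. The size of the bonus (0.5) and the cap at 5.0 are assumptions made for this example; the text above only says that bonus points are awarded.

```python
import math


def customization_score(group_scores: list[float],
                        custom_prompts: bool = False,
                        bonus: float = 0.5) -> float:
    """Average four equally weighted group scores and apply an optional bonus.

    Groups: identity & physical traits, appearance & style,
    personal preferences & characteristics, setting & context.
    """
    if len(group_scores) != 4:
        raise ValueError("expected one score per customization group")
    average = sum(group_scores) / len(group_scores)
    if custom_prompts:
        average += bonus  # assumption: the bonus size is not published
    capped = min(average, 5.0)  # assumption: the bonus cannot exceed 5.0
    # Snap to the 0.5-star steps used for individual aspects (round half up).
    return math.floor(capped * 2 + 0.5) / 2


print(customization_score([4.0, 4.5, 3.5, 4.0]))                       # 4.0
print(customization_score([4.0, 4.5, 3.5, 4.0], custom_prompts=True))  # 4.5
```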

I wish you looked like section
Customization prompt from Muah AI

Here’s the scale we use to determine customization scores:

  • 4.6 to 5.0: Unparalleled customization options
  • 4.0 to 4.5: Adequate customization features
  • 3.0 to 3.9: Basic customization capabilities

Conversation – 20%

When we surveyed regular users of AI girlfriend apps, they all said that having good conversations is the most important thing for them. That’s why we pay a lot of attention to how well the apps handle chatting, (group) roleplaying, and even sexting.

We look closely at how deep and meaningful the conversations are.

Messages that are longer and use special formatting, like italics, to show feelings tend to feel more real and score better.

Example of the use of italics during roleplay

Memory – 10%

Memory plays a crucial role in assessing conversation quality in AI girlfriend apps. High-quality apps can recall details mentioned many messages ago, avoiding the common issue of forgetting important information or falling into repetitive loops.

As of March 2024, the standard memory capacity for most AI girlfriend apps is about 20 messages. This means they can remember details shared up to 20 messages earlier, which we rate as good. Apps with a larger memory capacity receive higher scores in our conversation quality evaluation.

Memory Limit (Messages) | Rating
Up to 10 | Below Average
11-20 | Good
21-30 | Very Good
31+ | Excellent
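
The memory table translates directly into a small lookup. This is only a sketch of the table above; the function name is our own.

```python
def memory_rating(memory_limit: int) -> str:
    """Return the rating for a memory limit measured in messages."""
    if memory_limit < 0:
        raise ValueError("memory limit cannot be negative")
    if memory_limit <= 10:
        return "Below Average"
    if memory_limit <= 20:
        return "Good"
    if memory_limit <= 30:
        return "Very Good"
    return "Excellent"


print(memory_rating(20))  # Good
```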

Additionally, some AI girlfriend apps feature “memory injection,” allowing users to input specific facts, events, or characteristics that the AI will permanently remember.

For instance, if you tell the AI you met at a festival in Tokyo, it will retain that memory regardless of the number of messages exchanged afterward. This feature significantly enhances the conversation quality and the overall experience with the app.
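
The difference between ordinary memory and memory injection can be pictured with a toy model: recent messages sit in a rolling window, while injected facts are kept separately and never expire. We don’t know how any particular app implements this, so the class and method names below are purely hypothetical.

```python
from collections import deque


class ChatMemory:
    """Toy model: a rolling window of recent messages plus pinned facts."""

    def __init__(self, window_size: int = 20) -> None:
        # Once the window is full, the oldest message is dropped automatically.
        self.recent = deque(maxlen=window_size)
        # Injected facts live outside the window, so they are never dropped.
        self.pinned: list[str] = []

    def inject(self, fact: str) -> None:
        self.pinned.append(fact)

    def add_message(self, text: str) -> None:
        self.recent.append(text)

    def context(self) -> list[str]:
        """Everything the AI can still 'see' when writing its next reply."""
        return self.pinned + list(self.recent)


memory = ChatMemory(window_size=20)
memory.inject("We met at a festival in Tokyo.")
memory.add_message("I mentioned my dog early on.")
for i in range(25):
    memory.add_message(f"message {i}")

print("We met at a festival in Tokyo." in memory.context())  # True
print("I mentioned my dog early on." in memory.context())    # False
```

In this model, the early remark about the dog falls out of the 20-message window, while the injected Tokyo fact stays available no matter how long the chat runs.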

Memory injection on muah.ai
Memory injection on GirlfriendGPT

Reply Speed – 10%

We also check how fast the AI responds, especially to longer or more complex questions. It’s these little things that make some AI girlfriend apps feel more genuine and stand out from the rest.

Reply Speed (Seconds) | Rating
1-2 | Too fast
3-5 | Good
6-8 | A bit slow
9+ | Too slow
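
The reply speed table can be sketched the same way. The table lists whole seconds, so how fractional timings (for example 2.5 seconds) and sub-second replies are treated here is our assumption.

```python
def reply_speed_rating(seconds: float) -> str:
    """Return the rating for a measured reply time in seconds."""
    if seconds <= 0:
        raise ValueError("reply time must be positive")
    # Assumption: fractional values fall into the band of the whole second
    # below them, and anything faster than 1 second also counts as too fast.
    if seconds < 3:
        return "Too fast"
    if seconds < 6:
        return "Good"
    if seconds < 9:
        return "A bit slow"
    return "Too slow"


print(reply_speed_rating(4))  # Good
```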

Finally, we check if you can generate voice messages with the AI girlfriend app and how real the voice sounds. This makes chatting feel more like talking to a real person.

Here’s the scale we use to determine conversation scores:

  • 4.6 to 5.0: Exceptionally engaging and realistic conversations
  • 4.0 to 4.5: Satisfactory conversation experience
  • 3.0 to 3.9: Basic conversational interaction

Image Quality – 20%

Some AI girlfriend apps let you create pictures while chatting, and others have a special tool for making lots of images quickly (bulk image generator). We rate these two features on their own and then combine them into an overall image quality score.

In-Chat Images – 6.67%

We look at how simple it is to make images during chat and how many different commands you can use. For instance, if an app only accepts a few specific commands, it might be tricky to create images unless you know the exact commands to use.

Commands for generating images on candy ai

Image Generator – 6.67%

Some AI girlfriend apps feature a dedicated AI image generator for creating realistic images. When evaluating this, we consider several key aspects:

  • Image Generation Speed: Ideally, it should take less than 10 seconds to create an image.
  • Ease of Use: The process should be straightforward, utilizing drop-down menus for simplicity. If text prompts are necessary, they must be clear and easy to follow.
  • Maximum Number of Images: While we don’t have a set minimum, more is generally better.

Image Accuracy – 6.67%

Accuracy is crucial for both in-chat images and the dedicated image generator. We assess this by using identical prompts with an added detail to see how long the images stay true to our request. A higher score is given for images that maintain accuracy over time.

Example images, each prompt adding one detail:

  • Wearing a skirt
  • Wearing a blue skirt
  • Wearing a blue skirt with white stripes

For each app, we’ll produce at least ten images and track any discrepancies, such as:

  • Unintended extra limbs
  • Facial features inconsistency
  • Odd-looking backgrounds

The fewer these issues occur, the better the app’s score.
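
One simple way to keep track of these discrepancies is a tally per issue type plus the share of images with at least one problem. The issue labels and the function below are hypothetical; they only illustrate the bookkeeping, not how the tally feeds into the final score.

```python
from collections import Counter

# Hypothetical labels for the three discrepancy types listed above.
KNOWN_ISSUES = {"extra_limbs", "inconsistent_face", "odd_background"}


def discrepancy_summary(images: list[set[str]]) -> dict:
    """Summarize issues found across a batch of generated images.

    Each entry in `images` is the set of issues spotted in one image
    (an empty set means the image looked fine).
    """
    if len(images) < 10:
        raise ValueError("the test calls for at least ten images per app")
    counts = Counter()
    for issues in images:
        unknown = set(issues) - KNOWN_ISSUES
        if unknown:
            raise ValueError(f"unknown issue labels: {sorted(unknown)}")
        counts.update(issues)
    flawed = sum(1 for issues in images if issues)
    return {"issue_counts": dict(counts), "flawed_share": flawed / len(images)}


# Made-up batch: eight clean images and two with problems.
sample = [set() for _ in range(8)] + [
    {"extra_limbs"},
    {"extra_limbs", "odd_background"},
]
summary = discrepancy_summary(sample)
print(summary["flawed_share"])  # 0.2
```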

Here’s the scale we use to determine image quality scores:

  • 4.6 to 5.0: Flawless image realism and clarity
  • 4.0 to 4.5: Good quality images
  • 3.0 to 3.9: Adequate image quality

Privacy – 10%

Privacy is crucial, especially for adult-themed services. AI girlfriend apps come with their own set of privacy concerns because the technology is so new and sometimes falls into legal grey areas.

We take a close look at each app’s privacy policy and terms of service to see what data they collect and if they can share your information without asking you first.

The overall privacy rating is determined by assessing two subratings:

  • Chat Monitoring: 5%
  • Billing Discretion: 5%

Chat Monitoring – 5%

A key point for us is whether the app protects your chats with end-to-end encryption. This means only you and the app can read your messages, which keeps them safe from hackers and from staff at the company behind the app. Apps that do this get extra credit for privacy.

If an app doesn’t use encryption, we dig deeper into their policies to see if they watch over your chats. If their rules don’t make it clear, we’ll directly ask the app’s team to explain their chat monitoring policy.

Response from Candy.ai to the question: “Does candy.ai monitor chats?”

Billing Discretion – 5%

Our AI experts personally purchase and test each AI girlfriend app to assess billing discretion. We check whether the billing statement uses explicit terms or a brand name that hints at the purchase of an AI girlfriend app. If the billing isn’t anonymous and gives away the nature of the purchase, we deduct points from the app’s privacy rating.
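
The billing check is done by hand on real statements, but the idea can be sketched as a simple text match. The flagged terms, the brand name "ExampleApp", and the descriptors below are all made up for this example.

```python
def descriptor_is_discreet(descriptor: str,
                           brand_name: str,
                           flagged_terms: tuple[str, ...] = ("girlfriend", "nsfw", "adult")) -> bool:
    """Return True if a billing descriptor gives away neither the brand nor the product type."""
    text = descriptor.lower()
    # Hypothetical rule: the brand name or any flagged term makes the line revealing.
    revealing = [brand_name.lower(), *flagged_terms]
    return not any(term in text for term in revealing)


# "ExampleApp" is a fictional brand used only for illustration.
print(descriptor_is_discreet("EXAMPLEAPP*PREMIUM", "ExampleApp"))  # False
print(descriptor_is_discreet("XYZ PAYMENTS LTD", "ExampleApp"))    # True
```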

Here’s the scale we use to determine privacy scores:

  • 4.6 to 5.0: Unmatched privacy and data protection
  • 4.0 to 4.5: Good privacy but room for improvement
  • 3.0 to 3.9: Basic privacy features

Value For Money – 20%

To determine the value of an AI girlfriend app, we assess its features, the test results of the other performance factors and the pricing options. We then compare these variables against competitors and the market standard (currently at $12.99/month).
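
As one small piece of that comparison, the sketch below expresses a monthly price relative to the $12.99 market standard. The function name and the example price are our own; the value score also depends on features and the other test results, which this snippet does not model.

```python
MARKET_STANDARD = 12.99  # market standard in USD per month, as stated above


def price_vs_market(monthly_price: float) -> float:
    """Return how far a monthly price sits from the market standard, in percent.

    Negative values mean the app is cheaper than the market standard.
    """
    if monthly_price < 0:
        raise ValueError("price cannot be negative")
    return round((monthly_price - MARKET_STANDARD) / MARKET_STANDARD * 100, 1)


# Hypothetical price for an imaginary app.
print(price_vs_market(9.99))  # -23.1
```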

Comparison of subscription prices for popular ai girlfriend apps

Here’s the scale we use to determine value for money scores:

  • 4.6 to 5.0: Exceptional value at every level
  • 4.0 to 4.5: Good investment
  • 3.0 to 3.9: Modest value for the investment

Mistakes and How to Fix Them

Our way of testing and giving scores is really detailed and not easy to do. Because of this, we might sometimes make mistakes, like getting a test result wrong, messing up a score, forgetting to update something, or other slip-ups.

We try really hard to avoid these errors by checking our work carefully. But mistakes can still happen.

If you notice any mistake or something that doesn’t seem right, please tell us here. We’re always ready to fix errors and clear up any confusion.

Got Ideas?

If you have any ideas on how we can improve our scoring, we’d love to hear from you. Please share your thoughts with us on our contact form.

Herman Carter