Interview Guides

Detect Flaky Tests from Runs | Dataford Interview Questions - Dataford - Ace your Interview

Detect Flaky Tests from Runs

Easy

Coding

SortingSearchingGreedy

Problem

At BuildForge, a test is considered flaky if it has both passed and failed across multiple CI runs. Given a list of test execution records, return the names of all flaky tests in lexicographical order.

Implement a function that scans the records and detects which tests are inconsistent over time.

Formal Specification

Input: records, a list of pairs [test_name, status]
- test_name is a non-empty string
- status is either "pass" or "fail"
Output: A list of unique test names that have appeared with both statuses at least once

Examples

Example 1

Input: records = [["login", "pass"], ["checkout", "fail"], ["login", "fail"], ["search", "pass"]]
Output: ["login"]

login appears once as pass and once as fail, so it is flaky.

Example 2

Input: records = [["a", "pass"], ["b", "fail"], ["a", "pass"], ["b", "fail"]]
Output: []

Neither test changes outcome, so no test is flaky.

Constraints

1 <= len(records) <= 10^5
1 <= len(test_name) <= 100
Each status is either "pass" or "fail"
Test names may appear multiple times
Output must contain unique names sorted in ascending lexicographical order

Examples

Example 1

Inputrecords = [["login", "pass"], ["checkout", "fail"], ["login", "fail"], ["search", "pass"]]Output["login"]Why`login` appears with both `pass` and `fail`, while the other tests appear with only one status.

Example 2

Inputrecords = [["a", "pass"], ["b", "fail"], ["a", "pass"], ["b", "fail"]]Output[]WhyBoth tests are consistent across all runs, so none are flaky.

Example 3

Inputrecords = [["api", "fail"], ["ui", "pass"], ["api", "pass"], ["ui", "fail"], ["api", "fail"]]Output["api", "ui"]WhyBoth `api` and `ui` have been observed with both possible outcomes.

Constraints

1 <= len(records) <= 10^5
1 <= len(test_name) <= 100
Each record is a pair [test_name, status]
status is either "pass" or "fail"
Test names may repeat many times
Return unique flaky test names sorted lexicographically

Function Signature

def find_flaky_tests(records):

Problem

Implement a function that scans the records and detects which tests are inconsistent over time.

Formal Specification

Input: records, a list of pairs [test_name, status]
- test_name is a non-empty string
- status is either "pass" or "fail"
Output: A list of unique test names that have appeared with both statuses at least once

Examples

Example 1

Input: records = [["login", "pass"], ["checkout", "fail"], ["login", "fail"], ["search", "pass"]]
Output: ["login"]

login appears once as pass and once as fail, so it is flaky.

Example 2

Input: records = [["a", "pass"], ["b", "fail"], ["a", "pass"], ["b", "fail"]]
Output: []

Neither test changes outcome, so no test is flaky.

Constraints

1 <= len(records) <= 10^5
1 <= len(test_name) <= 100
Each status is either "pass" or "fail"
Test names may appear multiple times
Output must contain unique names sorted in ascending lexicographical order

Examples

Example 1

Example 2

Inputrecords = [["a", "pass"], ["b", "fail"], ["a", "pass"], ["b", "fail"]]Output[]WhyBoth tests are consistent across all runs, so none are flaky.

Example 3

Inputrecords = [["api", "fail"], ["ui", "pass"], ["api", "pass"], ["ui", "fail"], ["api", "fail"]]Output["api", "ui"]WhyBoth `api` and `ui` have been observed with both possible outcomes.

Constraints

1 <= len(records) <= 10^5
1 <= len(test_name) <= 100
Each record is a pair [test_name, status]
status is either "pass" or "fail"
Test names may repeat many times
Return unique flaky test names sorted lexicographically

Function Signature

def find_flaky_tests(records):

Practice Python

Python 3.10

Open on desktop for the full Python editor with syntax highlighting and autocomplete.

Up next

Summarize Test Run ResultsEasy

Next question

Detect Flaky Tests from Runs

Easy

Coding

SortingSearchingGreedy

Problem

Implement a function that scans the records and detects which tests are inconsistent over time.

Formal Specification

Input: records, a list of pairs [test_name, status]
- test_name is a non-empty string
- status is either "pass" or "fail"
Output: A list of unique test names that have appeared with both statuses at least once

Examples

Example 1

Input: records = [["login", "pass"], ["checkout", "fail"], ["login", "fail"], ["search", "pass"]]
Output: ["login"]

login appears once as pass and once as fail, so it is flaky.

Example 2

Input: records = [["a", "pass"], ["b", "fail"], ["a", "pass"], ["b", "fail"]]
Output: []

Neither test changes outcome, so no test is flaky.

Constraints

1 <= len(records) <= 10^5
1 <= len(test_name) <= 100
Each status is either "pass" or "fail"
Test names may appear multiple times
Output must contain unique names sorted in ascending lexicographical order

Examples

Example 1

Example 2

Inputrecords = [["a", "pass"], ["b", "fail"], ["a", "pass"], ["b", "fail"]]Output[]WhyBoth tests are consistent across all runs, so none are flaky.

Example 3

Inputrecords = [["api", "fail"], ["ui", "pass"], ["api", "pass"], ["ui", "fail"], ["api", "fail"]]Output["api", "ui"]WhyBoth `api` and `ui` have been observed with both possible outcomes.

Constraints

1 <= len(records) <= 10^5
1 <= len(test_name) <= 100
Each record is a pair [test_name, status]
status is either "pass" or "fail"
Test names may repeat many times
Return unique flaky test names sorted lexicographically

Function Signature

def find_flaky_tests(records):

Problem

Implement a function that scans the records and detects which tests are inconsistent over time.

Formal Specification

Input: records, a list of pairs [test_name, status]
- test_name is a non-empty string
- status is either "pass" or "fail"
Output: A list of unique test names that have appeared with both statuses at least once

Examples

Example 1

Input: records = [["login", "pass"], ["checkout", "fail"], ["login", "fail"], ["search", "pass"]]
Output: ["login"]

login appears once as pass and once as fail, so it is flaky.

Example 2

Input: records = [["a", "pass"], ["b", "fail"], ["a", "pass"], ["b", "fail"]]
Output: []

Neither test changes outcome, so no test is flaky.

Constraints

1 <= len(records) <= 10^5
1 <= len(test_name) <= 100
Each status is either "pass" or "fail"
Test names may appear multiple times
Output must contain unique names sorted in ascending lexicographical order

Examples

Example 1

Example 2

Inputrecords = [["a", "pass"], ["b", "fail"], ["a", "pass"], ["b", "fail"]]Output[]WhyBoth tests are consistent across all runs, so none are flaky.

Example 3

Inputrecords = [["api", "fail"], ["ui", "pass"], ["api", "pass"], ["ui", "fail"], ["api", "fail"]]Output["api", "ui"]WhyBoth `api` and `ui` have been observed with both possible outcomes.

Constraints

1 <= len(records) <= 10^5
1 <= len(test_name) <= 100
Each record is a pair [test_name, status]
status is either "pass" or "fail"
Test names may repeat many times
Return unique flaky test names sorted lexicographically

Function Signature

def find_flaky_tests(records):

Practice Python

Python 3.10

Open on desktop for the full Python editor with syntax highlighting and autocomplete.

Up next

Summarize Test Run ResultsEasy

Next question