
Cache tests #124

Open
slavingia opened this issue Dec 20, 2024 · 6 comments

Comments

@slavingia
Contributor

slavingia commented Dec 20, 2024

A way to make tests faster.

Via snapshot images? Via URLs?

Bounty: $1,000 for each PR that improves the speed of a Shortest test.

@bojl

bojl commented Dec 24, 2024

One way of speeding up tests is to maintain a cache that maps the test input (prompt, site) to the steps the AI decides to take (mouse move, click, type, button press, etc.). That way, during the next identical test run you can just have Playwright execute the steps without calling the LLM until the test is complete (to verify that the test was successful). You might also be able to store Jest verification steps along the way (like validating that divs exist) to avoid the need to call the LLM at all.

This is useful for testing regressions, but obviously not as useful when the page you're testing has changed since the last test run.

Another thing you can do is allow the LLM to suggest multiple actions in a single response, rather than one action per prompt.
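A minimal sketch of the cache described above, keyed on the test input (prompt + site) and mapping to the recorded action sequence. The `ActionCache` name and the action shapes are illustrative assumptions, not Shortest's actual API:

```typescript
// Hypothetical action types an LLM-driven test might record.
type CachedAction =
  | { kind: "click"; selector: string }
  | { kind: "type"; selector: string; text: string }
  | { kind: "press"; key: string };

// Cache keyed on everything that determines the LLM's decisions:
// the prompt and the site under test.
class ActionCache {
  private store = new Map<string, CachedAction[]>();

  private key(prompt: string, site: string): string {
    return JSON.stringify([prompt, site]);
  }

  get(prompt: string, site: string): CachedAction[] | undefined {
    return this.store.get(this.key(prompt, site));
  }

  set(prompt: string, site: string, actions: CachedAction[]): void {
    this.store.set(this.key(prompt, site), actions);
  }
}
```

On a cache hit, the recorded actions would be handed straight to Playwright; on a miss (or after a failed replay), the LLM-driven path runs and repopulates the entry.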

@bojl

bojl commented Dec 24, 2024

Ah, I've been beaten to the punch, at least on the first suggestion: #134

@slavingia
Contributor Author

Yep, that's probably the right approach. And rerun a step (or the whole test) upon failure, in case of a page change.

@gladyshcodes

@slavingia Hi there! Here is how I see caching being implemented; what do you think?

Essentially, we could keep track of every step made, along with the LLM responses, in a cache. I believe it's better to do it per test rather than per test file. On the initial run, each step will be stored in the cache. On subsequent test runs, we would replay those instructions. If any of them fail, or the UI has changed, we will fall back to normal test execution and call the LLM, rebuilding the cache on the fly. I am not sure about the performance gain and effectiveness of such an approach yet; I just wanted to gather some feedback and thoughts :)
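The replay-with-fallback flow described above could be sketched roughly as follows. `replayStep` and `runWithLlm` are hypothetical stand-ins for Playwright replay of a cached step and the normal LLM-driven path; neither name comes from the project:

```typescript
type Step = string; // a recorded instruction, e.g. a serialized Playwright action

// Try to replay cached steps; on any failure (e.g. the UI changed),
// fall back to normal LLM-driven execution and return the fresh steps
// so the caller can rebuild the cache entry.
async function runTest(
  cached: Step[] | undefined,
  replayStep: (s: Step) => Promise<boolean>,
  runWithLlm: () => Promise<Step[]>,
): Promise<Step[]> {
  if (cached) {
    let allOk = true;
    for (const step of cached) {
      if (!(await replayStep(step))) {
        allOk = false; // a step failed: the page likely changed
        break;
      }
    }
    if (allOk) return cached; // fast path: no LLM calls needed
  }
  // Cache miss or failed replay: run normally and rebuild the cache.
  return runWithLlm();
}
```

The key property is that the LLM is only consulted on the slow path, so an unchanged page costs zero LLM calls after the first run.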

[Screenshot attachment: 2024-12-26 at 15 42 35]

@slavingia
Contributor Author

Overall flow makes sense! Should make most tests very fast, with computer use only needed in the case of failure, where certain things changed.

@gladyshcodes

gladyshcodes commented Dec 26, 2024

I will make a PR implementing this functionality soon 👍
