-
Notifications
You must be signed in to change notification settings - Fork 1.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
flaky CI tests (maybe term.reset related) #5184
Comments
Some digging on
Is it a microtask?
Is it a macrotask?
So yepp, we have here the infamous nextTick-issue, that many nodejs devs should know, but with a pending task on the browser's macrotask queue. So why is this a problem during test execution? await beforeEach();
await test1();
await beforeEach();
await test2();
... So export async function timeout(ms: number): Promise<void> {
return new Promise<void>(r => setTimeout(r, ms));
} Solution: Second best solution is to fix all playwright tests with a |
So the issue is in fact a bit more complicated. I added this test snippet to an addon test: test.describe.only('buggy', async () => {
const logs: string[] = [];
const MAX = 19;
new Promise<string[]>(r => {
for (let i = 0; i <= MAX; ++i) {
test(''+i, async () => {
const content: string = await ctx.page.evaluate(`document.querySelector('.xterm-rows > div').innerHTML`);
logs.push(content.slice(0, 50));
if (i === MAX) r(logs);
});
}
}).then(logs => console.log(logs));
}); and run the test with $> yarn test-integration --workers=50% --suite=addon-image If resetting is perfectly sync, i'd expect this result for every browser: [
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>'
] which means, the DOM repr of the terminal buffer contains only one cell for the cursor. I actually get this from Chromium (and to a much lesser degree from Webkit, Firefox seems fine locally): [
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '<span> </span>',
'<span> </span>', '',
'<span> </span>', '',
'<span> </span>', '<span> </span>',
'<span> </span>', '',
'<span> </span>', '<span> </span>',
'', '<span> </span>',
'<span> </span>', ''
] So the reset is sometimes not finished on certain browsers. Turns out, that not test.beforeEach(async () => {
await ctx.page.evaluate(`
window.__f = async () => {
window.term.reset();
return new Promise(r => requestAnimationFrame(r));
}
window.__f();
`);
}); This now reliably fixes the reset handling between tests, even under high load (tested locally up to a load of 20). |
The CI sometimes produces test failures for no obvious reasons. Those failures are not reproducible locally and seem to be more likely, when the CI machine is under heavy load.
Example:
Thats from weblinks tests, where all neighboring tests take a moderate time of ~130ms, while one test runs into a timeout. Those tests actually poll for a certain change to happen on the DOM, but for some reason that change never happened.
Our playwright test modules run all in the same page on the same terminal instance, but separate tests by a
to reset the terminal and the tested addon to initial state. This raised my suspicion, whether there might be something off with the reset handling here. This is further backed by the fact, that introducing a wait after
term.reset
solves the issue:The text was updated successfully, but these errors were encountered: