Only declare lifx update failure after 3 attempts #90872

bdraco · 2023-04-05T18:51:39Z

Proposed change

These devices sometimes flakey and generate a lot of noise from drop outs since communication is UDP best-effort. We should only mark them unavailable if its not a momentary blip

fixes #78876

Type of change

Dependency upgrade
Bugfix (non-breaking change which fixes an issue)
New integration (thank you!)
New feature (which adds functionality to an existing integration)
Deprecation (breaking change to happen in the future)
Breaking change (fix/feature causing existing functionality to break)
Code quality improvements to existing code or addition of tests

Additional information

This PR fixes or closes issue: fixes #
This PR is related to issue:
Link to documentation pull request:

Checklist

The code change is tested and works locally.
Local tests pass. Your PR cannot be merged unless tests pass
There is no commented out code in this PR.
I have followed the development checklist
I have followed the perfect PR recommendations
The code has been formatted using Black (black --fast homeassistant tests)
Tests have been added to verify that the new code works.

If user exposed functionality or configuration variables are added/changed:

Documentation added/updated for www.home-assistant.io

If the code communicates with devices, web services, or third-party tools:

The manifest file has all fields filled out correctly.
Updated and included derived files by running: python3 -m script.hassfest.
New or updated dependencies have been added to requirements_all.txt.
Updated by running python3 -m script.gen_requirements_all.
For the updated dependencies - a link to the changelog, or at minimum a diff between library versions is added to the PR description.
Untested files have been added to .coveragerc.

To help with the load of incoming pull requests:

I have reviewed two other open pull requests in this repository.

These devices sometimes flakey and generate a lot of noise from drop outs since communication is UDP best-effort. We should only mark them unavailable if its not a momentary blip fixes #78876

home-assistant · 2023-04-05T18:51:46Z

Hey there @Djelibeybi, mind taking a look at this pull request as it has been labeled with an integration (lifx) you are listed as a code owner for? Thanks!

Code owner commands

Code owners of lifx can trigger bot actions by commenting:

@home-assistant close Closes the pull request.
@home-assistant rename Awesome new title Renames the pull request.
@home-assistant reopen Reopen the pull request.
@home-assistant unassign lifx Removes the current integration label and assignees on the pull request, add the integration domain after the command.

Djelibeybi · 2023-04-05T19:26:20Z

homeassistant/components/lifx/coordinator.py

@@ -189,41 +192,54 @@ def async_get_entity_id(self, platform: Platform, key: str) -> str | None:
 async def _async_update_data(self) -> None:
 """Fetch all device data from the api."""
 async with self.lock:


I would remove this lock, btw. I added it in the previous integration's incarnation to prevent multiple discoveries from firing at the same time, but that's no longer an issue (or possible).

For reference, Photons uses a Semaphore instead of a lock to limit the number of packets "in-flight", i.e. awaiting a response, but that's not bulb specific. Photons is essentially just a very fancy way of getting the least amount of packets on the network to do the most amount of work across the largest number of devices.

Djelibeybi · 2023-04-05T19:27:09Z

homeassistant/components/lifx/coordinator.py

+ # device.mac_addr is not the mac_address, its the serial number
+ if self.device.mac_addr == TARGET_ANY:
+ self.device.mac_addr = response.target_addr


Aside: we should move this into the callback of async_execute_lifx so that it happens earlier.

Djelibeybi · 2023-04-06T01:07:39Z

The code as written never gets passed a single attempt before the update coordinator marks the update as failed (for me). I refactored it like so: lifx_raise_fallback...Djelibeybi:home-assistant-core:lifx_raise_fallback (which is NOT production ready because I'm using warning logs for debugging purposes).

This refactor has not resulted in a single timeout at all. Not one in over 15 minutes of running. That's unprecedented with my fleet. Tests need to be updated, but they still all pass too.

bdraco · 2023-04-06T01:12:43Z

The code as written never gets passed a single attempt before the update coordinator marks the update as failed (for me). I refactored it like so: lifx_raise_fallback...Djelibeybi:home-assistant-core:lifx_raise_fallback (which is NOT production ready because I'm using warning logs for debugging purposes).

This refactor has not resulted in a single timeout at all. Not one in over 15 minutes of running. That's unprecedented with my fleet. Tests need to be updated, but they still all pass too.

I'm happy to close this PR if you want to open another one. I'm working on release issues so it will be a while before I get back to this.

Djelibeybi · 2023-04-06T01:14:03Z

Sure, that works. I'll fix up the tests shortly.

Djelibeybi · 2023-04-06T02:08:35Z

I've just opened #90891 as a replacement for this one.

bdraco · 2023-04-06T03:26:24Z

closing in favor of #90891

Only declare lifx update failure after 3 attempts

2f881c7

These devices sometimes flakey and generate a lot of noise from drop outs since communication is UDP best-effort. We should only mark them unavailable if its not a momentary blip fixes #78876

home-assistant bot added cla-signed integration: lifx by-code-owner Quality Scale: platinum labels Apr 5, 2023

bdraco mentioned this pull request Apr 5, 2023

Lifx integration with many devices frequently goes unavailable #78876

Closed

Djelibeybi reviewed Apr 5, 2023

View reviewed changes

Djelibeybi mentioned this pull request Apr 6, 2023

Make LIFX update handle transient communication failures #90891

Closed

20 tasks

bdraco closed this Apr 6, 2023

github-actions bot locked and limited conversation to collaborators Apr 7, 2023

bdraco deleted the lifx_raise_fallback branch May 25, 2023 13:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only declare lifx update failure after 3 attempts #90872

Only declare lifx update failure after 3 attempts #90872

bdraco commented Apr 5, 2023

home-assistant bot commented Apr 5, 2023

Djelibeybi Apr 5, 2023

Djelibeybi Apr 5, 2023

Djelibeybi commented Apr 6, 2023

bdraco commented Apr 6, 2023

Djelibeybi commented Apr 6, 2023

Djelibeybi commented Apr 6, 2023

bdraco commented Apr 6, 2023

Only declare lifx update failure after 3 attempts #90872

Only declare lifx update failure after 3 attempts #90872

Conversation

bdraco commented Apr 5, 2023

Proposed change

Type of change

Additional information

Checklist

home-assistant bot commented Apr 5, 2023

Djelibeybi Apr 5, 2023

Choose a reason for hiding this comment

Djelibeybi Apr 5, 2023

Choose a reason for hiding this comment

Djelibeybi commented Apr 6, 2023

bdraco commented Apr 6, 2023

Djelibeybi commented Apr 6, 2023

Djelibeybi commented Apr 6, 2023

bdraco commented Apr 6, 2023