add backoff, use direct requests instead of inference client 875e2f3 zulissimeta commited on 8 days ago