Just as in a8de9f0, lack of equals() causes cluster_resolver to consider every update a different configuration and restart itself.
Having to handle NaN should really be prevented with validation, but it
looks like that would lead to yak shaving at the moment.
b/435208946
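For illustration, the kind of value equality involved, as a minimal sketch; the class and fields below are hypothetical, not the actual gRPC config type:
```java
import java.util.Objects;

// Hypothetical config class, sketching the equals()/hashCode() that keeps
// cluster_resolver from treating identical updates as new configurations.
final class ExampleLbConfig {
  private final long ringSize;
  private final double utilizationPenalty;  // a double field is where NaN can show up

  ExampleLbConfig(long ringSize, double utilizationPenalty) {
    this.ringSize = ringSize;
    this.utilizationPenalty = utilizationPenalty;
  }

  @Override
  public boolean equals(Object o) {
    if (!(o instanceof ExampleLbConfig)) {
      return false;
    }
    ExampleLbConfig that = (ExampleLbConfig) o;
    // Double.compare() treats NaN as equal to itself, unlike ==, so a config
    // carrying NaN still compares equal to an identical update.
    return ringSize == that.ringSize
        && Double.compare(utilizationPenalty, that.utilizationPenalty) == 0;
  }

  @Override
  public int hashCode() {
    return Objects.hash(ringSize, utilizationPenalty);
  }
}
```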
Since c4256add4 we no longer fabricate a TRANSIENT_FAILURE update from
children. However, previously that would have set
seenReadyOrIdleSinceTransientFailure = false and prevented future timer
creation. If an LB policy gives extraneous updates with state CONNECTING,
then it was possible to re-create failOverTimer which would then wait
the 10 seconds for the child to finish CONNECTING. We only want to give
the child one opportunity after transitioning out of READY/IDLE.
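Roughly, the one-shot behavior looks like the following sketch; it is not the actual PriorityLb code, and scheduleFailOverTimer() is a stand-in:
```java
import io.grpc.ConnectivityState;
import java.util.concurrent.ScheduledFuture;

// Sketch only, not the actual PriorityLb code; scheduleFailOverTimer() is a
// stand-in for arming the 10 second failover timer.
final class FailOverGuardSketch {
  private boolean seenReadyOrIdleSinceTransientFailure;
  private ScheduledFuture<?> failOverTimer;

  void handleChildState(ConnectivityState newState) {
    if (newState == ConnectivityState.READY || newState == ConnectivityState.IDLE) {
      seenReadyOrIdleSinceTransientFailure = true;
    } else if (newState == ConnectivityState.CONNECTING
        && seenReadyOrIdleSinceTransientFailure
        && failOverTimer == null) {
      // Consume the flag: the child gets exactly one failover window after
      // leaving READY/IDLE, and repeated CONNECTING updates cannot re-arm it.
      seenReadyOrIdleSinceTransientFailure = false;
      failOverTimer = scheduleFailOverTimer();
    }
  }

  private ScheduledFuture<?> scheduleFailOverTimer() {
    throw new UnsupportedOperationException("stand-in");
  }
}
```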
https://github.com/grpc/proposal/pull/509
Http2RstCounterEncoder has to be constructed before
NettyServerHandler/Http2ConnectionHandler so it must be static. Thus the
code/counters were moved into RstStreamCounter, which can then be
constructed earlier and shared.
This depends on Netty 4.1.124 for a bug fix to actually call the
encoder:
be53dc3c9a
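A minimal sketch of the shared-counter shape; only the names RstStreamCounter and Http2RstCounterEncoder come from this change, the rest is illustrative:
```java
import java.util.concurrent.atomic.AtomicLong;

// Illustrative: a plain counter object that can exist before the handlers do.
final class RstStreamCounter {
  private final AtomicLong rstFramesReceived = new AtomicLong();

  void incrementRstReceived() {
    rstFramesReceived.incrementAndGet();
  }

  long rstReceived() {
    return rstFramesReceived.get();
  }
}
```
Because the counter is independent state rather than a field of the handler, it can be created first, handed to Http2RstCounterEncoder, and later handed to NettyServerHandler when that is constructed.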
This implicitly disables NettyAdaptiveCumulator (#11284), which can have a
performance impact. We delayed upgrading Netty to give time to rework
the optimization, but we've gone too long already without upgrading
which causes problems for vulnerability tracking.
Notably, protobuf to 3.25.8, opentelemetry to 1.52.0. Protobuf in Bazel
has 25.5 in the BCR and it seems better to align the WORKSPACE
with that version. But we can't actually use 25.5 in BCR because it is
incompatible with Bazel 7.
This allows a server with access to PeerUid to check additional application-layer security policy *after* the call itself is authorized by the transport layer. Cross-cutting application-layer checks could be done from a ServerInterceptor (RPC method level policy, say). Checks based on the substance of a request message could be done by the individual RPC method implementations themselves.
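As a rough sketch of such a cross-cutting check (how the PeerUid is obtained is application-specific; lookupPeerUid() and allowedUids below are hypothetical, not grpc-binder API):
```java
import java.util.Set;
import io.grpc.Metadata;
import io.grpc.ServerCall;
import io.grpc.ServerCallHandler;
import io.grpc.ServerInterceptor;
import io.grpc.Status;

// Hypothetical interceptor: the transport layer has already authorized the
// call; this adds an RPC-method-level, application-layer policy on top.
final class UidPolicyInterceptor implements ServerInterceptor {
  private final Set<Integer> allowedUids;

  UidPolicyInterceptor(Set<Integer> allowedUids) {
    this.allowedUids = allowedUids;
  }

  @Override
  public <ReqT, RespT> ServerCall.Listener<ReqT> interceptCall(
      ServerCall<ReqT, RespT> call, Metadata headers, ServerCallHandler<ReqT, RespT> next) {
    int uid = lookupPeerUid(call);  // hypothetical: however the app exposes PeerUid
    if (!allowedUids.contains(uid)) {
      call.close(
          Status.PERMISSION_DENIED.withDescription("uid not allowed: " + uid), new Metadata());
      return new ServerCall.Listener<ReqT>() {};
    }
    return next.startCall(call, headers);
  }

  private static int lookupPeerUid(ServerCall<?, ?> call) {
    throw new UnsupportedOperationException("application-specific");
  }
}
```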
Instead of representing an aggregate cluster as a single cluster whose
priorities come from different underlying clusters, represent an aggregate cluster as an instance of a priority LB policy where each child is a cds LB policy for the underlying
cluster.
Avoiding so many deps will allow us to upgrade the protos without being
forced to upgrade to protobuf-java 4.x. It also removes the remaining
non-bzlmod dependencies.
It'd be really easy to get this wrong, so we do two things: 1) mirror the
gradle configuration as much as possible, as that sees a lot of testing,
and 2) run the fake control plane with the _results_ of jarjar. There are
lots of classes that we could mess up, but that at least kicks the tires.
XdsTestUtils.buildRouteConfiguration() was moved to ControlPlaneRule to
stop the unnecessary circular dependency between the classes and to
avoid the many dependencies of XdsTestUtils.
I'm totally hacking java_grpc_library to improve the dependency
situation. Long-term, I think we will stop building Java libraries with
Bazel and require users to rely entirely on Maven Central. That seems to
be the direction Bazel is going and it will greatly simplify the
problems we've seen with protobuf having a single repository for many
languages. So while the hack isn't too bad, I hope we won't have to live
with it long-term.
The resource subscription to the fallback target was done only at the time of falling back, which can cause RPCs to fail. This change makes the fallback target be subscribed and cached earlier, similar to the C++ and Go gRPC implementations.
The PriorityLB predates A56. tryNextPriority() now matches
ChoosePriority() from the gRFC.
The biggest change is waiting on CONNECTING children instead of failing
after the failOverTimer fires. The failOverTimer should be used to start
lower priorities more eagerly, but shouldn't cause the overall
connectivity state to become TRANSIENT_FAILURE on its own. The prior
behavior of creating the "Connection timeout for priority" failing
picker was particularly strange, because it didn't update the child's
connectivity state. This previous behavior produced errors from the
failOverTimer with no way to diagnose what was going wrong.
b/428517222
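For orientation, a compressed sketch of the A56-style selection loop; it is illustrative only, and the helpers and child bookkeeping are stand-ins rather than the real tryNextPriority():
```java
import io.grpc.ConnectivityState;

// Illustrative selection loop: pick the first priority that is READY/IDLE or
// still within its failover window; only fall through once that window ends.
void tryNextPriority() {
  for (String priority : priorityNames) {
    ChildLbState child = children.get(priority);
    if (child == null) {
      children.put(priority, child = createChild(priority));
      updateOverallState(ConnectivityState.CONNECTING, bufferPicker());
      return;
    }
    if (child.state == ConnectivityState.READY || child.state == ConnectivityState.IDLE) {
      updateOverallState(child.state, child.picker);
      return;
    }
    if (child.failOverTimerPending()) {
      // Wait on the CONNECTING child instead of reporting TRANSIENT_FAILURE;
      // the timer only controls when lower priorities are started eagerly.
      updateOverallState(ConnectivityState.CONNECTING, bufferPicker());
      return;
    }
  }
  // Every priority has had its chance: surface the lowest priority's state.
  reportLowestPriorityState();
}
```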
The main reason I made a change here was to fix the tense from the
deadline "will be exceeded in" to "was exceeded after". But we really
don't want to be doing the string formatting unless the deadline is
actually exceeded. There were a few more changes to make some variables
effectively final.
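A minimal sketch of the ordering (not the actual grpc-java code): check expiration first, and only then pay for the formatting.
```java
import java.util.concurrent.TimeUnit;
import io.grpc.Deadline;
import io.grpc.Status;

// Sketch: build the "was exceeded after" description only on the exceeded
// path, so non-expired deadlines never trigger string formatting.
static Status checkDeadline(Deadline deadline) {
  if (deadline == null || !deadline.isExpired()) {
    return Status.OK;
  }
  long overrunNanos = -deadline.timeRemaining(TimeUnit.NANOSECONDS);
  return Status.DEADLINE_EXCEEDED.withDescription(
      "deadline was exceeded after " + overrunNanos + "ns");
}
```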
Fix HashSet / HashMap initializations to allocate sufficient capacity for the number of keys to be inserted; without that, inserting the keys would always trigger a rehash / resize.
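A sketch of the sizing rule with the default 0.75 load factor (not the exact call sites changed):
```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Pre-size so that inserting keys.size() entries stays under the default
// 0.75 load factor and never forces a rehash/resize.
static <K, V> Map<K, V> newMapSizedFor(List<K> keys) {
  int initialCapacity = (int) Math.ceil(keys.size() / 0.75);
  Map<K, V> map = new HashMap<>(initialCapacity);
  // ... insert one entry per key ...
  return map;
}
```
With Guava on the classpath, Maps.newHashMapWithExpectedSize(n) and Sets.newHashSetWithExpectedSize(n) express the same intent.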
In #12185, RPCs were randomly hanging. In #12207 this was tracked down
to the headers promise completing successfully, but the netty stream
was null. This was because the headers write hadn't completed but
stream.close() had been called by goingAway().
In observed cases, whether RST_STREAM or another failure from netty or
the server, listeners can fail to be notified when a connection yields a
null stream for the selected streamId. This causes hangs in clients,
despite deadlines, with no obvious resolution.
Tests which relied upon this promise succeeding must now change.
LoadBalancers shouldn't be called after shutdown(), but RingHashLb could
have enqueued work to the SynchronizationContext that executed after
shutdown(). This commit fixes problems discovered when auditing all LBs
usage of the syncContext for that type of problem.
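The recurring defensive pattern is the same across the audited LBs; a rough sketch, not the actual RingHashLb code:
```java
import io.grpc.LoadBalancer.Subchannel;
import io.grpc.SynchronizationContext;

// Sketch of the pattern: shutdown() can run between execute() and the task
// actually draining, so enqueued work re-checks the flag before acting.
final class SyncContextGuardSketch {
  private final SynchronizationContext syncContext;
  private boolean shutdown;

  SyncContextGuardSketch(SynchronizationContext syncContext) {
    this.syncContext = syncContext;
  }

  void requestConnectionLater(final Subchannel subchannel) {
    syncContext.execute(new Runnable() {
      @Override
      public void run() {
        if (shutdown) {
          return;  // the LB was shut down before this task ran; drop the work
        }
        subchannel.requestConnection();
      }
    });
  }

  void shutdown() {
    shutdown = true;  // runs in the syncContext, so the check above is safe
  }
}
```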
Similarly, PickFirstLb could have requested a new connection after
shutdown(). We want to avoid that sort of thing too.
RingHashLb's test changed from CONNECTING to TRANSIENT_FAILURE to get
the latest picker. Because two subchannels have failed it will be in
TRANSIENT_FAILURE. Previously the test was using an older picker with
out-of-date subchannelView, and the verifyConnection() was too imprecise
to notice it was creating the wrong subchannel.
As discovered in b/430347751, where ClusterImplLb saw a new subchannel
being created after the child LB was shut down (the shutdown itself had
been caused by RingHashConfig not implementing equals(), which caused
ClusterResolverLb to replace its state; that was fixed by a8de9f07ab):
```
java.lang.NullPointerException
at io.grpc.xds.ClusterImplLoadBalancer$ClusterImplLbHelper.createClusterLocalityFromAttributes(ClusterImplLoadBalancer.java:322)
at io.grpc.xds.ClusterImplLoadBalancer$ClusterImplLbHelper.createSubchannel(ClusterImplLoadBalancer.java:236)
at io.grpc.util.ForwardingLoadBalancerHelper.createSubchannel(ForwardingLoadBalancerHelper.java:47)
at io.grpc.util.ForwardingLoadBalancerHelper.createSubchannel(ForwardingLoadBalancerHelper.java:47)
at io.grpc.internal.PickFirstLeafLoadBalancer.createNewSubchannel(PickFirstLeafLoadBalancer.java:527)
at io.grpc.internal.PickFirstLeafLoadBalancer.requestConnection(PickFirstLeafLoadBalancer.java:459)
at io.grpc.internal.PickFirstLeafLoadBalancer.acceptResolvedAddresses(PickFirstLeafLoadBalancer.java:174)
at io.grpc.xds.LazyLoadBalancer$LazyDelegate.activate(LazyLoadBalancer.java:64)
at io.grpc.xds.LazyLoadBalancer$LazyDelegate.requestConnection(LazyLoadBalancer.java:97)
at io.grpc.util.ForwardingLoadBalancer.requestConnection(ForwardingLoadBalancer.java:61)
at io.grpc.xds.RingHashLoadBalancer$RingHashPicker.lambda$pickSubchannel$0(RingHashLoadBalancer.java:440)
at io.grpc.SynchronizationContext.drain(SynchronizationContext.java:96)
at io.grpc.SynchronizationContext.execute(SynchronizationContext.java:128)
at io.grpc.xds.client.XdsClientImpl$ResourceSubscriber.onData(XdsClientImpl.java:817)
```
grpc-binder clients authorize servers by checking the UID of the sender of the SETUP_TRANSPORT Binder transaction against some SecurityPolicy. But merely binding to an unauthorized server to learn its UID can enable "keep-alive" and "background activity launch" abuse, even if security policy ultimately decides the connection is unauthorized. Pre-authorization mitigates this kind of abuse by looking up and authorizing a candidate server Application's UID before binding to it. Pre-auth is especially important when the server's address is not fixed in advance but discovered by PackageManager lookup.
PROTOCOL-HTTP2.md specifies "TimeoutValue → {positive integer as ASCII
string of at most 8 digits}". Zero is not positive, so it should be
avoided. So make sure timeouts are at least 1 nanosecond instead of 0
nanoseconds.
grpc-go recently began disallowing zero timeouts in
https://github.com/grpc/grpc-go/pull/8290 which caused a regression as
grpc-java can generate such timeouts. Apparently no gRPC implementation
had previously been checking for zero timeouts.
Instead of changing the max(0) to max(1) everywhere, just move the max
handling into TimeoutMarshaller, since every caller of TIMEOUT_KEY was
doing the same max() handling.
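A condensed sketch of what the centralized clamp looks like; unit selection is simplified relative to the real n/u/m/S/M/H marshaller:
```java
import java.util.concurrent.TimeUnit;

// Illustrative only: clamp once, here, so "0n" can never be serialized and
// callers no longer need their own max() handling.
static String toAsciiTimeout(long timeoutNanos) {
  long nanos = Math.max(timeoutNanos, 1);
  if (nanos < 100_000_000L) {
    return nanos + "n";
  } else if (nanos < 100_000_000_000L) {
    return TimeUnit.NANOSECONDS.toMicros(nanos) + "u";
  } else {
    return TimeUnit.NANOSECONDS.toMillis(nanos) + "m";
  }
}
```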
Before fd8fd517d (in 2016!), grpc-java actually behaved correctly, as it
failed RPCs with timeouts "<= 0". The commit changed the handling to the
max(0) handling we see now.
b/427338711
297ab05ef converted CDS to XdsDependencyManager. This caused three
regressions:
* CdsLB2 as an RLS child would always fail with "Unable to find
non-dynamic root cluster" because is_dynamic=true was missing in
its service config
* XdsNameResolver only propagated resolution updates when the clusters
changed, so a CdsUpdate change would be ignored. This caused a hang
for RLS even with is_dynamic=true. For non-RLS, the lack of a config
update broke the circuit breaking psm interop test. This would have
been more severe if ClusterResolverLb had been converted to
XdsDependencyManager, as it would have ignored EDS updates
* RLS did not propagate resolution updates, so even with is_dynamic=true
the CdsUpdate for the new cluster would never arrive at CdsLB2,
causing a hang
b/428120265
b/427912384
The @SystemApi runtime visibility requirement isn't really new. It has always been implicit in the required INTERACT_ACROSS_USERS permission, which (in production) can only be held by system apps.
The SDK_INT >= 30 requirement was also always present, via @RequiresApi() on BinderChannelBuilder#bindAsUser. This change just updates its replacement APIs (AndroidComponentAddress and TARGET_ANDROID_USER) to require it too.
The previous code did a ping-pong to make sure the transport had enough
time to process, but then proceeded to sleep 5 seconds. That sleep would
have been needed without the ping-pong, but with the ping-pong we are
confident all events have been drained from the transport. Deleting the
unnecessary sleeps saves 10 seconds, for each of the 9 instances of this
test.
ClusterResolverLb is still doing DNS itself, so disable it in XdsDepMan
until that migration has finished. EDS is fine in XdsDepMan, because
XdsClient will share the result with ClusterResolverLb.