Bot releases are visible (Hide)
arrow-14.0.2
(#15108) @galipremsagarpandas-2.2
in cudf
(#15100) @galipremsagarfuture_stack
to DataFrame.stack
(#15015) @galipremsagarpandas-2.x
support in cudf
(#14916) @galipremsagardtype='category'
(#15476) @galipremsagarFalse
(#15340) @rjzamorainflate_kernel
(#15309) @vuulehostdevice_vector
and add more APIs (#15252) @ttnghiacollect_list
/collect_set
of lists column (#15243) @ttnghia.columns
by an external API (#15212) @galipremsagar__dask_tokenize__
(#15187) @rjzamoraListColumn.to_pandas()
to retain list
type (#15155) @galipremsagarconst
from range_window_bounds::_extent
. (#15138) @mythrocksGroupBy.apply
when chunk results are reindexed series (#15109) @brandon-b-millerlarge_string
in cudf
(#15093) @galipremsagarsort_values
pytest failure with pandas-2.x regression (#15092) @galipremsagarget_json_object
(#15082) @SurajAralihalliis_device_write_preferred
in void_sink
and user_sink_wrapper
(#15064) @vuuleIndex.difference
to handle duplicate values when one of the inputs is empty (#15016) @galipremsagarfuture_stack
to DataFrame.stack
(#15015) @galipremsagarDataFrame.sort_index
to respect ignore_index
on all axis (#14995) @galipremsagarSeriesGroupBy.aggregate
to SeriesGroupBy.agg
(#14971) @rjzamoraCUDF_SPILL
after a pytest (#14958) @galipremsagardeveloper_guide.md
with new guidance on quoted internal includes (#15238) @harrismsegmented_row_bit_count
for computing row sizes by segments of rows (#15169) @ttnghiaconda env create --yes
instead of --force
(#15403) @bdicecudf.pandas
(#15316) @galipremsagarcudf.pandas
pandas tests (#15284) @galipremsagarmixed_types_as_string
option is enabled (#15236) @shrshipytest-xdist
algo to worksteal
(#15207) @galipremsagarget_upstream_resource
method to stream_checking_resource_adaptor
(#15203) @misccooffset
value for generating test data in parquet_chunked_reader_test.cu
(#15200) @ttnghia__cuda_array_interface__
tests (#15188) @bdicebyte_range
in read_json
when the size is not smaller than the input data (#15180) @vuulecudf.pandas
(#15147) @galipremsagarcudf
(#15145) @galipremsagarhostdevice_vector
in kernel_error
to avoid the pageable copy (#15140) @vuulearrow-14.0.2
(#15108) @galipremsagarpandas-2.2
in cudf
(#15100) @galipremsagardatetime
binop pytest failures in pandas-2.2 (#15090) @galipremsagarcudf::sort
(#15066) @wence-DeprecationWarning
in test_hdf (#15044) @mroeschkeDeprecationWarning
's by ArrowTable.to_pandas()
(#14989) @galipremsagarctest
is called with --no-tests=error
. (#14983) @bdiceperiods
in date_range
and interval_range
(#14976) @galipremsagarGetJsonObjectOptions
in getJSONObject
Java API (#14956) @SurajAralihallipandas-2.x
support in cudf
(#14916) @galipremsagarcuco_allocator
(#14877) @PointKernelcudf.__version__
in Sphinx build (#14872) @KyleFromNVIDIArapids_cuda_set_runtime
to determine cuda runtime usage by target (#14833) @vyasrbuild_struct|list_column
(#14786) @mroeschkePublished by rapids-bot[bot] 7 months ago
dask_cudf
is imported (#15469) @rjzamoravar
logic in dask-cudf (#15347) @rjzamorato_arrow_device
function to cudf interop using nanoarrow (#15047) @zeroshadedask_cudf
json and s3 tests with query-planning on (#15408) @rjzamoraarm
arch (#15402) @galipremsagarnumeric_only=True
for simple groupby reductions (#15326) @rjzamoraModuleAccelerator
performance: cache the result of checking if a caller is in the denylist (#15056) @shwinaPublished by raydouglass 8 months ago
shuffle_method
kwarg (#14708) @pentschevint64_t
type (#14907) @SurajAralihallishuffle
deprecation to align with dask/dask (#14762) @rjzamoraGroupby.get_group
(#14728) @rjzamorashuffle_method
kwarg (#14708) @pentschevBaseOffset
(#14678) @shwinacuda::proclaim_return_type
from nested lambda (#14607) @ttnghiacudf::clamp
APIs (#14580) @davidwendtpd.Timestamp
objects to be picklable when cudf.pandas
is active (#14474) @shwinaMultiIndex.from_pandas
(#14470) @mroeschkestring_scalar
constructors (#14444) @vuulenumba<0.58
(#14616) @brandon-b-millerpytest<8
(#14920) @galipremsagarTimedeltaIndex.__init__
constructor (#14775) @mroeschkeDatetimeIndex.__init__
constructor (#14774) @mroeschkeframe.py
typing, move seldom used methods in frame.py
(#14766) @mroeschkepynvjitlink
as a dependency (#14763) @brandon-b-millercreate_structs_data
(#14761) @SurajAralihallicudf.MultiIndex.from_arrays
(#14740) @mroeschkeio::detail::orc
namespace into io::orc::detail
(#14696) @ttnghiaFieldType
to scoped enum (#14642) @vuulefreq
in DatetimeIndex (#14593) @shwinaCompactProtocolReader
(#14582) @vuuleCUDF_TEST_EXPECT_COLUMN_*
macros (#14559) @SurajAralihalliio::statistics
(#14542) @vuuleIndexedFrame.duplicated
with distinct_indices
+ scatter
(#14493) @wence-volatile
in Parquet (#14448) @vuulePublished by raydouglass 8 months ago
shuffle_method
kwarg (#14708) @pentschevint64_t
type (#14907) @SurajAralihallishuffle
deprecation to align with dask/dask (#14762) @rjzamoraGroupby.get_group
(#14728) @rjzamorashuffle_method
kwarg (#14708) @pentschevBaseOffset
(#14678) @shwinacuda::proclaim_return_type
from nested lambda (#14607) @ttnghiacudf::clamp
APIs (#14580) @davidwendtpd.Timestamp
objects to be picklable when cudf.pandas
is active (#14474) @shwinaMultiIndex.from_pandas
(#14470) @mroeschkestring_scalar
constructors (#14444) @vuulenumba<0.58
(#14616) @brandon-b-millerpytest<8
(#14920) @galipremsagarTimedeltaIndex.__init__
constructor (#14775) @mroeschkeDatetimeIndex.__init__
constructor (#14774) @mroeschkeframe.py
typing, move seldom used methods in frame.py
(#14766) @mroeschkepynvjitlink
as a dependency (#14763) @brandon-b-millercreate_structs_data
(#14761) @SurajAralihallicudf.MultiIndex.from_arrays
(#14740) @mroeschkeio::detail::orc
namespace into io::orc::detail
(#14696) @ttnghiaFieldType
to scoped enum (#14642) @vuulefreq
in DatetimeIndex (#14593) @shwinaCompactProtocolReader
(#14582) @vuuleCUDF_TEST_EXPECT_COLUMN_*
macros (#14559) @SurajAralihalliio::statistics
(#14542) @vuuleIndexedFrame.duplicated
with distinct_indices
+ scatter
(#14493) @wence-volatile
in Parquet (#14448) @vuulePublished by raydouglass 8 months ago
shuffle_method
kwarg (#14708) @pentschevint64_t
type (#14907) @SurajAralihallishuffle
deprecation to align with dask/dask (#14762) @rjzamoraGroupby.get_group
(#14728) @rjzamorashuffle_method
kwarg (#14708) @pentschevBaseOffset
(#14678) @shwinacuda::proclaim_return_type
from nested lambda (#14607) @ttnghiacudf::clamp
APIs (#14580) @davidwendtpd.Timestamp
objects to be picklable when cudf.pandas
is active (#14474) @shwinaMultiIndex.from_pandas
(#14470) @mroeschkestring_scalar
constructors (#14444) @vuulenumba<0.58
(#14616) @brandon-b-millerpytest<8
(#14920) @galipremsagarTimedeltaIndex.__init__
constructor (#14775) @mroeschkeDatetimeIndex.__init__
constructor (#14774) @mroeschkeframe.py
typing, move seldom used methods in frame.py
(#14766) @mroeschkepynvjitlink
as a dependency (#14763) @brandon-b-millercreate_structs_data
(#14761) @SurajAralihallicudf.MultiIndex.from_arrays
(#14740) @mroeschkeio::detail::orc
namespace into io::orc::detail
(#14696) @ttnghiaFieldType
to scoped enum (#14642) @vuulefreq
in DatetimeIndex (#14593) @shwinaCompactProtocolReader
(#14582) @vuuleCUDF_TEST_EXPECT_COLUMN_*
macros (#14559) @SurajAralihalliio::statistics
(#14542) @vuuleIndexedFrame.duplicated
with distinct_indices
+ scatter
(#14493) @wence-volatile
in Parquet (#14448) @vuulePublished by rapids-bot[bot] 9 months ago
future_stack
to DataFrame.stack
(#15015) @galipremsagarpandas-2.x
support in cudf
(#14916) @galipremsagarIndex.difference
to handle duplicate values when one of the inputs is empty (#15016) @galipremsagarfuture_stack
to DataFrame.stack
(#15015) @galipremsagarDataFrame.sort_index
to respect ignore_index
on all axis (#14995) @galipremsagarSeriesGroupBy.aggregate
to SeriesGroupBy.agg
(#14971) @rjzamoraCUDF_SPILL
after a pytest (#14958) @galipremsagarDeprecationWarning
's by ArrowTable.to_pandas()
(#14989) @galipremsagarctest
is called with --no-tests=error
. (#14983) @bdiceperiods
in date_range
and interval_range
(#14976) @galipremsagarGetJsonObjectOptions
in getJSONObject
Java API (#14956) @SurajAralihallipandas-2.x
support in cudf
(#14916) @galipremsagarcuco_allocator
(#14877) @PointKernelrapids_cuda_set_runtime
to determine cuda runtime usage by target (#14833) @vyasrPublished by raydouglass 11 months ago
reindex
when index
is not unique (#14400) @galipremsagarcudf.pandas
profiler (#14514) @galipremsagarbranch-23.10
into branch-23.12
(#14435) @raydouglassreindex
when index
is not unique (#14400) @galipremsagar__getattr__
(#14388) @shwinacudf::merge
(#14345) @divyegaladask_cudf/io
pytests in CI (#14338) @galipremsagarrecover_with_nulls
is enabled (#14252) @elstehledeallocate
(#14251) @robertmaynardpynvjitlink
for CUDA 12+ MVC (#13650) @brandon-b-millertest_kurt_skew_error
(#14368) @galipremsagarnullable()
to has_nulls()
in cudf::detail::gather
(#14363) @divyegalaarrow
to 13
(#14330) @galipremsagarpyorc
dependency and use pandas
/pyarrow
instead (#14323) @galipremsagarpyarrow.fs
import for local storage (#14321) @rjzamoradask
and distributed
for 23.12
development (#14320) @galipremsagarsource_info
(#14308) @vuuleshared-action-workflows
references (#14289) @AyodeAwepartd
encode dispatch in dask_cudf
(#14287) @rjzamoracudf::merge
for nested types without using comparators (#14250) @divyegalacontains_column
by invoking contains_table
(#14238) @PointKernel23.10
into 23.12
(#14231) @galipremsagarbytes_per_second
to distinct_count of stream_compaction nvbench. (#14172) @Blonckbytes_per_second
to transpose benchmark (#14170) @Blonckbytes_per_second
to shift benchmark (#13950) @Blonckdebug_utilities.hpp/cu
from column_utilities.hpp/cu
(#13720) @ttnghiaPublished by raydouglass 11 months ago
reindex
when index
is not unique (#14400) @galipremsagarcudf.pandas
profiler (#14514) @galipremsagarbranch-23.10
into branch-23.12
(#14435) @raydouglassreindex
when index
is not unique (#14400) @galipremsagar__getattr__
(#14388) @shwinacudf::merge
(#14345) @divyegaladask_cudf/io
pytests in CI (#14338) @galipremsagarrecover_with_nulls
is enabled (#14252) @elstehledeallocate
(#14251) @robertmaynardpynvjitlink
for CUDA 12+ MVC (#13650) @brandon-b-millertest_kurt_skew_error
(#14368) @galipremsagarnullable()
to has_nulls()
in cudf::detail::gather
(#14363) @divyegalaarrow
to 13
(#14330) @galipremsagarpyorc
dependency and use pandas
/pyarrow
instead (#14323) @galipremsagarpyarrow.fs
import for local storage (#14321) @rjzamoradask
and distributed
for 23.12
development (#14320) @galipremsagarsource_info
(#14308) @vuuleshared-action-workflows
references (#14289) @AyodeAwepartd
encode dispatch in dask_cudf
(#14287) @rjzamoracudf::merge
for nested types without using comparators (#14250) @divyegalacontains_column
by invoking contains_table
(#14238) @PointKernel23.10
into 23.12
(#14231) @galipremsagarbytes_per_second
to distinct_count of stream_compaction nvbench. (#14172) @Blonckbytes_per_second
to transpose benchmark (#14170) @Blonckbytes_per_second
to shift benchmark (#13950) @Blonckdebug_utilities.hpp/cu
from column_utilities.hpp/cu
(#13720) @ttnghiaPublished by rapids-bot[bot] 11 months ago
pd.Timestamp
objects to be picklable when cudf.pandas
is active (#14474) @shwinaMultiIndex.from_pandas
(#14470) @mroeschkestring_scalar
constructors (#14444) @vuulevolatile
in Parquet (#14448) @vuulePublished by raydouglass 11 months ago
reindex
when index
is not unique (#14429) @galipremsagarMixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagar23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuulereindex
when index
is not unique (#14429) @galipremsagaraws-sdk-cpp<1.11
(#14173) @pentschevDataFrame
constructor (#14119) @galipremsagarDataFrame
's as input to DataFrame
constructor (#14118) @galipremsagarnumeric_only=True
for reduction operations on numeric types (#14111) @galipremsagarDataFrame
(#14110) @galipremsagarkwargs
from Series.count
(#14106) @galipremsagarIndex.to_frame
and MultiIndex.to_frame
APIs (#14105) @galipremsagar__round__
in Series
and DataFrame
(#14099) @galipremsagarSeries
and Index
(#14080) @galipremsagarconda mambabuild
rather than mamba mambabuild
(#14067) @wence-Index.intersection
(#14054) @galipremsagarIndex.difference
to match with pandas (#14053) @galipremsagarIntervalIndex.union
to preserve type-metadata (#14051) @galipremsagarMixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarsort_remaining
for sort_index
(#14033) @wence-quantile
(#14014) @galipremsagarMultiIndex.difference
(#14009) @galipremsagarpd.IntervalDtype
(#14006) @galipremsagarname
selection in Index.difference
and Index.intersection
(#13986) @galipremsagardropna
to fix factorize
API (#13980) @galipremsagarMultiIndex.to_numpy
to return numpy array with tuples (#13966) @galipremsagarIntervalIndex.repr
when null values are present (#13958) @galipremsagarColumn.unique
(#13957) @galipremsagarInterval
scalars when passed in list-like inputs to cudf.Index
(#13956) @galipremsagardtype
is passed to a CategoricalColumn
(#13955) @galipremsagaras_index
in GroupBy.apply
(#13951) @brandon-b-millernsmallest
and nlargest
(#13946) @galipremsagarindex
of Groupby.apply
results when it is performed on empty objects (#13944) @galipremsagardevice_sum
functions (#13943) @brandon-b-millerGrouping
objects (#13932) @galipremsagarloc
when column names is MultiIndex
(#13929) @galipremsagarsearchsorted
(#13925) @galipremsagarname
in reindex
(#13917) @galipremsagarcudf::thread_index_type
in cuIO to prevent overflow in row indexing (#13910) @vuulethread_index_type
to avoid index overflow in grid-stride loops (#13895) @PointKernelDataFrame
with mixed types (#13889) @galipremsagarnan
when one variable to be correlated has zero variance in JIT GroupBy Apply (#13884) @brandon-b-millerread_csv
with compressed input (#13881) @vuuleMultiIndex.isin
(#13879) @galipremsagarMultiIndex.levels
(#13870) @galipremsagarfreq
is set in pandas-compatibility mode (#13857) @galipremsagarNA
from a TimedeltaColumn
(#13853) @galipremsagarfreq
(#13844) @shwinaSeries
and Index
(#13842) @galipremsagartimezone-aware
data is passed to as_column
(#13835) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagarDataFrame
constructor (#13811) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagarin
) method for Series
(#13779) @galipremsagarintegers_to_hex
(#14205) @razajafriHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14154) @ttnghiaGroupBy.value_counts
to match pandas API (#14114) @stmiohash_reduce_by_row
(#14095) @ttnghiaHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14045) @ttnghiaDataFrame.stack
to support multi level column names (#13927) @isVoiddatasource
to improve H2D throughput (#13814) @vuulecorr
in GroupBy.apply
through the jit engine (#13767) @shwinaGroupby.apply
with engine='jit'
(#13729) @brandon-b-millershared-action-workflows
references (backport from 23.12
to 23.10
) (#14300) @AyodeAwedask
and distributed
for 23.10
release (#14225) @galipremsagarRecently Updated
Check (#14193) @ajschmidt8Index
and raise warning in Series
constructor (#14116) @galipremsagarcontains_table
with cuco::static_set (#14064) @PointKernelobject
dtype will result in a failure (#13994) @galipremsagarcodes
dtype parity in pandas-compatibility mode for factorize
API (#13982) @galipremsagarCategoricalIndex
ordering in Groupby.agg
when pandas-compatibility mode is enabled (#13978) @galipremsagarthread_index_type
in partitioning.cu
(#13973) @divyegalacudf::thread_index_type
in merge.cu
(#13972) @divyegalacopy-pr-bot
(#13970) @ajschmidt8bytes_per_second
to hash_partition benchmark (#13965) @Blonckbytes_per_second
to copy_if_else benchmark (#13960) @BlonckSeries.unique
(#13959) @galipremsagarbytes_per_second
to compiled binaryop benchmark (#13938) @Blonckdask
and distributed
for 23.10
development (#13935) @galipremsagarempty()
instead of size()
where possible (#13908) @vuuleTimestamp
& Timedelta
for fetching scalars in DatetimeIndex
& TimedeltaIndex
(#13896) @galipremsagarshuffle="p2p"
within dask-cudf API (#13893) @rjzamoraDatetimeIndex
when freq
is passed to cudf.date_range
(#13890) @galipremsagardatetime
& timedelta
comparison operations (#13877) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarastype(object)
is called in pandas compatibility mode (#13862) @galipremsagarlists::contains
to experimental row comparator (#13810) @divyegalalists::contains
dispatches for scalars (#13805) @divyegala23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuulePublished by raydouglass 12 months ago
dask
and distributed
for release (#13070) @galipremsagarpandas
and numpy
pinnings (#12887) @galipremsagarnames
& dtype
in Index.copy
(#12825) @galipremsagarIndex.is_*
methods (#12820) @galipremsagardatetime_is_numeric
from describe
(#12818) @galipremsagarna_sentinel
in factorize
(#12817) @galipremsagarto_csv
(#12705) @wence-strings_udf
code into cuDF (#12669) @brandon-b-millerDataFrame
constructor to broadcast scalar inputs properly (#12997) @galipremsagarforce_nullable_schema
from chunked parquet writer (#12996) @galipremsagargroupby
(#12992) @galipremsagar__setitem__
on string columns when the scalar value ends in a null byte (#12991) @wence-sort_values
when column is all empty strings (#12988) @eriknwdask.dataframe.read_parquet
changes (#12983) @rjzamoraIndex.__getitem__
(#12955) @wence-RangeIndex
bug (#12838) @rjzamoraGroupBy.dtypes
(#12783) @galipremsagarnum_keys
in _scatter_by_slice
(#12749) @thomcomalways_nullable
flag to Dremel encoding (#12727) @divyegalaGroupBy.apply
(#12706) @brandon-b-millerto_csv
(#12705) @wence-round
API (#12670) @galipremsagarfrom_arrow
to load a sliced arrow table (#12665) @galipremsagarfind_common_dtype
and values
to handle complex dtypes (#12537) @galipremsagarSeries
comparison vs scalars (#12519) @brandon-b-millerUDFString
back to StringView
to call methods in strings_udf
(#12363) @brandon-b-millerGroupBy.apply
doc examples rendering (#12994) @brandon-b-millerpydata-sphinx-theme=0.13.0
compatible (#12874) @galipremsagarnullable
option application to single table writes (#12933) @vuuleio::orc::ProtobufWriter
(#12877) @ttnghiaGroupBy.apply
functions (#12802) @brandon-b-millertests/column_utilities
to use experimental::equality
row comparator (#12777) @divyegaladistinct/unique_count
to experimental::row
hasher/comparator (#12776) @divyegalahash_partition
to use experimental::row::row_hasher
(#12761) @divyegalais_sorted
to use experimental::row::lexicographic
(#12752) @divyegalaregex_program
strings splitting java APIs and tests (#12713) @cindyyuanjiangregex_program
strings replacing java APIs and tests (#12701) @cindyyuanjiangregex_program
strings extract java APIs and tests (#12699) @cindyyuanjiangstrings_udf
code into cuDF (#12669) @brandon-b-millerregex_program
searching APIs and related java classes (#12666) @cindyyuanjiangrank
to use to experimental row comparators (#12481) @divyegaladask
and distributed
for release (#13070) @galipremsagarforce_nullable_schema
parameter to Parquet writer. (#12952) @galipremsagargroupby.head
and groupby.tail
(#12939) @wence-SCCACHE_S3_USE_SSL
to conda builds (#12910) @ajschmidt8moto
env vars & pass AWS_SESSION_TOKEN
to conda builds (#12902) @ajschmidt8line_terminator
in favor of lineterminator
in to_csv
(#12896) @wence-stream
and mr
parameters for structs::detail::flatten_nested_columns
(#12892) @ttnghiapandas
and numpy
pinnings (#12887) @galipremsagargroupby.sample
(#12882) @wence-cudf::structs::detail::flatten_nested_columns
to smart pointer (#12878) @ttnghianames
& dtype
in Index.copy
(#12825) @galipremsagarinplace
parameters in categorical methods (#12824) @galipremsagarIndex.is_*
methods (#12820) @galipremsagardatetime_is_numeric
from describe
(#12818) @galipremsagarna_sentinel
in factorize
(#12817) @galipremsagarcudf::io::source_info
to use cudf::host_span<std::byte>
in a non-breaking form (#12730) @hyperbolic2346dask
and distributed
for development (#12710) @galipremsagarpartition_by_hash()
: use _split()
(#12704) @madsbkexperimental::row::equality
(#12676) @divyegalaconda
recipe (#12661) @ajschmidt8copy-on-write
feature branch into branch-23.04
(#12619) @galipremsagarPublished by rapids-bot[bot] about 1 year ago
MixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagar23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuuleaws-sdk-cpp<1.11
(#14173) @pentschevDataFrame
constructor (#14119) @galipremsagarDataFrame
's as input to DataFrame
constructor (#14118) @galipremsagarnumeric_only=True
for reduction operations on numeric types (#14111) @galipremsagarDataFrame
(#14110) @galipremsagarkwargs
from Series.count
(#14106) @galipremsagarIndex.to_frame
and MultiIndex.to_frame
APIs (#14105) @galipremsagar__round__
in Series
and DataFrame
(#14099) @galipremsagarSeries
and Index
(#14080) @galipremsagarconda mambabuild
rather than mamba mambabuild
(#14067) @wence-Index.intersection
(#14054) @galipremsagarIndex.difference
to match with pandas (#14053) @galipremsagarIntervalIndex.union
to preserve type-metadata (#14051) @galipremsagarMixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarsort_remaining
for sort_index
(#14033) @wence-quantile
(#14014) @galipremsagarMultiIndex.difference
(#14009) @galipremsagarpd.IntervalDtype
(#14006) @galipremsagarname
selection in Index.difference
and Index.intersection
(#13986) @galipremsagardropna
to fix factorize
API (#13980) @galipremsagarMultiIndex.to_numpy
to return numpy array with tuples (#13966) @galipremsagarIntervalIndex.repr
when null values are present (#13958) @galipremsagarColumn.unique
(#13957) @galipremsagarInterval
scalars when passed in list-like inputs to cudf.Index
(#13956) @galipremsagardtype
is passed to a CategoricalColumn
(#13955) @galipremsagaras_index
in GroupBy.apply
(#13951) @brandon-b-millernsmallest
and nlargest
(#13946) @galipremsagarindex
of Groupby.apply
results when it is performed on empty objects (#13944) @galipremsagardevice_sum
functions (#13943) @brandon-b-millerGrouping
objects (#13932) @galipremsagarloc
when column names is MultiIndex
(#13929) @galipremsagarsearchsorted
(#13925) @galipremsagarname
in reindex
(#13917) @galipremsagarcudf::thread_index_type
in cuIO to prevent overflow in row indexing (#13910) @vuulethread_index_type
to avoid index overflow in grid-stride loops (#13895) @PointKernelDataFrame
with mixed types (#13889) @galipremsagarnan
when one variable to be correlated has zero variance in JIT GroupBy Apply (#13884) @brandon-b-millerread_csv
with compressed input (#13881) @vuuleMultiIndex.isin
(#13879) @galipremsagarMultiIndex.levels
(#13870) @galipremsagarfreq
is set in pandas-compatibility mode (#13857) @galipremsagarNA
from a TimedeltaColumn
(#13853) @galipremsagarfreq
(#13844) @shwinaSeries
and Index
(#13842) @galipremsagartimezone-aware
data is passed to as_column
(#13835) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagarDataFrame
constructor (#13811) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagarin
) method for Series
(#13779) @galipremsagarintegers_to_hex
(#14205) @razajafriHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14154) @ttnghiaGroupBy.value_counts
to match pandas API (#14114) @stmiohash_reduce_by_row
(#14095) @ttnghiaHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14045) @ttnghiaDataFrame.stack
to support multi level column names (#13927) @isVoiddatasource
to improve H2D throughput (#13814) @vuulecorr
in GroupBy.apply
through the jit engine (#13767) @shwinaGroupby.apply
with engine='jit'
(#13729) @brandon-b-millershared-action-workflows
references (backport from 23.12
to 23.10
) (#14300) @AyodeAwedask
and distributed
for 23.10
release (#14225) @galipremsagarRecently Updated
Check (#14193) @ajschmidt8Index
and raise warning in Series
constructor (#14116) @galipremsagarcontains_table
with cuco::static_set (#14064) @PointKernelobject
dtype will result in a failure (#13994) @galipremsagarcodes
dtype parity in pandas-compatibility mode for factorize
API (#13982) @galipremsagarCategoricalIndex
ordering in Groupby.agg
when pandas-compatibility mode is enabled (#13978) @galipremsagarthread_index_type
in partitioning.cu
(#13973) @divyegalacudf::thread_index_type
in merge.cu
(#13972) @divyegalacopy-pr-bot
(#13970) @ajschmidt8bytes_per_second
to hash_partition benchmark (#13965) @Blonckbytes_per_second
to copy_if_else benchmark (#13960) @BlonckSeries.unique
(#13959) @galipremsagarbytes_per_second
to compiled binaryop benchmark (#13938) @Blonckdask
and distributed
for 23.10
development (#13935) @galipremsagarempty()
instead of size()
where possible (#13908) @vuuleTimestamp
& Timedelta
for fetching scalars in DatetimeIndex
& TimedeltaIndex
(#13896) @galipremsagarshuffle="p2p"
within dask-cudf API (#13893) @rjzamoraDatetimeIndex
when freq
is passed to cudf.date_range
(#13890) @galipremsagardatetime
& timedelta
comparison operations (#13877) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarastype(object)
is called in pandas compatibility mode (#13862) @galipremsagarlists::contains
to experimental row comparator (#13810) @divyegalalists::contains
dispatches for scalars (#13805) @divyegala23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuulePublished by raydouglass about 1 year ago
MixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagar23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuuleaws-sdk-cpp<1.11
(#14173) @pentschevDataFrame
constructor (#14119) @galipremsagarDataFrame
's as input to DataFrame
constructor (#14118) @galipremsagarnumeric_only=True
for reduction operations on numeric types (#14111) @galipremsagarDataFrame
(#14110) @galipremsagarkwargs
from Series.count
(#14106) @galipremsagarIndex.to_frame
and MultiIndex.to_frame
APIs (#14105) @galipremsagar__round__
in Series
and DataFrame
(#14099) @galipremsagarSeries
and Index
(#14080) @galipremsagarconda mambabuild
rather than mamba mambabuild
(#14067) @wence-Index.intersection
(#14054) @galipremsagarIndex.difference
to match with pandas (#14053) @galipremsagarIntervalIndex.union
to preserve type-metadata (#14051) @galipremsagarMixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarsort_remaining
for sort_index
(#14033) @wence-quantile
(#14014) @galipremsagarMultiIndex.difference
(#14009) @galipremsagarpd.IntervalDtype
(#14006) @galipremsagarname
selection in Index.difference
and Index.intersection
(#13986) @galipremsagardropna
to fix factorize
API (#13980) @galipremsagarMultiIndex.to_numpy
to return numpy array with tuples (#13966) @galipremsagarIntervalIndex.repr
when null values are present (#13958) @galipremsagarColumn.unique
(#13957) @galipremsagarInterval
scalars when passed in list-like inputs to cudf.Index
(#13956) @galipremsagardtype
is passed to a CategoricalColumn
(#13955) @galipremsagaras_index
in GroupBy.apply
(#13951) @brandon-b-millernsmallest
and nlargest
(#13946) @galipremsagarindex
of Groupby.apply
results when it is performed on empty objects (#13944) @galipremsagardevice_sum
functions (#13943) @brandon-b-millerGrouping
objects (#13932) @galipremsagarloc
when column names is MultiIndex
(#13929) @galipremsagarsearchsorted
(#13925) @galipremsagarname
in reindex
(#13917) @galipremsagarcudf::thread_index_type
in cuIO to prevent overflow in row indexing (#13910) @vuulethread_index_type
to avoid index overflow in grid-stride loops (#13895) @PointKernelDataFrame
with mixed types (#13889) @galipremsagarnan
when one variable to be correlated has zero variance in JIT GroupBy Apply (#13884) @brandon-b-millerread_csv
with compressed input (#13881) @vuuleMultiIndex.isin
(#13879) @galipremsagarMultiIndex.levels
(#13870) @galipremsagarfreq
is set in pandas-compatibility mode (#13857) @galipremsagarNA
from a TimedeltaColumn
(#13853) @galipremsagarfreq
(#13844) @shwinaSeries
and Index
(#13842) @galipremsagartimezone-aware
data is passed to as_column
(#13835) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagarDataFrame
constructor (#13811) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagarin
) method for Series
(#13779) @galipremsagarintegers_to_hex
(#14205) @razajafriHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14154) @ttnghiaGroupBy.value_counts
to match pandas API (#14114) @stmiohash_reduce_by_row
(#14095) @ttnghiaHISTOGRAM
and MERGE_HISTOGRAM
aggregations (#14045) @ttnghiaDataFrame.stack
to support multi level column names (#13927) @isVoiddatasource
to improve H2D throughput (#13814) @vuulecorr
in GroupBy.apply
through the jit engine (#13767) @shwinaGroupby.apply
with engine='jit'
(#13729) @brandon-b-millerdask
and distributed
for 23.10
release (#14225) @galipremsagarRecently Updated
Check (#14193) @ajschmidt8Index
and raise warning in Series
constructor (#14116) @galipremsagarcontains_table
with cuco::static_set (#14064) @PointKernelobject
dtype will result in a failure (#13994) @galipremsagarcodes
dtype parity in pandas-compatibility mode for factorize
API (#13982) @galipremsagarCategoricalIndex
ordering in Groupby.agg
when pandas-compatibility mode is enabled (#13978) @galipremsagarthread_index_type
in partitioning.cu
(#13973) @divyegalacudf::thread_index_type
in merge.cu
(#13972) @divyegalacopy-pr-bot
(#13970) @ajschmidt8bytes_per_second
to hash_partition benchmark (#13965) @Blonckbytes_per_second
to copy_if_else benchmark (#13960) @BlonckSeries.unique
(#13959) @galipremsagarbytes_per_second
to compiled binaryop benchmark (#13938) @Blonckdask
and distributed
for 23.10
development (#13935) @galipremsagarempty()
instead of size()
where possible (#13908) @vuuleTimestamp
& Timedelta
for fetching scalars in DatetimeIndex
& TimedeltaIndex
(#13896) @galipremsagarshuffle="p2p"
within dask-cudf API (#13893) @rjzamoraDatetimeIndex
when freq
is passed to cudf.date_range
(#13890) @galipremsagardatetime
& timedelta
comparison operations (#13877) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarastype(object)
is called in pandas compatibility mode (#13862) @galipremsagarlists::contains
to experimental row comparator (#13810) @divyegalalists::contains
dispatches for scalars (#13805) @divyegala23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuulePublished by rapids-bot[bot] about 1 year ago
recover_with_nulls
is enabled (#14252) @elstehledeallocate
(#14251) @robertmaynardshared-action-workflows
references (#14289) @AyodeAwecontains_column
by invoking contains_table
(#14238) @PointKernel23.10
into 23.12
(#14231) @galipremsagarbytes_per_second
to distinct_count of stream_compaction nvbench. (#14172) @Blonckbytes_per_second
to transpose benchmark (#14170) @Blonckbytes_per_second
to shift benchmark (#13950) @BlonckPublished by raydouglass about 1 year ago
_get_decimal_type
(#13224) @charlesblucadask_cudf.read_parquet
(#13697) @rjzamoracuda-nvcc-impl
to cudf
for numba
CUDA 12 (#13673) @jakirkhamread_parquet
(#13666) @rjzamorathrust::identity
as hash functions for byte pair encoding (#13665) @PointKernelMultiIndex.loc
ordering & fix a bug in Groupby
with as_index
(#13657) @galipremsagarhas_nonempty_nulls
ignoring column offset (#13647) @ttnghiaIndex
search to simplify code and increase correctness (#13625) @wence-localize(None)
for timezone-naive columns (#13603) @shwinadask_cudf.read_csv
when lines are needed to be skipped (#13555) @galipremsagarsingle_lane_block_sum_reduce
is safe to call in a loop (#13488) @wence-_can_be_jitted
in GroupBy Apply (#13429) @brandon-b-miller_get_decimal_type
(#13224) @charlesblucamin
and max
aggregations for list type in groupby and reduction (#13676) @ttnghiacuco::static_map
to build string dictionaries in ORC writer (#13580) @vuulelists::concatenate_list_elements
(#13547) @ttnghialists::concatenate_list_elements
(#13545) @ttnghiafind_index_of_val
(#13517) @brandon-b-millerabs
function to apply (#13408) @brandon-b-millerstruct_minmax_util
to experimental row comparator (#13069) @divyegaladask
and distributed
for 23.08
release (#13802) @galipremsagar__dask_tokenize__
definitions to cudf classes (#13695) @rjzamoracuda::atomic_ref
(#13583) @PointKernelcudf.Dataframe
to/from pyarrow.Table
conversion (#13558) @rjzamoracuco::pair_type
aliases (#13553) @PointKernelsort=False
in Groupby
(#13551) @galipremsagarstrings_to_categorical
in cudf.read_parquet
(#13540) @galipremsagarhostdevice_*
copying APIs (#13530) @ttnghiacudf.option_context
(#13511) @galipremsagardask
and distributed
for development (#13508) @galipremsagarPublished by rapids-bot[bot] about 1 year ago
MixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagar23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuuleDataFrame
constructor (#14119) @galipremsagarDataFrame
's as input to DataFrame
constructor (#14118) @galipremsagarnumeric_only=True
for reduction operations on numeric types (#14111) @galipremsagarkwargs
from Series.count
(#14106) @galipremsagarIndex.to_frame
and MultiIndex.to_frame
APIs (#14105) @galipremsagar__round__
in Series
and DataFrame
(#14099) @galipremsagarSeries
and Index
(#14080) @galipremsagarconda mambabuild
rather than mamba mambabuild
(#14067) @wence-Index.intersection
(#14054) @galipremsagarIndex.difference
to match with pandas (#14053) @galipremsagarIntervalIndex.union
to preserve type-metadata (#14051) @galipremsagarMixedTypeError
when a column of mixed-dtype is being constructed (#14050) @galipremsagarNotImplementedError
for MultiIndex.to_series
(#14049) @galipremsagarsort_remaining
for sort_index
(#14033) @wence-quantile
(#14014) @galipremsagarMultiIndex.difference
(#14009) @galipremsagarpd.IntervalDtype
(#14006) @galipremsagarname
selection in Index.difference
and Index.intersection
(#13986) @galipremsagardropna
to fix factorize
API (#13980) @galipremsagarMultiIndex.to_numpy
to return numpy array with tuples (#13966) @galipremsagarIntervalIndex.repr
when null values are present (#13958) @galipremsagarColumn.unique
(#13957) @galipremsagarInterval
scalars when passed in list-like inputs to cudf.Index
(#13956) @galipremsagardtype
is passed to a CategoricalColumn
(#13955) @galipremsagaras_index
in GroupBy.apply
(#13951) @brandon-b-millernsmallest
and nlargest
(#13946) @galipremsagarindex
of Groupby.apply
results when it is performed on empty objects (#13944) @galipremsagardevice_sum
functions (#13943) @brandon-b-millerGrouping
objects (#13932) @galipremsagarloc
when column names is MultiIndex
(#13929) @galipremsagarsearchsorted
(#13925) @galipremsagarname
in reindex
(#13917) @galipremsagarcudf::thread_index_type
in cuIO to prevent overflow in row indexing (#13910) @vuulethread_index_type
to avoid index overflow in grid-stride loops (#13895) @PointKernelDataFrame
with mixed types (#13889) @galipremsagarnan
when one variable to be correlated has zero variance in JIT GroupBy Apply (#13884) @brandon-b-millerread_csv
with compressed input (#13881) @vuuleMultiIndex.isin
(#13879) @galipremsagarMultiIndex.levels
(#13870) @galipremsagarfreq
is set in pandas-compatibility mode (#13857) @galipremsagarNA
from a TimedeltaColumn
(#13853) @galipremsagarfreq
(#13844) @shwinaSeries
and Index
(#13842) @galipremsagartimezone-aware
data is passed to as_column
(#13835) @galipremsagarany
, all
reduction behavior for axis=None
and warn for other reductions (#13831) @galipremsagarDataFrame
constructor (#13811) @galipremsagardatetime
and timedelta
types with other types (#13786) @galipremsagarin
) method for Series
(#13779) @galipremsagarGroupBy.value_counts
to match pandas API (#14114) @stmiohash_reduce_by_row
(#14095) @ttnghiaDataFrame.stack
to support multi level column names (#13927) @isVoiddatasource
to improve H2D throughput (#13814) @vuulecorr
in GroupBy.apply
through the jit engine (#13767) @shwinaGroupby.apply
with engine='jit'
(#13729) @brandon-b-millerIndex
and raise warning in Series
constructor (#14116) @galipremsagarobject
dtype will result in a failure (#13994) @galipremsagarcodes
dtype parity in pandas-compatibility mode for factorize
API (#13982) @galipremsagarCategoricalIndex
ordering in Groupby.agg
when pandas-compatibility mode is enabled (#13978) @galipremsagarthread_index_type
in partitioning.cu
(#13973) @divyegalacudf::thread_index_type
in merge.cu
(#13972) @divyegalacopy-pr-bot
(#13970) @ajschmidt8bytes_per_second
to hash_partition benchmark (#13965) @Blonckbytes_per_second
to copy_if_else benchmark (#13960) @BlonckSeries.unique
(#13959) @galipremsagarbytes_per_second
to compiled binaryop benchmark (#13938) @Blonckdask
and distributed
for 23.10
development (#13935) @galipremsagarempty()
instead of size()
where possible (#13908) @vuuleTimestamp
& Timedelta
for fetching scalars in DatetimeIndex
& TimedeltaIndex
(#13896) @galipremsagarDatetimeIndex
when freq
is passed to cudf.date_range
(#13890) @galipremsagardatetime
& timedelta
comparison operations (#13877) @galipremsagarNA
to NaT
for datetime
and timedelta
types (#13868) @galipremsagarastype(object)
is called in pandas compatibility mode (#13862) @galipremsagarlists::contains
to experimental row comparator (#13810) @divyegalalists::contains
dispatches for scalars (#13805) @divyegala23.10
(#13732) @galipremsagardatasource.hpp
public header (#13698) @vuulePublished by rapids-bot[bot] over 1 year ago
observed=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaarrow-11
(#12757) @galipremsagarIntervalIndex
constructor (#13394) @shwinafrom_column_view
(#13315) @wence-compute_column
(#13245) @wence-is_decompression_disabled
and is_compression_disabled
thread-safe (#13240) @vuulehostdevice_vector::subspan
(#13187) @ttnghiacudf::nvbench_base_fixture
usage (#13183) @robertmaynardSeries
and DataFrame
constructors to validate index lengths (#13122) @galipremsagarnull_count
of columns returned by chunked_parquet_reader
(#13111) @vuuleNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaread_parquet
benchmarks (#13082) @vuuleskip_rows
in ORC and Parquet readers (#13063) @vuulecontains_table
to experimental row hasher and equality comparator (#13119) @divyegalaGroupBy.apply
algorithm based on if the UDF is jittable (#13113) @brandon-b-millermixed_join
to use experimental row hasher and comparator (#13028) @divyegalahostdevice_span
that is a span createable from hostdevice_vector
(#12981) @hyperbolic2346join
to use experimental row hasher and comparator (#12787) @divyegaladtype
parameter in get_dummies
(#13365) @galipremsagarStringIndex
and use Index
instead (#13361) @galipremsagarTypeVar(bound="Foo")
(#13346) @wence-distinct_count
with cuco::static_set
(#13343) @PointKernelcontiguous_split
performance (#13342) @ttnghiaread_parquet
(#13334) @rjzamorameta
calculation in dask_cudf.read_parquet
(#13327) @rjzamoradistinct_count
benchmark (#13321) @PointKernelobserved=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaBaseDtype
when infering the data type of an arbitrary object (#13295) @shwinatransform_lists_of_structs
in row_operators.cu
(#13288) @ttnghiapartition
scatter_map
construction (#13272) @wence-pandas_compatible
option in cudf
(#13241) @galipremsagargpuDecideCompression
(#13202) @vuulecreate_serialized_trie
(#13201) @vuulepad
and backfill
methods (#13140) @galipremsagarUNKNOWN_NULL_COUNT
(#13134) @vyasrUNKNOWN_NULL_COUNT
(#13102) @vyasr.element()
instead of .data()
for window range calculations (#13095) @mythrocksbranch-23.04
to branch-23.06
(#13080) @galipremsagarcudf::detail::sorted_order
(#13062) @ttnghiabranch-23.04
to branch-23.06
(#13009) @galipremsagararrow-11
(#12757) @galipremsagarPublished by raydouglass over 1 year ago
observed=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaarrow-11
(#12757) @galipremsagarIntervalIndex
constructor (#13394) @shwinafrom_column_view
(#13315) @wence-compute_column
(#13245) @wence-is_decompression_disabled
and is_compression_disabled
thread-safe (#13240) @vuulehostdevice_vector::subspan
(#13187) @ttnghiacudf::nvbench_base_fixture
usage (#13183) @robertmaynardSeries
and DataFrame
constructors to validate index lengths (#13122) @galipremsagarnull_count
of columns returned by chunked_parquet_reader
(#13111) @vuuleNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaread_parquet
benchmarks (#13082) @vuuleskip_rows
in ORC and Parquet readers (#13063) @vuulecontains_table
to experimental row hasher and equality comparator (#13119) @divyegalaGroupBy.apply
algorithm based on if the UDF is jittable (#13113) @brandon-b-millermixed_join
to use experimental row hasher and comparator (#13028) @divyegalahostdevice_span
that is a span createable from hostdevice_vector
(#12981) @hyperbolic2346join
to use experimental row hasher and comparator (#12787) @divyegaladtype
parameter in get_dummies
(#13365) @galipremsagarStringIndex
and use Index
instead (#13361) @galipremsagarTypeVar(bound="Foo")
(#13346) @wence-distinct_count
with cuco::static_set
(#13343) @PointKernelcontiguous_split
performance (#13342) @ttnghiaread_parquet
(#13334) @rjzamorameta
calculation in dask_cudf.read_parquet
(#13327) @rjzamoradistinct_count
benchmark (#13321) @PointKernelobserved=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaBaseDtype
when infering the data type of an arbitrary object (#13295) @shwinatransform_lists_of_structs
in row_operators.cu
(#13288) @ttnghiapartition
scatter_map
construction (#13272) @wence-pandas_compatible
option in cudf
(#13241) @galipremsagargpuDecideCompression
(#13202) @vuulecreate_serialized_trie
(#13201) @vuulepad
and backfill
methods (#13140) @galipremsagarUNKNOWN_NULL_COUNT
(#13134) @vyasrUNKNOWN_NULL_COUNT
(#13102) @vyasr.element()
instead of .data()
for window range calculations (#13095) @mythrocksbranch-23.04
to branch-23.06
(#13080) @galipremsagarcudf::detail::sorted_order
(#13062) @ttnghiabranch-23.04
to branch-23.06
(#13009) @galipremsagararrow-11
(#12757) @galipremsagarPublished by raydouglass over 1 year ago
observed=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaarrow-11
(#12757) @galipremsagarIntervalIndex
constructor (#13394) @shwinafrom_column_view
(#13315) @wence-compute_column
(#13245) @wence-is_decompression_disabled
and is_compression_disabled
thread-safe (#13240) @vuulehostdevice_vector::subspan
(#13187) @ttnghiacudf::nvbench_base_fixture
usage (#13183) @robertmaynardSeries
and DataFrame
constructors to validate index lengths (#13122) @galipremsagarnull_count
of columns returned by chunked_parquet_reader
(#13111) @vuuleNotImplementedError
when attempting to construct cuDF objects from timezone-aware datetimes (#13086) @shwinaread_parquet
benchmarks (#13082) @vuuleskip_rows
in ORC and Parquet readers (#13063) @vuulecontains_table
to experimental row hasher and equality comparator (#13119) @divyegalaGroupBy.apply
algorithm based on if the UDF is jittable (#13113) @brandon-b-millermixed_join
to use experimental row hasher and comparator (#13028) @divyegalahostdevice_span
that is a span createable from hostdevice_vector
(#12981) @hyperbolic2346join
to use experimental row hasher and comparator (#12787) @divyegaladtype
parameter in get_dummies
(#13365) @galipremsagarStringIndex
and use Index
instead (#13361) @galipremsagarTypeVar(bound="Foo")
(#13346) @wence-distinct_count
with cuco::static_set
(#13343) @PointKernelcontiguous_split
performance (#13342) @ttnghiaread_parquet
(#13334) @rjzamorameta
calculation in dask_cudf.read_parquet
(#13327) @rjzamoradistinct_count
benchmark (#13321) @PointKernelobserved=
argument in groupby to True
to reflect the actual behaviour (#13296) @shwinaBaseDtype
when infering the data type of an arbitrary object (#13295) @shwinatransform_lists_of_structs
in row_operators.cu
(#13288) @ttnghiapartition
scatter_map
construction (#13272) @wence-pandas_compatible
option in cudf
(#13241) @galipremsagargpuDecideCompression
(#13202) @vuulecreate_serialized_trie
(#13201) @vuulepad
and backfill
methods (#13140) @galipremsagarUNKNOWN_NULL_COUNT
(#13134) @vyasrUNKNOWN_NULL_COUNT
(#13102) @vyasr.element()
instead of .data()
for window range calculations (#13095) @mythrocksbranch-23.04
to branch-23.06
(#13080) @galipremsagarcudf::detail::sorted_order
(#13062) @ttnghiabranch-23.04
to branch-23.06
(#13009) @galipremsagararrow-11
(#12757) @galipremsagarPublished by raydouglass over 1 year ago
dask
and distributed
for release (#13070) @galipremsagarpandas
and numpy
pinnings (#12887) @galipremsagarnames
& dtype
in Index.copy
(#12825) @galipremsagarIndex.is_*
methods (#12820) @galipremsagardatetime_is_numeric
from describe
(#12818) @galipremsagarna_sentinel
in factorize
(#12817) @galipremsagarto_csv
(#12705) @wence-strings_udf
code into cuDF (#12669) @brandon-b-millerDataFrame
constructor to broadcast scalar inputs properly (#12997) @galipremsagarforce_nullable_schema
from chunked parquet writer (#12996) @galipremsagargroupby
(#12992) @galipremsagar__setitem__
on string columns when the scalar value ends in a null byte (#12991) @wence-sort_values
when column is all empty strings (#12988) @eriknwdask.dataframe.read_parquet
changes (#12983) @rjzamoraIndex.__getitem__
(#12955) @wence-RangeIndex
bug (#12838) @rjzamoraGroupBy.dtypes
(#12783) @galipremsagarnum_keys
in _scatter_by_slice
(#12749) @thomcomalways_nullable
flag to Dremel encoding (#12727) @divyegalaGroupBy.apply
(#12706) @brandon-b-millerto_csv
(#12705) @wence-round
API (#12670) @galipremsagarfrom_arrow
to load a sliced arrow table (#12665) @galipremsagarfind_common_dtype
and values
to handle complex dtypes (#12537) @galipremsagarSeries
comparison vs scalars (#12519) @brandon-b-millerUDFString
back to StringView
to call methods in strings_udf
(#12363) @brandon-b-millerGroupBy.apply
doc examples rendering (#12994) @brandon-b-millerpydata-sphinx-theme=0.13.0
compatible (#12874) @galipremsagarnullable
option application to single table writes (#12933) @vuuleio::orc::ProtobufWriter
(#12877) @ttnghiaGroupBy.apply
functions (#12802) @brandon-b-millertests/column_utilities
to use experimental::equality
row comparator (#12777) @divyegaladistinct/unique_count
to experimental::row
hasher/comparator (#12776) @divyegalahash_partition
to use experimental::row::row_hasher
(#12761) @divyegalais_sorted
to use experimental::row::lexicographic
(#12752) @divyegalaregex_program
strings splitting java APIs and tests (#12713) @cindyyuanjiangregex_program
strings replacing java APIs and tests (#12701) @cindyyuanjiangregex_program
strings extract java APIs and tests (#12699) @cindyyuanjiangstrings_udf
code into cuDF (#12669) @brandon-b-millerregex_program
searching APIs and related java classes (#12666) @cindyyuanjiangrank
to use to experimental row comparators (#12481) @divyegaladask
and distributed
for release (#13070) @galipremsagarforce_nullable_schema
parameter to Parquet writer. (#12952) @galipremsagargroupby.head
and groupby.tail
(#12939) @wence-SCCACHE_S3_USE_SSL
to conda builds (#12910) @ajschmidt8moto
env vars & pass AWS_SESSION_TOKEN
to conda builds (#12902) @ajschmidt8line_terminator
in favor of lineterminator
in to_csv
(#12896) @wence-stream
and mr
parameters for structs::detail::flatten_nested_columns
(#12892) @ttnghiapandas
and numpy
pinnings (#12887) @galipremsagargroupby.sample
(#12882) @wence-cudf::structs::detail::flatten_nested_columns
to smart pointer (#12878) @ttnghianames
& dtype
in Index.copy
(#12825) @galipremsagarinplace
parameters in categorical methods (#12824) @galipremsagarIndex.is_*
methods (#12820) @galipremsagardatetime_is_numeric
from describe
(#12818) @galipremsagarna_sentinel
in factorize
(#12817) @galipremsagarcudf::io::source_info
to use cudf::host_span<std::byte>
in a non-breaking form (#12730) @hyperbolic2346dask
and distributed
for development (#12710) @galipremsagarpartition_by_hash()
: use _split()
(#12704) @madsbkexperimental::row::equality
(#12676) @divyegalaconda
recipe (#12661) @ajschmidt8copy-on-write
feature branch into branch-23.04
(#12619) @galipremsagar