Newsletters




Big Data

The well-known three Vs of Big Data - Volume, Variety, and Velocity – are increasingly placing pressure on organizations that need to manage this data as well as extract value from this data deluge for Predictive Analytics and Decision-Making. Big Data technologies, services, and tools such as Hadoop, MapReduce, Hive and NoSQL/NewSQL databases and Data Integration techniques, In-Memory approaches, and Cloud technologies have emerged to help meet the challenges posed by the flood of Web, Social Media, Internet of Things (IoT) and machine-to-machine (M2M) data flowing into organizations.



Big Data Articles

Zumasys is acquiring the assets of PPI Information Systems, a PICK software house located in St. Louis, Missouri. PPI is a proven provider of software services offering programming services in PICK and Microsoft .NET using SQL or Universe as databases.

Posted May 25, 2022

Rocket Software is introducing the Rocket MV BASIC for Visual Studio Code 1.6.0 extension, now available on the Visual Studio Code Marketplace. Rocket MV BASIC for VS Code (MVVS) allows BASIC developers to edit, compile, and now debug their BASIC applications in Microsoft Visual Studio Code.

Posted May 25, 2022

Tamr, a cloud-native data mastering solution, is offering Tamr Enrich, a set of enrichment services built natively into the data mastering process using Tamr's patented human-guided machine learning. Tamr Enrich curates and actively manages external datasets and services, enabling customers to seamlessly embed trusted, high-quality external data insights to their data mastering pipelines for richer business.   

Posted May 24, 2022

Equalum, a provider of data integration and ingestion solutions, is expanding modern data integration support and introducing a next-gen design solution for optimal data loading to GCP's Big Query, Google Cloud Storage, Cloud SQL, as well as other cloud platforms such as Microsoft Azure and AWS.

Posted May 24, 2022

Matillion, provider of an enterprise cloud data integration platform, is releasing Matillion Data Loader 2.0, empowering enterprises to simplify data ingestion and accelerate insights with a cloud-native, no-code experience. Matillion Data Loader provides a single unified experience across batch loading and real-time, log-based change data capture (CDC) pipelines, and a consumption-based pricing model to help customers better manage data integration costs.

Posted May 24, 2022

Next Pathway Inc., the Automated Cloud Migration company, is extending the capabilities of its cloud migration planning tool, Crawler360. In this latest release, Next Pathway has improved and expanded the visualization capabilities of this tool, giving clients better insights into the dependencies across data pipelines in legacy data warehouses and data lakes.

Posted May 23, 2022

ScaleFlux, Inc., a provider deploying SSDs with computational storage at scale, is introducing its ScaleFlux Partner Program, giving customers broad access to the easiest approach to coping with data growth.

Posted May 23, 2022

Push Technology, a provider of real-time data streaming and messaging solutions, is releasing Diffusion 6.8, adding new features that include the Diffusion Gateway Framework, expanded data wrangling calculations and conditionals, and journal logging.

Posted May 20, 2022

Imply Data, founded by the original creators of Apache Druid, announced its $100 million Series D financing, which values the company at $1.1 billion. This investment round was led by Thoma Bravo with participation from OMERS Growth Equity, both new investors. Existing investors Bessemer Venture Funds, Andreessen Horowitz, and Khosla Ventures also participated in the financing.

Posted May 19, 2022

Thrive, a leading Managed Security Services Providers (MSSPs), is upgrading its 24x7x365, eyes-on-glass Security Operation Center (SOC) by integrating a Security Orchestration, Automation, and Response (SOAR) engine. The SOAR capabilities will enable the Thrive global security team to better navigate complex, risk-laden environments for clients via tool aggregation and coordinated response, unified operations, reduced alert fatigue, and Artificial Intelligence (AI).

Posted May 19, 2022

A common pattern in data lake and lakehouse design is structuring data into zones, with bronze, silver, and gold being typical labels. Each zone is suitable for different workloads and different consumers. For instance, machine learning algorithms typically process against bronze or silver, while analytic dashboards often query gold. This prompts the question: Which layer is best suited for applying data quality rules and actions? The answer: All of them.

Posted May 18, 2022

As the world becomes increasingly data-driven, AI/ML algorithms are being incorporated in most business applications. Historically, data in AI architectures was moved to a central location to perform both model training and inference. This centralized approach is becoming untenable due to cost, performance, and privacy reasons.

Posted May 18, 2022

Data consumers need data for BI and analytics to make business decisions. But for most organizations, their current data infrastructure isn't keeping up with demand. In a presentation at Data Summit 2022, titled "Building the Open Data Lakehouse," Mark Lyons, senior director, product management, Dremio, explained why more organizations are moving their analytics and BI to an open data lakehouse and how you can build a successful lakehouse strategy.

Posted May 18, 2022

As sensor technology becomes more affordable, companies of all sizes will have the ability to embrace IoT strategies to build innovative products and services and establish new revenue streams. Yet, as with any promising technology, challenges remain. At Data Summit 2022, Paul Scott-Murphy, CTO, WANdisco discussed "Solving the IoT Data Management Puzzle With Gateways to the Cloud."

Posted May 18, 2022

Companies now collect more data than ever before, but challenges remain for accessing and analyzing them. David Armlin, VP solution architect and customer success, ChaosSearch, discussed "Learn, Unlearn, Relearn: Embracing the Future of Cloud Analytics," during his Data Summit 2022 session.

Posted May 18, 2022

Wednesday's Data Summit 2022 keynotes opened with Laura Sebastian-Coleman, data quality director, Prudential Financial, who discussed "Data Quality Deniers & What We Learn From Them." One of the biggest organizational obstacles to data quality management is basic pessimism about the possibility of managing the quality of data. This is due to lack of clarity—the goals and processes for data quality management have not been defined or have not been understood—and disbelief that the quality of data could be subject to control.

Posted May 18, 2022

At Data Summit 2022, Sudha Viswanathan, staff engineer, Wayfair, presented a talk titled, "Gaining Insights From Clickstream Data." Viswanathan explained that Wayfair's clickstream data refers to data that contains information about customer actions on the Wayfair site, such as what pages were viewed, the products that were clicked, what was added to the cart, which URL brought the customer to Wayfair. This helps Wayfair make data-driven decisions regarding revenue attribution for different marketing channels and improves traffic and test analysis and ad bidding.

Posted May 18, 2022

Infobip, a global cloud communications company and current member of Oracle PartnerNetwork (OPN), announced that it will enable Oracle Advertising and Customer Experience (CX) customers to orchestrate powerful consumer interactions using Oracle Digital Assistant. The Oracle Digital Assistant has integrated Infobip's WhatsApp solution, so businesses can manage incoming and outgoing messages, send rich media, and input location.

Posted May 18, 2022

Oracle announced it has updated Oracle Service to embed data from Oracle Unity Customer Data Platform (CDP), helping customer service agents gain a complete view of the customer, improve agent efficiency, and enhance service quality. Part of Oracle Fusion Cloud Customer Experience (CX), Oracle Service and Oracle Unity CDP leverage artificial intelligence to help organizations deliver more personalized, informed, and efficient customer service engagements, according to the vendor.

Posted May 18, 2022

AlmaLinux OS Foundation, the nonprofit that stewards the community owned and governed open source CentOS replacement AlmaLinux, announced that AlmaLinux is now available on the Oracle Cloud Infrastructure (OCI) marketplace. It is focused on long-term stability and delivering a robust production-grade platform.

Posted May 18, 2022

No other subject seems to capture the attention of IT leaders right now like database migrations. If there were an IT theme for 2022, it would be: Enterprises migrate from legacy data warehouses to the cloud. And it is no longer just the "early adopters" but the entire customer base that is looking to make the move to cloud-based systems. Let's examine the three most common problems that hamper the execution of migration projects and what can be done to avert migration disasters.

Posted May 18, 2022

To turn data into insights and leverage the wealth of information that they are collecting, organizations need to ensure that their data is up-to-date and trustworthy. There is no magic answer. It's a combination of technology and processes. Kevin Campbell, CEO of Syniti, and Phil Fersht, CEO and chief analyst at HFS Research, discussed data value research conducted with Global 2000 C-level executives during their Data Summit 2022 presentation, "Every Problem Is a Data Problem: How Bad Data Is Killing Your Business."

Posted May 17, 2022

"Every company is a data company," Keith Alsheimer, head of marketing, Unravel Data during his Data Summit 2022 presentation. Alsheimer and Chris Santiago, VP solutions engineering, Unravel Data, discussed DataOps and how it can solve big data problems during the presentation. The annual Data Summit conference returned in-person to Boston, May 17-18, 2022, with pre-conference workshops on May 16.

Posted May 17, 2022

There are so many new buzzwords lately, including the data lakehouse, data mesh, and data fabric, just to name a few. But what do all these terms mean, and how do they compare to a data warehouse? This presentation covers all of them in detail and explains the pros and cons of each, with suggested use cases so attendees can see what approach will really work best for their big data needs.

Posted May 17, 2022

Around 85% of analytics, big data, and AI projects will fail, despite massive investments of money. It's not new news, but it still reflects on how powerfully design affects speed, scale, and usage. At Data Summit 2022, Brian O'Neill, founder and principal, Designing for Analytics presented his session, "Technically Right, Effectively Wrong: How to Avoid Creating the ML or Analytics Application No Customer Wants to Use."

Posted May 17, 2022

Machine learning is revolutionizing the process of complex decision-making by enabling the analysis of bigger, more complex datasets and the delivery of faster, more accurate results. At Data Summit 2022, Charna Parkey, VP of product, Kaskada presented "The Basics of Machine Learning" during her workshop session.

Posted May 16, 2022

Knowledge graphs are a valuable tool that organizations can use to manage the vast amounts of data they collect, store, and analyze. At Data Summit 2022, Joseph Hilger, COO, Enterprise Knowledge LLC and Sara Nash, senior consultant, data and information management, Enterprise Knowledge, LLC presented an "Introduction to Knowledge Graphs" during their workshop session.

Posted May 16, 2022

Operational databases continue to expand, with database sizes growing in most organizations. In terms of performance, the more data in the operational database, the less efficient transactions running against that database tend to be. The other impact, database administration complexity, causes longer processing time and outages to perform traditional DBA tasks. But as important as operational performance and administration issues are, frequently they are ancillary to the regulatory issue of preserving authentic data over time.

Posted May 16, 2022

Talend, a global provider of data integration and management, is releasing the Spring ‘22 version of Talend Data Fabric, adding advanced capabilities to Talend Trust Score including aggregation and historical views into the health of any dataset. These new features will help businesses analyze combined data quality metrics to evaluate data trust at macro and micro levels, including across all datasets, groups of datasets, or individual datasets, according to the vendor.

Posted May 11, 2022

Druva Inc. is signing a multi-year global Strategic Collaboration Agreement (SCA) with Amazon Web Services, Inc. (AWS) to accelerate customer migration and provide an added layer of cyber resiliency to the already secure AWS Cloud. Built on the existing relationship between Druva and AWS, this agreement underscores both companies' commitment to delivering a cloud-native data protection solution on AWS, and supporting enterprises during critical phases of the cloud journey, including support for workload migration and deployments.

Posted May 11, 2022

D2iQ, an enterprise Kubernetes provider for smart cloud-native applications, has introduced version 2.0 of Kaptain AI/ML, an enterprise-ready distribution of open source Kubeflow that enables organizations to develop, deploy, and run AI and machine learning (ML) workloads in production environments. 

Posted May 10, 2022

vFunction, providers of a platform to apply AI to application modernization, is releasing the vFunction Assessment Hub, using AI to accurately calculate the effect of technical debt across applications, their negative impact on innovation, predict the benefits of refactoring, and then integrate seamlessly into an automated refactoring platform.

Posted May 10, 2022

Immuta, a provider of data access and data security, is expanding its partnership with Starburst, the analytics anywhere company, to address growing data access control and security demands for modern data architectures.

Posted May 10, 2022

Dell Technologies revealed it will deliver new cloud experiences, an expanded ecosystem, and offerings to help customers manage and protect applications across data centers and multi-cloud environments. These new offerings are designed to help organizations easily store, protect, and control their data and applications across an increasing number of platforms and locations, according to the vendor.

Posted May 09, 2022

Domino Data Lab, provider of a leading enterprise MLOps platform is introducing Domino 5.2, continuing Domino's progress towards helping enterprises become model-driven.

Posted May 09, 2022

Siren, a provider of Investigative Intelligence analytics, is releasing Siren 12.1, introducing several enhancements and improvements including 360 degrees data visibility, downloadable and editable reports, and data model scalability. The latest iteration of the Siren platform pushes forward what is achievable in the investigative world, launching new capabilities which have been developed in line with rapidly changing investigators requirements to generate insights at machine speed and scale, according to the vendor.

Posted May 09, 2022

OccamSec, a cybersecurity provider, is releasing the Incenter platform, identifying the security weaknesses an organization has in real-time and helping teams develop insights and communicate business context from a security perspective. Incenter combines the functionality of a range of security services in one single solution. The platform provides, in real time, where an organization is vulnerable, and just as critically, what the impact will be if an attack occurs.

Posted May 05, 2022

Galileo is emerging from stealth with a machine learning (ML) data intelligence platform for unstructured data that gives data scientists the ability to inspect, discover, and fix critical ML data errors fast across the entire ML lifecycle. The platform is currently in private beta with the Fortune 500 and startups across multiple industries.

Posted May 05, 2022

Teleport, a provider of Identity-based Infrastructure Access Management, announced it has raised $110 million in Series C funding, enabling the company to expand its go-to-market organization to serve its fast-growing, global customer base. Teleport will also bolster its R&D organization to solve the most complex security challenges faced by organizations of all sizes.

Posted May 04, 2022

Syniti, a global leader in enterprise data management, is introducing the first round of multiple updates to the Syniti Knowledge Platform this year.

Posted May 04, 2022

Alluxio, the developer of the open source data orchestration platform for data driven workloads such as large-scale analytics and AI/ML, is releasing version 2.8 of its Data Orchestration Platform, featuring enhanced interface support for the Amazon S3 REST API; security improvements for sensitive applications with strict encryption compliance and regulatory requirements; and strengthened automated data movement functionality across heterogeneous storage systems.

Posted May 04, 2022

The volume, velocity and veracity of today's data deluge has put immense pressure on underlying data platforms and organizations' abilities to manage them effectively. And the pandemic has only exacerbated the problem. According to a 2021 survey, nearly half of digital architects are under high or extremely high pressure to deliver digital projects, but 61% blame legacy technology for making it difficult to complete modernization efforts. That said, databases of all types—SQL, NoSQL, or NewSQL—be they on-prem, cloud, hybrid, or edge, are struggling to navigate this new reality.

Posted May 04, 2022

MongoDB's recent enhancements are definitely of the perfective variety—broadly improving on the initial implementations of new features of 5.0. However, they go a long way toward enhancing the capabilities of 5.0 and creating a significant advantage for users of the MongoDB Atlas cloud.

Posted May 04, 2022

The 9th annual Data Summit conference will be held May 17-18, 2022, at the Hyatt Regency Boston. Pre-conference workshops will take place on May 16, 2022. The program is available for review and a variety of pass options are available to suit individual requirements.

Posted May 04, 2022

Rocket Software is launching the latest version of the MultiValue Performance Experience (MVX: Performance) platform, delivering increasing value to customers and addressing issues and bugs in the code faster. The latest MVX: Performance 1.2.0 release continues to add metrics customers are interested in monitoring.

Posted May 04, 2022

BlueFinity is giving users the option of developing a native or web app with Evoke. With a low-code platform, such as Evoke from BlueFinity, the developer can generate and deploy native (as well as web apps) to run on all devices, operating systems and browsers, all from the same app design and code.

Posted May 04, 2022

It is well known that a database is the fundamental building block for any data-based initiative. Databases are used when collecting, storing, processing, and analyzing data. A database is the silent component that drives business decisions and operational improvements or simply keeps track of inventory. As much as the database should be the almost invisible part of these processes, it is crucial to make the right choice. While it might look easy to select a suitable database, there are a few things to evaluate when making a decision.

Posted May 04, 2022

Equalum, a best-in-class provider of data integration and ingestion solutions, is releasing its Continuous Data Integration Platform (CDIP) Version 3.0, available for on-prem, hybrid, or cloud-based operations. The platform supports real-time streaming use cases as well as batch ETL, replication, and tier one change data capture.

Posted May 04, 2022

Pages
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154

Sponsors