|
| reduce time for search query |
 |
Tue, 26 Feb 2008 04:41:24 -080 |
Hi,
I have a task at hand to reduce the time taken for search query to
execute. The query fetches records which will have to sorted by
degrees away from the logged in user. I have a function which
calculates the degrees, but using this in the search query slows the
execution and takes about 10 secs to complete which is unacceptable.
Please advice. Your help is much appreciated
For more details plz see:
http://www.sqlteam.com/forums/topic.asp?TOPIC_ID=97021
Thanks
Isfaar
|
| Post Reply
|
| Re: reduce time for search query |
 |
Wed, 27 Feb 2008 10:59:09 -080 |
On Feb 26, 7:41 am, paankh...@googlemail.com wrote:
> Hi,
>
> I have a task at hand to reduce the time taken for search query to
> execute. The query fetches records which will have to sorted by
> degrees away from the logged in user. I have a function which
> calculates the degrees, but using this in the search query slows the
> execution and takes about 10 secs to complete which is unacceptable.
>
> Please advice. Your help is much appreciated
>
> For more details plz see:
>
> http://www.sqlteam.com/forums/topic.asp?TOPIC_ID=97021
>
> Thanks
> Isfaar
Hi Isfaar,
Would it be possible for you to post your query here along with udf
definition. When a scalar UDF is invoked for each row in a query, it
can degrade the performance i.e. for each row you incur UDF invocation
cost. An inline expression might be faster than UDF in this case.
Depending on your SQL Server version, you might have to adopt
different approach. If you are on SQL Server 2005, then cross apply
operator in conjuction with inline table function will do the trick
for you. Inline table function behaves differently than scalar
function.
If you are on SQL Server 2000, a precisely written subquery might do
the trick.
HTH,
|
| Post Reply
|
| Re: reduce time for search query |
 |
Wed, 27 Feb 2008 22:51:27 -050 |
I can only repeat the same as Hugo and Najm, it is hard to say anything
without seeing the actual table, indexes, sample data, required results, and
business requirements/rules. Reading all those posts in the link at
sqlteam.com I can see only sample code provided by Peso (Peter Larsson). The
problem could be difficult or easy to solve, but hard to say without seeing
the real requirements.
Perhaps some things to consider are using materialized path or nested sets
to stored the hierarchy of contacts. Those models provide very efficient
retrieval of distance between nodes info, but have more difficult methods
for maintaining data.
Here is just a small sample (using the sample table and data provided by
Peso) on how materialized path may look:
-- Sample table with data
CREATE TABLE Contacts (
c_from CHAR(1),
c_to CHAR(1),
PRIMARY KEY (c_from, c_to));
INSERT INTO Contacts
SELECT 'A', 'B' UNION ALL
SELECT 'B', 'D' UNION ALL
SELECT 'C', 'A' UNION ALL
SELECT 'C', 'E' UNION ALL
SELECT 'G', 'C' UNION ALL
SELECT 'B', 'G' UNION ALL
SELECT 'F', 'D' UNION ALL
SELECT 'E', 'F';
-- Table to store paths
CREATE TABLE Paths (c_path VARCHAR(200) PRIMARY KEY);
This is the real hurdle, recalculating all paths
-- Recursive CTE to populate the paths
WITH PathCTE
AS
(SELECT c_from, c_to,
CAST('.' + CAST(c_from AS VARCHAR(8)) + '.' +
CAST(c_to AS VARCHAR(8)) + '.' AS VARCHAR(200)) AS c_path
FROM Contacts AS C1
UNION ALL
SELECT C.c_from, C.c_to,
CAST(P.c_path + C.c_to + '.' AS VARCHAR(200))
FROM PathCTE AS P
JOIN Contacts AS C
ON P.c_to = C.c_from
WHERE P.c_path NOT LIKE '%.' +
CAST(C.c_from AS VARCHAR(10)) +
'.' +
CAST(C.c_to AS VARCHAR(10)) +
'.%')
INSERT INTO Paths
SELECT c_path FROM PathCTE;
-- Show all paths between B and D
SELECT c_path
FROM Paths
WHERE c_path LIKE '.B.%'
AND c_path LIKE '%.D.';
-- Shortest path distance, longest path distance, and number of paths
SELECT MIN(LEN(c_path) - LEN(REPLACE(c_path, '.', '')) - 2) AS
shortest_distance,
MAX(LEN(c_path) - LEN(REPLACE(c_path, '.', '')) - 2) AS
longest_distance,
COUNT(*) AS paths_cnt
FROM Paths
WHERE c_path LIKE '.B.%'
AND c_path LIKE '%.D.';
Looking at the paths found:
.B.D.
.B.G.C.A.B.D.
.B.G.C.E.F.D.
You may notice the second path reused the first path to reach the
destination. But this could be desired or not, again hard to say with no
requirements. Easy to handle but did not bother...
As you can see calculating the distance is easy, but maintenance offsets
that. Based on your needs and model, if data is static this may do.
HTH,
Plamen Ratchev
http://www.SQLStudio.com
|
| Post Reply
|
| Re: reduce time for search query |
 |
Wed, 5 Mar 2008 02:18:20 -0800 |
On Feb 28, 8:51 am, "Plamen Ratchev" <Pla...@SQLStudio.com>
wrote:
> I can only repeat the same as Hugo and Najm, it is hard to say anything
> without seeing the actual table, indexes, sample data, required results,
and
> business requirements/rules. Reading all those posts in the link at
> sqlteam.com I can see only sample code provided by Peso (Peter Larsson).
The
> problem could be difficult or easy to solve, but hard to say without
seeing
> the real requirements.
>
> Perhaps some things to consider are using materialized path or nested sets
> to stored the hierarchy of contacts. Those models provide very efficient
> retrieval of distance between nodes info, but have more difficult methods
> for maintaining data.
>
> Here is just a small sample (using the sample table and data provided by
> Peso) on how materialized path may look:
>
> -- Sample table with data
> CREATE TABLE Contacts (
> c_from CHAR(1),
> c_to CHAR(1),
> PRIMARY KEY (c_from, c_to));
>
> INSERT INTO Contacts
> SELECT 'A', 'B' UNION ALL
> SELECT 'B', 'D' UNION ALL
> SELECT 'C', 'A' UNION ALL
> SELECT 'C', 'E' UNION ALL
> SELECT 'G', 'C' UNION ALL
> SELECT 'B', 'G' UNION ALL
> SELECT 'F', 'D' UNION ALL
> SELECT 'E', 'F';
>
> -- Table to store paths
> CREATE TABLE Paths (c_path VARCHAR(200) PRIMARY KEY);
>
> This is the real hurdle, recalculating all paths
>
> -- Recursive CTE to populate the paths
> WITH PathCTE
> AS
> (SELECT c_from, c_to,
> CAST('.' + CAST(c_from AS VARCHAR(8)) + '.' +
> CAST(c_to AS VARCHAR(8)) + '.' AS VARCHAR(200)) AS c_path
> FROM Contacts AS C1
> UNION ALL
> SELECT C.c_from, C.c_to,
> CAST(P.c_path + C.c_to + '.' AS VARCHAR(200))
> FROM PathCTE AS P
> JOIN Contacts AS C
> ON P.c_to = C.c_from
> WHERE P.c_path NOT LIKE '%.' +
> CAST(C.c_from AS VARCHAR(10)) +
> '.' +
> CAST(C.c_to AS VARCHAR(10)) +
> '.%')
> INSERT INTO Paths
> SELECT c_path FROM PathCTE;
>
> -- Show all paths between B and D
> SELECT c_path
> FROM Paths
> WHERE c_path LIKE '.B.%'
> AND c_path LIKE '%.D.';
>
> -- Shortest path distance, longest path distance, and number of paths
> SELECT MIN(LEN(c_path) - LEN(REPLACE(c_path, '.', '')) - 2) AS
> shortest_distance,
> MAX(LEN(c_path) - LEN(REPLACE(c_path, '.', '')) - 2) AS
> longest_distance,
> COUNT(*) AS paths_cnt
> FROM Paths
> WHERE c_path LIKE '.B.%'
> AND c_path LIKE '%.D.';
>
> Looking at the paths found:
> .B.D.
> .B.G.C.A.B.D.
> .B.G.C.E.F.D.
>
> You may notice the second path reused the first path to reach the
> destination. But this could be desired or not, again hard to say with no
> requirements. Easy to handle but did not bother...
>
> As you can see calculating the distance is easy, but maintenance offsets
> that. Based on your needs and model, if data is static this may do.
>
> HTH,
>
> Plamen Ratchevhttp://www.SQLStudio.com
Sorry for not having to replied earlier.
here is the code.
this is the query that gets fired;
SELECT Users.UserID, Users.FirstName + ' ' + Users.LastName as Name,
UserProfile.PropertyValue as Location,
professionalInfo.headline as Headline, industries.industryName as
Industry, professionalInfo.summary as Summary,
professionalInfo.interests, dbo.GetConnectionsCount(Users.UserID) AS
Connections,
dbo.GetRecommendations(Users.UserID) AS Recommendations,
dbo.fnCommonFriendsStep(Users.UserID, 36) AS Degree FROM Users
INNER JOIN UserProfile ON Users.UserID = UserProfile.UserID
INNER JOIN professionalInfo ON Users.UserID =
professionalInfo.memberId
INNER JOIN industries ON professionalInfo.primaryIndustry =
industries.industryId
WHERE (Users.UserID IN (SELECT DISTINCT UserID FROM vw_search ))
AND UserProfile.PropertyDefinitionID=29 AND
dbo.fnCommonFriendsStep(Users.UserID,36) >=0 ORDER By Degree ASC
fnCommonFriendstep one calculates relationship between the loggedin
member and other members.
if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].
[fnCommonFriendsStep]') and xtype in (N'FN', N'IF', N'TF'))
drop function [dbo].[fnCommonFriendsStep]
GO
SET QUOTED_IDENTIFIER ON
GO
SET ANSI_NULLS ON
GO
CREATE FUNCTION dbo.fnCommonFriendsStep
(
@Want1 INT,
@Want2 INT
)
RETURNS INT
AS
BEGIN
IF @Want1 = @Want2
RETURN 0
DECLARE @Friends TABLE (Generation INT, p INT)
DECLARE @Generation INT
SELECT @Generation = 0
INSERT @Friends
(
Generation,
p
)
SELECT 0,
memberId
FROM network
WHERE friendId = @Want1
UNION
SELECT 0,
friendId
FROM network
WHERE memberId = @Want1
WHILE NOT EXISTS (SELECT 1 FROM @Friends WHERE p = @Want2) AND
@Generation >= 0
BEGIN
SELECT @Generation = @Generation + 1
INSERT @Friends
(
Generation,
p
)
SELECT @Generation,
memberId
FROM network
WHERE friendId IN (SELECT p FROM @Friends WHERE Generation =
@Generation - 1)
AND memberId NOT IN (SELECT p FROM @Friends)
UNION
SELECT @Generation,
friendId
FROM network
WHERE memberId IN (SELECT p FROM @Friends WHERE Generation =
@Generation - 1)
AND friendId NOT IN (SELECT p FROM @Friends)
IF @@ROWCOUNT = 0
SELECT @Generation = -2
END
RETURN @Generation + 1
END
GO
SET QUOTED_IDENTIFIER OFF
GO
SET ANSI_NULLS ON
GO
GetRecommendations get the number of recommendations a member has
received.
if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].
[GetRecommendations]') and xtype in (N'FN', N'IF', N'TF'))
drop function [dbo].[GetRecommendations]
GO
SET QUOTED_IDENTIFIER ON
GO
SET ANSI_NULLS ON
GO
CREATE FUNCTION dbo.GetRecommendations
(
@user_id as int
)
RETURNS INT
AS
BEGIN
DECLARE @recomm INT
SELECT @recomm=Count(*) FROM endorsements WHERE memberId=@user_id AND
status='Accepted'
RETURN @recomm
END
GO
SET QUOTED_IDENTIFIER OFF
GO
SET ANSI_NULLS ON
GO
GetConnectionsCount fetches number of connections a member has
if exists (select * from dbo.sysobjects where id = object_id(N'[dbo].
[GetConnectionsCount]') and xtype in (N'FN', N'IF', N'TF'))
drop function [dbo].[GetConnectionsCount]
GO
SET QUOTED_IDENTIFIER ON
GO
SET ANSI_NULLS ON
GO
CREATE FUNCTION dbo.GetConnectionsCount
(
@user_id as int
)
RETURNS INT
AS
BEGIN
DECLARE @recCount INT
SELECT @recCount=COUNT(*) FROM network WHERE (memberId=@user_id OR
friendId=@user_id) AND status=1
RETURN @recCount
END
GO
SET QUOTED_IDENTIFIER OFF
GO
SET ANSI_NULLS ON
GO
Please do let me know if you need any more information.
|
| Post Reply
|
| Re: reduce time for search query |
 |
Thu, 6 Mar 2008 17:43:22 -0500 |
I do not see tables structure, indexes, the view definition, sample data,
expected results...
Here is a quick attempt to improve this. Check the you have correct indexes
on join and filter columns where applicable.
In the main query below I removed two of the functions and used sub-queries.
Not sure that will help but was easier for me to see the logic. Changed the
main function to be table valued (you can go a step further here to make it
table valued for all users and join by user, I did not want to dig into the
logic there) so there will be no two calls in SELECT and WHERE, and some
other cosmetic changes. Note that this is untested.
SELECT U.UserID,
U.FirstName + ' ' + U.LastName AS 'Name',
P.PropertyValue AS Location,
N.headline AS Headline,
I.industryName AS Industry,
professionalInfo.summary AS Summary,
professionalInfo.interests,
(SELECT COUNT(*)
FROM network AS N
WHERE (N.memberId = U.UserID
OR N.friendId = U.UserID)
AND status = 1) AS Connections,
(SELECT COUNT(*)
FROM endorsements AS E
WHERE E.memberId = U.UserID
AND status = 'Accepted') AS Recommendations,
F.generation AS Degree
FROM Users AS U
JOIN UserProfile AS P
ON U.UserID = P.UserID
JOIN professionalInfo AS N
ON U.UserID = N.memberId
JOIN industries AS I
ON N.primaryIndustry = I.industryId
CROSS APPLY dbo.fnCommonFriendsStep(U.UserID, 36) AS F
WHERE EXISTS (SELECT *
FROM vw_search AS S
WHERE S.UserID = U.UserID)
AND P.PropertyDefinitionID = 29
AND F.generation >= 0
ORDER By Degree ASC
And here is the changed function:
CREATE FUNCTION dbo.fnCommonFriendsStep
(@member INT, @friend INT)
RETURNS @FriendsDegree
TABLE (generation INT DEFAULT 0)
AS
BEGIN
INSERT INTO @FriendsDegree DEFAULT VALUES
IF @member = @friend
RETURN
DECLARE @Friends TABLE (generation INT, p INT)
DECLARE @generation INT
SET @generation = 0
INSERT @Friends (generation, p)
SELECT 0, memberId
FROM Network
WHERE friendId = @member
UNION
SELECT 0, friendId
FROM Network
WHERE memberId = @member
WHILE NOT EXISTS (SELECT 1 FROM @Friends WHERE p = @friend)
AND @generation >= 0
BEGIN
SELECT @generation = @generation + 1
INSERT @Friends (generation, p)
SELECT @generation, memberId
FROM Network AS N
WHERE EXISTS (SELECT *
FROM @Friends AS F
WHERE N.friendid = F.p
AND F.generation = @generation - 1)
AND NOT EXISTS (SELECT *
FROM @Friends AS F
WHERE N.memberid = F.p)
UNION
SELECT @generation, friendId
FROM Network AS N
WHERE EXISTS (SELECT *
FROM @Friends AS F
WHERE N.memberid = F.p
AND F.generation = @generation - 1)
AND NOT EXISTS (SELECT *
FROM @Friends AS F
WHERE N.friendid = F.p)
IF @@ROWCOUNT = 0
SELECT @generation = -2
END
UPDATE @FriendsDegree
SET generation = @generation + 1
RETURN
END
HTH,
Plamen Ratchev
http://www.SQLStudio.com
|
| Post Reply
|
|
|